cs.AI (2025-05-13)

📊 34 papers in total | 🔗 2 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (20 🔗1) · Pillar 2: RL & Architecture (12 🔗1) · Pillar 1: Robot Control (2)

🔬 Pillar 9: Embodied Foundation Models (20 papers)

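| # | Title | One-sentence summary | Tags |
|---|---|---|---|
| 1 | Ultrasound Report Generation with Multimodal Large Language Models for Standardized Texts | Proposes a multimodal large language model for ultrasound report generation | large language model, multimodal |
| 2 | Decoding Neighborhood Environments with Large Language Models | Uses large language models to decode neighborhood environments and improve health assessment | large language model |
| 3 | Optimized Couplings for Watermarking Large Language Models | Proposes optimized couplings to improve watermarking of large language models | large language model |
| 4 | DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models | Proposes the DeepMath-Creative benchmark to evaluate the mathematical creativity of large language models | large language model |
| 5 | CellTypeAgent: Trustworthy cell type annotation with Large Language Models | Proposes CellTypeAgent to address trustworthiness in cell type annotation | large language model |
| 6 | The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News | Proposes the TruEDebate system to address explainability and effectiveness in fake news detection | large language model |
| 7 | Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People | Proposes optimizing speech foundation models to improve speech intelligibility prediction for hearing-impaired people | foundation model |
| 8 | Federated Large Language Models: Feasibility, Robustness, Security and Future Directions | Proposes federated large language models to address privacy protection and data silos | large language model |
| 9 | TrialMatchAI: An End-to-End AI-powered Clinical Trial Recommendation System to Streamline Patient-to-Trial Matching | Proposes TrialMatchAI to address patient-to-trial matching for clinical trials | large language model, chain-of-thought |
| 10 | Lost in Transmission: When and Why LLMs Fail to Reason Globally | Proposes the BAPO model to address the limitations of LLMs in complex reasoning | large language model, chain-of-thought |
| 11 | Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification | Combines CoT, RAG, and related methods to reduce LLM hallucinations | large language model, chain-of-thought |
| 12 | Tests as Prompt: A Test-Driven-Development Benchmark for LLM Code Generation | Proposes the WebApp1K benchmark to evaluate LLM performance in test-driven development | large language model, instruction following |
| 13 | AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques | Surveys applications of AI and generative AI in disaster management to improve damage assessment efficiency | multimodal |
| 14 | AI-Mediated Code Comment Improvement | Proposes an AI-based method for improving code comments to raise code quality | large language model |
| 15 | Securing RAG: A Risk Assessment and Mitigation Framework | Proposes a risk assessment and mitigation framework for RAG to address security risks | large language model |
| 16 | VizCV: AI-assisted visualization of researchers' publications tracks | Proposes VizCV for analyzing researchers' publication records | large language model |
| 17 | Resource-Efficient Language Models: Quantization for Fast and Accessible Inference | Proposes post-training quantization techniques to improve the inference efficiency of large language models | large language model |
| 18 | Evaluating LLM Metrics Through Real-World Capabilities | Proposes a new method for evaluating LLM performance based on real-world capabilities | large language model |
| 19 | Communication Styles and Reader Preferences of LLM and Human Experts in Explaining Health Information | Studies differences in communication style between LLMs and human experts when explaining health information | large language model |
| 20 | Leveraging AI for Productive and Trustworthy HPC Software: Challenges and Research Directions | Proposes AI techniques to address challenges in high-performance computing software development | large language model |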

🔬 Pillar 2: RL & Architecture (12 papers)

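| # | Title | One-sentence summary | Tags |
|---|---|---|---|
| 21 | Foundation Models Knowledge Distillation For Battery Capacity Degradation Forecast | Proposes foundation model knowledge distillation for battery capacity degradation forecasting | teacher-student distillation, foundation model |
| 22 | Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections | Proposes a deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalized intersections | reinforcement learning, deep reinforcement learning, DRL |
| 23 | Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control | Proposes Monte Carlo beam search to improve policy learning in continuous control | reinforcement learning, policy learning, PPO |
| 24 | Strategy-Augmented Planning for Large Language Models via Opponent Exploitation | Proposes strategy-augmented planning to address opponent modeling | reinforcement learning, large language model |
| 25 | Learning Like Humans: Advancing LLM Reasoning Capabilities via Adaptive Difficulty Curriculum Learning and Expert-Guided Self-Reformulation | Proposes adaptive difficulty curriculum learning and expert-guided self-reformulation to improve LLM reasoning | reinforcement learning, imitation learning, curriculum learning |
| 26 | Improved Algorithms for Differentially Private Language Model Alignment | Proposes privacy-preserving language model alignment algorithms to protect user data privacy | reinforcement learning, RLHF, DPO |
| 27 | Deep Reinforcement Learning for Power Grid Multi-Stage Cascading Failure Mitigation | Proposes a deep reinforcement learning method to mitigate multi-stage cascading failures in power grids | reinforcement learning, deep reinforcement learning |
| 28 | Enhancing Aerial Combat Tactics through Hierarchical Multi-Agent Reinforcement Learning | Proposes a hierarchical multi-agent reinforcement learning framework to optimize aerial combat tactics | reinforcement learning, deep reinforcement learning |
| 29 | Hyperbolic Contrastive Learning with Model-augmentation for Knowledge-aware Recommendation | Proposes hyperbolic contrastive learning with model augmentation for knowledge-aware recommendation | preference learning, contrastive learning |
| 30 | Automated Meta Prompt Engineering for Alignment with the Theory of Mind | Proposes automated meta prompt engineering for alignment with the theory of mind | reinforcement learning, large language model |
| 31 | A Mamba-based Network for Semi-supervised Singing Melody Extraction Using Confidence Binary Regularization | Proposes SpectMamba for semi-supervised singing melody extraction | Mamba |
| 32 | A Study of Data-driven Methods for Inventory Optimization | Proposes data-driven methods to optimize supermarket inventory management | reinforcement learning, deep reinforcement learning |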

🔬 Pillar 1: Robot Control (2 papers)

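| # | Title | One-sentence summary | Tags |
|---|---|---|---|
| 33 | Modeling Unseen Environments with Language-guided Composable Causal Components in Reinforcement Learning | Proposes the WM3C framework to address generalization in reinforcement learning | manipulation, reinforcement learning, policy learning |
| 34 | A Survey of Deep Learning for Complex Speech Spectrograms | Surveys applications and challenges of deep learning for processing complex speech spectrograms | manipulation |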
