cs.AI (2025-05-13)

📊 34 papers in total | 🔗 2 with code

🎯 Interest area navigation

Pillar 9: Embodied Foundation Models (20 🔗1) · Pillar 2: RL Algorithms & Architecture (12 🔗1) · Pillar 1: Robot Control (2)

🔬 Pillar 9: Embodied Foundation Models (20 papers)

#	Title	One-sentence summary	Tags	🔗
1	Ultrasound Report Generation with Multimodal Large Language Models for Standardized Texts	Proposes a multimodal large language model for ultrasound report generation	large language model multimodal
2	Decoding Neighborhood Environments with Large Language Models	Uses large language models to decode neighborhood environments and improve health assessment	large language model
3	Optimized Couplings for Watermarking Large Language Models	Proposes optimized couplings to improve watermarking of large language models	large language model	✅
4	DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models	Proposes the DeepMath-Creative benchmark to evaluate the mathematical creativity of large language models	large language model
5	CellTypeAgent: Trustworthy cell type annotation with Large Language Models	Proposes CellTypeAgent to address trustworthiness in cell type annotation	large language model
6	The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News	Proposes the TruEDebate system to address interpretability and effectiveness in fake news detection	large language model
7	Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People	Proposes optimizing speech foundation models to improve speech intelligibility prediction for hearing-impaired people	foundation model
8	Federated Large Language Models: Feasibility, Robustness, Security and Future Directions	Proposes federated large language models to address privacy protection and data silos	large language model
9	TrialMatchAI: An End-to-End AI-powered Clinical Trial Recommendation System to Streamline Patient-to-Trial Matching	Proposes TrialMatchAI to address patient-to-trial matching for clinical trials	large language model chain-of-thought
10	Lost in Transmission: When and Why LLMs Fail to Reason Globally	Proposes the BAPO model to address the limitations of LLMs in complex reasoning	large language model chain-of-thought
11	Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification	Combines CoT, RAG, and related methods to reduce LLM hallucinations	large language model chain-of-thought
12	Tests as Prompt: A Test-Driven-Development Benchmark for LLM Code Generation	Proposes the WebApp1K benchmark to evaluate LLMs in test-driven development	large language model instruction following
13	AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques	Surveys applications of AI and generative AI in disaster management to improve damage assessment efficiency	multimodal
14	AI-Mediated Code Comment Improvement	Proposes an AI-based method for improving code comments to raise code quality	large language model
15	Securing RAG: A Risk Assessment and Mitigation Framework	Proposes a risk assessment and mitigation framework for RAG to address security risks	large language model
16	VizCV: AI-assisted visualization of researchers' publications tracks	Proposes VizCV to address the analysis of researchers' publication records	large language model
17	Resource-Efficient Language Models: Quantization for Fast and Accessible Inference	Proposes post-training quantization to improve the inference efficiency of large language models	large language model
18	Evaluating LLM Metrics Through Real-World Capabilities	Proposes a new method for evaluating LLM performance based on real-world capabilities	large language model
19	Communication Styles and Reader Preferences of LLM and Human Experts in Explaining Health Information	Studies differences in communication style between LLMs and human experts when explaining health information	large language model
20	Leveraging AI for Productive and Trustworthy HPC Software: Challenges and Research Directions	Proposes AI techniques to address challenges in high-performance computing software development	large language model

🔬 Pillar 2: RL Algorithms & Architecture (12 papers)

#	Title	One-sentence summary	Tags	🔗
21	Foundation Models Knowledge Distillation For Battery Capacity Degradation Forecast	Proposes foundation model knowledge distillation for battery capacity degradation forecasting	teacher-student distillation foundation model
22	Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections	Proposes a deep-reinforcement-learning-based longitudinal control strategy for automated vehicles at signalized intersections	reinforcement learning deep reinforcement learning DRL
23	Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control	Proposes Monte Carlo beam search to improve policy learning in continuous control	reinforcement learning policy learning PPO
24	Strategy-Augmented Planning for Large Language Models via Opponent Exploitation	Proposes strategy-augmented planning to address opponent modeling	reinforcement learning large language model	✅
25	Learning Like Humans: Advancing LLM Reasoning Capabilities via Adaptive Difficulty Curriculum Learning and Expert-Guided Self-Reformulation	Proposes adaptive difficulty curriculum learning and expert-guided self-reformulation to improve LLM reasoning	reinforcement learning imitation learning curriculum learning
26	Improved Algorithms for Differentially Private Language Model Alignment	Proposes privacy-preserving language model alignment algorithms to protect user data privacy	reinforcement learning RLHF DPO
27	Deep Reinforcement Learning for Power Grid Multi-Stage Cascading Failure Mitigation	Proposes a deep reinforcement learning method to mitigate multi-stage cascading failures in power grids	reinforcement learning deep reinforcement learning
28	Enhancing Aerial Combat Tactics through Hierarchical Multi-Agent Reinforcement Learning	Proposes a hierarchical multi-agent reinforcement learning framework to optimize aerial combat tactics	reinforcement learning deep reinforcement learning
29	Hyperbolic Contrastive Learning with Model-augmentation for Knowledge-aware Recommendation	Proposes hyperbolic contrastive learning with model augmentation for knowledge-aware recommendation	preference learning contrastive learning
30	Automated Meta Prompt Engineering for Alignment with the Theory of Mind	Proposes automated meta prompt engineering for alignment with the theory of mind	reinforcement learning large language model
31	A Mamba-based Network for Semi-supervised Singing Melody Extraction Using Confidence Binary Regularization	Proposes SpectMamba for semi-supervised singing melody extraction	Mamba
32	A Study of Data-driven Methods for Inventory Optimization	Proposes data-driven methods to optimize supermarket inventory management	reinforcement learning deep reinforcement learning

🔬 Pillar 1: Robot Control (2 papers)

#	Title	One-sentence summary	Tags	🔗
33	Modeling Unseen Environments with Language-guided Composable Causal Components in Reinforcement Learning	Proposes the WM3C framework to address generalization in reinforcement learning	manipulation reinforcement learning policy learning
34	A Survey of Deep Learning for Complex Speech Spectrograms	Surveys applications and challenges of deep learning for complex speech spectrograms	manipulation
