cs.AI（2025-05-20）

📊 共 45 篇论文 | 🔗 5 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (31 🔗3) 支柱二：RL算法与架构 (RL & Architecture) (10 🔗1) 支柱一：机器人控制 (Robot Control) (3) 支柱八：物理动画 (Physics-based Animation) (1 🔗1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (31 篇)

#	题目	一句话要点	标签	🔗
1	Large Language Model-Driven Distributed Integrated Multimodal Sensing and Semantic Communications	提出LLM驱动的分布式集成多模态感知与语义通信框架，提升复杂环境下的感知精度。	large language model multimodal
2	Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models	提出基于多模态RAG的增材制造异常检测与分类框架，无需训练数据。	large language model multimodal
3	Towards a Foundation Model for Communication Systems	提出面向通信系统的Transformer基础模型，实现多特征估计。	foundation model multimodal
4	Debating for Better Reasoning: An Unsupervised Multimodal Approach	提出一种无监督多模态辩论框架，提升视觉问答模型推理能力。	large language model multimodal
5	Large Language Model Powered Decision Support for a Metal Additive Manufacturing Knowledge Graph	提出基于大语言模型的金属增材制造知识图谱决策支持系统，实现自然语言交互。	large language model
6	SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment	提出SAFEPATH以解决大型推理模型的安全性问题	chain-of-thought
7	Can Large Language Models Really Recognize Your Name?	揭示大语言模型在识别个人姓名方面存在的系统性缺陷，并提出评估基准AMBENCH。	large language model
8	Guarded Query Routing for Large Language Models	提出GQR-Bench基准测试，并研究了LLM在安全查询路由中的有效性和效率。	large language model	✅
9	Toward Embodied AGI: A Review of Embodied AI and the Road Ahead	提出具身通用人工智能（AGI）分级框架，并展望高阶机器人大脑设计	embodied AI
10	Multimodal Mixture of Low-Rank Experts for Sentiment Analysis and Emotion Recognition	提出多模态低秩专家混合模型MMoLRE，用于情感分析和情绪识别的多任务学习。	multimodal
11	The Multimodal Information Based Speech Processing (MISP) 2025 Challenge: Audio-Visual Diarization and Recognition	MISP 2025挑战赛：提出基于多模态信息的会议场景音视频说话人分离与识别方案	multimodal
12	Agent Context Protocols Enhance Collective Inference	提出Agent Context Protocols (ACPs)以增强多智能体系统的集体推理能力	generalist agent multimodal
13	MLZero: A Multi-Agent System for End-to-end Machine Learning Automation	MLZero：基于LLM的多智能体系统，实现端到端多模态机器学习自动化	large language model multimodal
14	Reasoning Models Better Express Their Confidence	思维链推理模型能更准确地表达其置信度	large language model chain-of-thought
15	ProMind-LLM: Proactive Mental Health Care via Causal Reasoning with Sensor Data	ProMind-LLM：利用因果推理与传感器数据实现主动式心理健康护理	large language model chain-of-thought
16	DrugPilot: LLM-based Parameterized Reasoning Agent for Drug Discovery	DrugPilot：基于LLM的参数化推理Agent，用于药物发现	large language model multimodal
17	Towards Embodied Cognition in Robots via Spatially Grounded Synthetic Worlds	提出基于空间感知的合成数据集，用于训练机器人视觉语言模型以实现具身认知。	embodied AI
18	Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training	RICE：无需额外训练，仅需两位专家即可引导MoE推理模型进行更有效的思考	instruction following
19	JARVIS: A Multi-Agent Code Assistant for High-Quality EDA Script Generation	JARVIS：用于高质量EDA脚本生成的多智能体代码助手	large language model
20	SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas	SATBench：通过SAT公式自动生成逻辑谜题，评估LLM的逻辑推理能力	large language model	✅
21	Balanced and Elastic End-to-end Training of Dynamic LLMs	提出DynMo，实现动态LLM训练的负载均衡与弹性伸缩，提升训练效率。	large language model
22	ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions	ContextAgent：提出一种利用开放世界感知的上下文感知主动LLM Agent。	large language model	✅
23	Cost-Awareness in Tree-Search LLM Planning: A Systematic Study	系统研究树搜索LLM规划器的成本意识，揭示其在资源约束下的规划能力不足	large language model
24	From nuclear safety to LLM security: Applying non-probabilistic risk management strategies to build safe and secure LLM-powered systems	借鉴核安全等领域经验，提出非概率风险管理策略以提升LLM系统安全性	large language model
25	Towards Reliable Proof Generation with LLMs: A Neuro-Symbolic Approach	提出神经符号方法，提升LLM在几何证明生成中的可靠性	large language model
26	Choosing a Model, Shaping a Future: Comparing LLM Perspectives on Sustainability and its Relationship with AI	评估LLM在可持续性问题上的偏差，揭示模型选择对组织策略的影响	large language model
27	Knowledge Graph Based Repository-Level Code Generation	提出基于知识图谱的代码仓库级代码生成方法，提升上下文准确性。	large language model
28	SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation	提出SCAN：一种语义文档布局分析方法，提升文本和视觉RAG性能	large language model
29	Divide by Question, Conquer by Agent: SPLIT-RAG with Question-Driven Graph Partitioning	提出SPLIT-RAG，通过问题驱动的图划分提升大规模知识图谱上的RAG效率与准确性。	large language model
30	RAG/LLM Augmented Switching Driven Polymorphic Metaheuristic Framework	提出RAG/LLM增强的切换驱动多态元启发式框架，提升复杂优化问题求解效率。	multimodal
31	LLM-based Evaluation Policy Extraction for Ecological Modeling	提出基于LLM的生态建模评估策略提取框架，提升模型评估的解释性和自动化程度	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (10 篇)

#	题目	一句话要点	标签	🔗
32	Reinforcement Learning from User Feedback	提出RLUF框架，利用用户隐式反馈直接对齐LLM，提升用户满意度。	reinforcement learning RLHF large language model
33	Reinforcement Learning vs. Distillation: Understanding Accuracy and Capability in LLM Reasoning	对比强化学习与蒸馏，揭示LLM推理能力提升的差异化机制	reinforcement learning distillation
34	Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds	Causal Cartographer：构建因果知识图谱，提升LLM在反事实推理中的鲁棒性	world model large language model foundation model
35	DSMentor: Enhancing Data Science Agents with Curriculum Learning and Online Knowledge Accumulation	DSMentor：利用课程学习和在线知识积累增强数据科学Agent能力	curriculum learning large language model
36	SHARP: Synthesizing High-quality Aligned Reasoning Problems for Large Reasoning Models Reinforcement Learning	SHARP：合成高质量对齐推理问题，用于强化学习训练大型推理模型	reinforcement learning chain-of-thought
37	RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning	RLoT：利用推理时强化学习引导LLM进行复杂推理	reinforcement learning large language model
38	Visual Instruction Bottleneck Tuning	提出Visual Instruction Bottleneck Tuning (Vittle)，提升多模态大语言模型在分布偏移下的泛化性和鲁棒性。	representation learning large language model multimodal
39	Embedded Mean Field Reinforcement Learning for Perimeter-defense Game	提出嵌入式均值场强化学习框架EMFAC，解决复杂三维环境下的无人机防御问题。	reinforcement learning representation learning
40	PRL: Prompts from Reinforcement Learning	提出基于强化学习的提示生成方法PRL，提升大语言模型在多任务上的性能。	reinforcement learning	✅
41	TelePlanNet: An AI-Driven Framework for Efficient Telecom Network Planning	TelePlanNet：AI驱动的电信网络高效规划框架，提升基站选址一致性。	reinforcement learning large language model

🔬 支柱一：机器人控制 (Robot Control) (3 篇)

#	题目	一句话要点	标签
42	Self-Evolving Curriculum for LLM Reasoning	提出自适应课程学习（SEC）方法，提升LLM在推理任务中强化学习微调的性能。	dual-arm reinforcement learning curriculum learning
43	EVA: Red-Teaming GUI Agents via Evolving Indirect Prompt Injection	EVA：通过演化间接提示注入进行GUI代理的红队测试	manipulation multimodal
44	Transductively Informed Inductive Program Synthesis	提出TIIPS框架，通过协同机制融合归纳与转导推理，提升程序合成的准确性和泛化性。	manipulation

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
45	AudioJailbreak: Jailbreak Attacks against End-to-End Large Audio-Language Models	提出AudioJailbreak，一种针对端到端大型音频语言模型的异步、通用、隐蔽且具有空中鲁棒性的对抗性攻击。	PULSE	✅

⬅️ 返回 cs.AI 首页 · 🏠 返回主页

cs.AI（2025-05-20）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (31 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (10 篇)

🔬 支柱一：机器人控制 (Robot Control) (3 篇)

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理