cs.AI（2026-02-03）

📊 共 35 篇论文 | 🔗 4 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (23 🔗3) 支柱二：RL算法与架构 (RL & Architecture) (7 🔗1) 支柱一：机器人控制 (Robot Control) (3) 支柱八：物理动画 (Physics-based Animation) (2)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (23 篇)

#	题目	一句话要点	标签	🔗
1	Towards Considerate Embodied AI: Co-Designing Situated Multi-Site Healthcare Robots from Abstract Concepts to High-Fidelity Prototypes	通过共创设计，为多场景医疗机器人打造更周到的具身AI	embodied AI	✅
2	Agentic Proposing: Enhancing Large Language Model Reasoning via Compositional Skill Synthesis	提出Agentic Proposing框架，通过组合技能合成高质量数据，提升大语言模型推理能力。	large language model
3	An Empirical Study of Collective Behaviors and Social Dynamics in Large Language Model Agents	研究大型语言模型Agent的集体行为与社会动态，提出CoST方法抑制有害信息发布	large language model
4	VALUEFLOW: Toward Pluralistic and Steerable Value-based Alignment in Large Language Models	提出VALUEFLOW以解决大语言模型价值对齐问题	large language model
5	De-conflating Preference and Qualification: Constrained Dual-Perspective Reasoning for Job Recommendation with Large Language Models	JobRec：通过约束双视角推理，解耦偏好与资格，用于LLM驱动的职位推荐	large language model
6	Large Language Models Can Take False First Steps at Inference-time Planning	大型语言模型在推理时规划中存在虚假先验步骤问题	large language model
7	CSR-Bench: A Benchmark for Evaluating the Cross-modal Safety and Reliability of MLLMs	提出CSR-Bench，用于评估多模态大语言模型跨模态安全性和可靠性	large language model multimodal
8	Are LLMs Biased Like Humans? Causal Reasoning as a Function of Prior Knowledge, Irrelevant Information, and Reasoning Budget	评估大型语言模型的因果推理与人类偏见的关系	large language model chain-of-thought
9	Conformal Thinking: Risk Control for Reasoning on a Compute Budget	提出基于风险控制的自适应推理框架，优化大语言模型在计算预算下的推理。	large language model
10	DiscoverLLM: From Executing Intents to Discovering Them	提出DiscoverLLM框架，通过意图发现提升LLM在开放式任务中的交互性能。	large language model
11	Methods and Open Problems in Differentiable Social Choice: Learning Mechanisms, Decisions, and Alignment	综述可微社会选择方法：学习机制、决策与对齐，并提出开放性问题。	large language model
12	Persona Generators: Generating Diverse Synthetic Personas at Scale	提出Persona Generators，利用进化算法生成多样化合成角色，用于AI系统评估。	large language model
13	When Routing Collapses: On the Degenerate Convergence of LLM Routers	提出EquiRouter以解决LLM路由中的退化收敛问题，提升成本效益。	multimodal	✅
14	Ontology-to-tools compilation for executable semantic constraint enforcement in LLM agents	提出本体到工具的编译方法，用于LLM Agent中可执行的语义约束强化	large language model
15	Precision in Practice: Knowledge Guided Code Summarizing Grounded in Industrial Expectations	ExpSum：结合工业期望的知识引导代码摘要生成方法	large language model
16	The Necessity of a Unified Framework for LLM-Based Agent Evaluation	提出LLM Agent统一评估框架，解决评估标准不一致问题	large language model
17	Beyond Quantity: Trajectory Diversity Scaling for Code Agents	TDScaling：通过轨迹多样性提升代码智能体性能，突破数量 scaling 瓶颈	large language model
18	Internet of Agentic AI: Incentive-Compatible Distributed Teaming and Workflow	提出Internet of Agentic AI框架，实现可扩展的Agentic AI分布式协作与工作流。	large language model
19	Understanding Multi-Agent LLM Frameworks: A Unified Benchmark and Experimental Analysis	提出MAFBench，用于系统评估多智能体LLM框架架构对性能的影响	large language model
20	Digital Lifelong Learning in the Age of AI: Trends and Insights	分析AI时代终身数字学习趋势，揭示学习动机与平台优化策略	large language model
21	Risky-Bench: Probing Agentic Safety Risks under Real-World Deployment	提出Risky-Bench以解决现实环境中代理安全风险评估问题	large language model
22	MAS-ProVe: Understanding the Process Verification of Multi-Agent Systems	MAS-ProVe：系统性研究多智能体系统过程验证的有效性与挑战	large language model	✅
23	RC-GRPO: Reward-Conditioned Group Relative Policy Optimization for Multi-Turn Tool Calling Agents	提出RC-GRPO，通过奖励调节提升多轮工具调用Agent的性能	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (7 篇)

#	题目	一句话要点	标签	🔗
24	EHRWorld: A Patient-Centric Medical World Model for Long-Horizon Clinical Trajectories	EHRWorld：面向长期临床轨迹的以患者为中心的医疗世界模型	world model large language model
25	IntentRL: Training Proactive User-intent Agents for Open-ended Deep Research via Reinforcement Learning	提出IntentRL以解决深度研究代理的用户意图澄清问题	reinforcement learning large language model
26	Mitigating Conversational Inertia in Multi-Turn Agents	提出上下文偏好学习，缓解多轮Agent对话中的惯性问题	preference learning large language model
27	General Agents Contain World Models, even under Partial Observability and Stochasticity	证明通用智能体即使在部分可观测和随机环境下也包含世界模型	world model
28	STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models	STAR：面向超小型函数调用模型的相似性引导的教师辅助精炼	distillation large language model
29	Distilling LLM Reasoning into Graph of Concept Predictors	提出概念预测图（GCP）框架，用于将LLM推理能力蒸馏到小型判别模型中，提升效率和可解释性。	distillation large language model	✅
30	Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration	提出Search-R2以解决搜索集成推理中的多尺度信用分配问题	reinforcement learning reward design

🔬 支柱一：机器人控制 (Robot Control) (3 篇)

#	题目	一句话要点	标签
31	CRL-VLA: Continual Vision-Language-Action Learning	提出CRL-VLA框架，解决具身智能体持续视觉-语言-动作学习中的稳定性-可塑性难题	manipulation dexterous manipulation reinforcement learning
32	Risk Awareness Injection: Calibrating Vision-Language Models for Safety without Compromising Utility	提出风险感知注入（RAI），提升视觉语言模型安全性且不损失性能	manipulation large language model multimodal
33	Can LLMs Do Rocket Science? Exploring the Limits of Complex Reasoning with GTOC 12	评估LLM在复杂航天任务中的能力：GTOC 12挑战	trajectory optimization large language model

🔬 支柱八：物理动画 (Physics-based Animation) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
34	Morphe: High-Fidelity Generative Video Streaming with Vision Foundation Model	提出Morphe以解决视频流传输中的质量与延迟问题	spatiotemporal foundation model
35	Adaptive Evidence Weighting for Audio-Spatiotemporal Fusion	提出FINCH框架以解决生物声学分类中的证据融合问题	spatiotemporal

⬅️ 返回 cs.AI 首页 · 🏠 返回主页

cs.AI（2026-02-03）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (23 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (7 篇)

🔬 支柱一：机器人控制 (Robot Control) (3 篇)

🔬 支柱八：物理动画 (Physics-based Animation) (2 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理