cs.AI（2026-05-01）

📊 共 18 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (9 🔗1) 支柱二：RL算法与架构 (RL & Architecture) (7 🔗2) 支柱一：机器人控制 (Robot Control) (2)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (9 篇)

#	题目	一句话要点	标签	🔗	⭐
1	EASE: Federated Multimodal Unlearning via Entanglement-Aware Anchor Closure	EASE：通过解耦感知锚点闭合实现联邦多模态可遗忘学习	multimodal
2	Empowering Heterogeneous Graph Foundation Models via Decoupled Relation Alignment	提出解耦关系子空间对齐(DRSA)框架，提升异构图基础模型跨域知识迁移能力。	foundation model	✅
3	Can Coding Agents Reproduce Findings in Computational Materials Science?	AutoMat：评估LLM智能体在计算材料科学中重现科研结果能力的基准	large language model foundation model
4	LLM-Oriented Information Retrieval: A Denoising-First Perspective	提出面向LLM的信息检索框架，强调去噪以提升检索增强生成质量。	large language model multimodal
5	Social Bias in LLM-Generated Code: Benchmark and Mitigation	提出 Fairness Monitor Agent (FMA) 以缓解 LLM 生成代码中的社会偏见，并提升代码正确性。	large language model chain-of-thought
6	Silicon Showdown: Performance, Efficiency, and Ecosystem Barriers in Consumer-Grade LLM Inference	分析消费级硬件上LLM推理的性能、效率和生态壁垒，揭示Nvidia和Apple Silicon的权衡。	large language model
7	Space Network of Experts: Architecture and Expert Placement	提出Space-XNet框架，解决星载网络中MoE模型的高效分布式部署问题	large language model
8	Skills as Verifiable Artifacts: A Trust Schema and a Biconditional Correctness Criterion for Human-in-the-Loop Agent Runtimes	提出一种基于可验证工件的技能信任模式，用于人机协作Agent运行时环境。	large language model
9	AgentFloor: How Far Up the tool use Ladder Can Small Open-Weight Models Go?	AgentFloor：评估小型开源模型在工具使用Agent中能力的阶梯式基准	instruction following

🔬 支柱二：RL算法与架构 (RL & Architecture) (7 篇)

#	题目	一句话要点	标签	🔗	⭐
10	Physically Native World Models: A Hamiltonian Perspective on Generative World Modeling	提出 Hamiltonian World Models，提升具身智能体物理可靠性和长期预测稳定性。	reinforcement learning world model world models
11	GaMMA: Towards Joint Global-Temporal Music Understanding in Large Multimodal Models	GaMMA：面向联合全局-时序音乐理解的大型多模态模型	reinforcement learning multimodal
12	Towards Improving Speaker Distance Estimation through Generative Impulse Response Augmentation	利用生成式脉冲响应增强提升说话人距离估计精度	MAE PULSE
13	Improving LLM Code Generation via Requirement-Aware Curriculum Reinforcement Learning	提出RECRL框架，通过需求感知的课程强化学习提升LLM代码生成能力。	reinforcement learning large language model
14	AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning	提出AEM自适应熵调制方法，解决多轮Agent强化学习中的信用分配难题。	reinforcement learning large language model
15	Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding	提出GUI-SD，用于GUI元素定位的On-Policy自蒸馏框架	reinforcement learning distillation	✅
16	DynamicPO: Dynamic Preference Optimization for Recommendation	DynamicPO：动态偏好优化，解决LLM推荐系统中负样本过多导致的性能退化问题	DPO direct preference optimization large language model	✅

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
17	Thinking in Text and Images: Interleaved Vision--Language Reasoning Traces for Long-Horizon Robot Manipulation	提出交错视觉-语言推理（IVLR）框架，用于长时程机器人操作任务。	manipulation vision-language-action multimodal
18	Linking Behaviour and Perception to Evaluate Meaningful Human Control over Partially Automated Driving	提出评估部分自动驾驶中人类控制的框架以解决责任与控制的矛盾	shared control

⬅️ 返回 cs.AI 首页 · 🏠 返回主页