cs.AI(2026-03-24)
📊 共 22 篇论文 | 🔗 3 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (15 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (6 🔗1)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (15 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 16 | CoMaTrack: Competitive Multi-Agent Game-Theoretic Tracking with Vision-Language-Action Models | 提出CoMaTrack:基于竞争博弈的多智能体视觉-语言-动作跟踪框架 | reinforcement learning imitation learning vision-language-action | ✅ | |
| 17 | Improving Safety Alignment via Balanced Direct Preference Optimization | 提出B-DPO,通过平衡偏好优化解决LLM安全对齐中的过拟合问题 | reinforcement learning RLHF DPO | ||
| 18 | Describe-Then-Act: Proactive Agent Steering via Distilled Language-Action World Models | 提出Dillo,通过蒸馏语言-动作世界模型实现主动Agent控制。 | world model distillation large language model | ||
| 19 | MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation | MemCollab:通过对比轨迹蒸馏实现跨Agent的记忆协同 | distillation large language model | ||
| 20 | Dynamical Systems Theory Behind a Hierarchical Reasoning Model | 提出基于连续动力系统的Contraction Mapping Model,解决复杂推理任务中递归网络训练不稳定的问题。 | latent dynamics large language model | ||
| 21 | Evaluating LLM-Based Test Generation Under Software Evolution | 评估软件演化下基于LLM的测试生成:揭示其对语义变化的敏感性 | SAC large language model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 22 | Chain-of-Authorization: Internalizing Authorization into Large Language Models via Reasoning Trajectories | 提出Chain-of-Authorization框架,通过推理轨迹将授权机制内化于大语言模型中 | manipulation large language model |