cs.AI(2026-01-21)
📊 共 18 篇论文
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (11)
支柱一:机器人控制 (Robot Control) (6)
支柱二:RL算法与架构 (RL & Architecture) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (11 篇)
🔬 支柱一:机器人控制 (Robot Control) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries | 提出BayesianVLA以解决视觉语言行动模型的泛化问题 | manipulation vision-language-action VLA | ||
| 13 | Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation | 揭示LLM评估中链式思维的脆弱性及其影响 | manipulation large language model chain-of-thought | ||
| 14 | Proximal Policy Optimization with Evolutionary Mutations | 提出POEM算法,通过进化变异增强PPO探索能力,解决强化学习早熟收敛问题。 | bipedal biped reinforcement learning | ||
| 15 | CI4A: Semantic Component Interfaces for Agents Empowering Web Automation | 提出CI4A以解决Web组件操作的低效问题 | manipulation reinforcement learning large language model | ||
| 16 | NeuroFilter: Privacy Guardrails for Conversational LLM Agents | NeuroFilter:为对话式LLM Agent提供隐私保护,降低计算成本。 | manipulation large language model | ||
| 17 | An Agentic Operationalization of DISARM for FIMI Investigation on Social Media | 提出基于Agent的DISARM框架,用于大规模社交媒体上的FIMI调查 | manipulation |
🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 18 | Vehicle Routing with Finite Time Horizon using Deep Reinforcement Learning with Improved Network Embedding | 提出一种改进网络嵌入的深度强化学习方法,解决有限时间范围内的车辆路径问题。 | reinforcement learning deep reinforcement learning |