cs.AI (2025-02-20)
📊 20 papers in total | 🔗 3 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (12 🔗3)
Pillar 2: RL & Architecture (6)
Pillar 1: Robot Control (2)
🔬 Pillar 9: Embodied Foundation Models (12 papers)
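🔬 Pillar 2: RL & Architecture (6 papers)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation | Proposes the ExFM framework for efficiently serving trillion-parameter external large foundation models in online ads recommendation. | distillation, foundation model | | |
| 14 | SPRIG: Stackelberg Perception-Reinforcement Learning with Internal Game Dynamics | SPRIG: a Stackelberg perception-reinforcement learning framework based on internal game dynamics. | reinforcement learning, deep reinforcement learning, PPO | | |
| 15 | Making Universal Policies Universal | Proposes a cross-agent universal policy learning method that addresses general decision-making under heterogeneous action spaces. | policy learning, imitation learning, generalist agent | | |
| 16 | Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning | Proposes a reinforcement-learning-based method for optimizing quantum error-correcting codes that significantly reduces physical qubit overhead. | reinforcement learning | | |
| 17 | HPS: Hard Preference Sampling for Human Preference Alignment | Proposes the Hard Preference Sampling (HPS) framework to improve the robustness and efficiency of aligning LLMs with human preferences. | RLHF, large language model | | |
| 18 | Causal Mean Field Multi-Agent Reinforcement Learning | Proposes the causal mean-field Q-learning (CMFQ) algorithm to improve the scalability of multi-agent reinforcement learning in non-stationary environments. | reinforcement learning | | |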
🔬 Pillar 1: Robot Control (2 papers)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 19 | Multi-Agent Coordination across Diverse Applications: A Survey | A survey of multi-agent coordination: coordination mechanisms across application domains and future directions. | humanoid, large language model | | |
| 20 | Towards Secure Program Partitioning for Smart Contracts with LLM's In-Context Learning | PartitionGPT: secure program partitioning for smart contracts using LLM in-context learning. | manipulation, large language model | | |