cs.AI (2025-01-03)
📊 16 papers in total | 🔗 4 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (11 🔗2)
Pillar 2: RL & Architecture (4 🔗2)
Pillar 1: Robot Control (1)
🔬 Pillar 9: Embodied Foundation Models (11 papers)
🔬 Pillar 2: RL & Architecture (4 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | BLAST: A Stealthy Backdoor Leverage Attack against Cooperative Multi-Agent Deep Reinforcement Learning based Systems | Proposes BLAST, a stealthy backdoor leverage attack against cooperative multi-agent deep reinforcement learning systems | reinforcement learning, deep reinforcement learning, spatiotemporal | | |
| 13 | Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models | Auto-RT: an automated LLM red-teaming framework for exploring and optimizing adversarial attack strategies | reinforcement learning, large language model | | |
| 14 | SDPO: Segment-Level Direct Preference Optimization for Social Agents | Proposes SDPO, a segment-level direct preference optimization method for social agents | DPO, direct preference optimization, large language model | ✅ | |
| 15 | Contrastive Learning Augmented Social Recommendations | Proposes CLSRec, a contrastive-learning-augmented social recommendation model that addresses recommendation for cold-start users | contrastive learning, distillation | ✅ | |
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 16 | Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning | Proposes hierarchical goal-conditioned policy planning to address sparse rewards in multi-goal reinforcement learning for humanoid robots | humanoid, humanoid robot, reinforcement learning | | |