cs.AI(2024-09-12)
📊 共 15 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (9 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (5)
支柱八:物理动画 (Physics-based Animation) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (9 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 10 | A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning | 提出时空隐蔽后门攻击,提升合作多智能体深度强化学习的安全性 | reinforcement learning deep reinforcement learning spatiotemporal | ||
| 11 | Towards Opinion Shaping: A Deep Reinforcement Learning Approach in Bot-User Interactions | 提出基于深度强化学习的社交网络舆论引导方法,通过用户-机器人交互影响舆论。 | reinforcement learning deep reinforcement learning DRL | ||
| 12 | Design Optimization of Nuclear Fusion Reactor through Deep Reinforcement Learning | 提出基于深度强化学习的核聚变反应堆设计优化方法,降低建造成本。 | reinforcement learning deep reinforcement learning DRL | ||
| 13 | Tidal MerzA: Combining affective modelling and autonomous code generation through Reinforcement Learning | Tidal-MerzA:结合情感建模与自主代码生成的强化学习音乐共创系统 | reinforcement learning | ||
| 14 | Fitted Q-Iteration via Max-Plus-Linear Approximation | 提出基于Max-Plus线性近似的Fitted Q-Iteration算法,用于离线强化学习。 | reinforcement learning offline reinforcement learning |
🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | Autonomous Vehicle Controllers From End-to-End Differentiable Simulation | 提出基于可微仿真和解析策略梯度的自动驾驶车辆控制器训练方法 | differentiable simulation |