cs.LG(2024-08-04)
📊 共 7 篇论文
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (4)
支柱九:具身大模型 (Embodied Foundation Models) (2)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | SelfBC: Self Behavior Cloning for Offline Reinforcement Learning | SelfBC:离线强化学习中基于自行为克隆的动态策略约束方法 | reinforcement learning offline reinforcement learning behavior cloning | ||
| 2 | Scenario-based Thermal Management Parametrization Through Deep Reinforcement Learning | 提出基于深度强化学习的热管理参数自动整定方法,提升电动汽车热管理控制器的性能。 | reinforcement learning deep reinforcement learning | ||
| 3 | Shaping Rewards, Shaping Routes: On Multi-Agent Deep Q-Networks for Routing in Satellite Constellation Networks | 提出基于多智能体深度Q网络的卫星星座网络路由方法,优化延迟和负载均衡。 | reinforcement learning deep reinforcement learning reward shaping | ||
| 4 | Top K Enhanced Reinforcement Learning Attacks on Heterogeneous Graph Node Classification | 提出HeteroKRLAttack,一种针对异构图节点分类的Top-K增强强化学习攻击方法 | reinforcement learning |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 5 | Towards Automatic Hands-on-Keyboard Attack Detection Using LLMs in EDR Solutions | 提出一种基于LLM的EDR解决方案,用于自动检测键盘手动攻击 | large language model | ||
| 6 | Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion | 提出分布级别记忆回溯(DMR)方法,解决持续学习中特征空间分布失真导致的知识遗忘问题。 | multimodal |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 7 | RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning | 提出RVI-SAC,一种基于平均奖励的Off-Policy深度强化学习方法,适用于持续性任务。 | locomotion reinforcement learning deep reinforcement learning |