cs.AI(2024-06-28)
📊 共 17 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (11 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (5)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (11 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | Tradeoffs When Considering Deep Reinforcement Learning for Contingency Management in Advanced Air Mobility | 针对先进空中交通的应急管理,探索深度强化学习的权衡方案 | reinforcement learning deep reinforcement learning DRL | ||
| 13 | Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation and Improvement through LLMs | 提出LLM4PG框架,利用大语言模型提升强化学习在复杂约束游戏任务中的轨迹评估与策略优化。 | reinforcement learning large language model | ||
| 14 | External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling | 提出基于外部模型的强化学习智能体,提升环境采样效率 | reinforcement learning | ||
| 15 | Optimizing Cyber Defense in Dynamic Active Directories through Reinforcement Learning | 提出基于强化学习的动态Active Directory网络防御优化方法 | reinforcement learning | ||
| 16 | Fuzzy Logic Guided Reward Function Variation: An Oracle for Testing Reinforcement Learning Programs | 提出基于模糊逻辑的奖励函数变异方法,解决强化学习程序测试中的Oracle问题 | reinforcement learning |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | Function+Data Flow: A Framework to Specify Machine Learning Pipelines for Digital Twinning | 提出Function+Data Flow领域特定语言,简化数字孪生AI流水线设计。 | manipulation |