cs.LG(2024-06-28)
📊 共 15 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (7 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (6)
支柱一:机器人控制 (Robot Control) (2)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (7 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 8 | Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints | 提出Edge-DIRECT,基于深度强化学习解决带时间窗约束的异构电动汽车路径优化问题 | reinforcement learning deep reinforcement learning DRL | ||
| 9 | Operator World Models for Reinforcement Learning | 提出基于算子世界模型的强化学习算法POWR,解决策略镜像下降法在强化学习中的应用难题。 | reinforcement learning world model | ||
| 10 | Towards Stable and Storage-efficient Dataset Distillation: Matching Convexified Trajectory | 提出匹配凸化轨迹(MCT)方法,解决数据集蒸馏中训练轨迹匹配的不稳定性和存储效率问题。 | distillation large language model | ||
| 11 | TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes | TabSketchFM:提出基于草图的表格表示学习方法,用于数据湖中的数据发现。 | representation learning | ||
| 12 | Reinforcement Learning for Efficient Design and Control Co-optimisation of Energy Systems | 提出基于强化学习的能源系统设计与控制协同优化框架,提升可再生能源利用率。 | reinforcement learning | ||
| 13 | LLM Critics Help Catch LLM Bugs | 利用LLM评论员辅助发现LLM代码缺陷,提升人工评估准确性 | reinforcement learning RLHF |
🔬 支柱一:机器人控制 (Robot Control) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 14 | Modeling the Real World with High-Density Visual Particle Dynamics | 提出高密度视觉粒子动力学模型,用于模拟真实场景物理动态 | bi-manual world model linear attention | ||
| 15 | Model Predictive Simulation Using Structured Graphical Models and Transformers | 提出基于Transformer和概率图模型的模型预测模拟方法,提升多智能体交互场景的安全性。 | MPC model predictive control |