cs.LG(2025-07-08)
📊 8 papers in total | 🔗 2 with code
🎯 Interest Area Navigation
Pillar 2: RL Algorithms & Architecture (5 🔗2)
Pillar 1: Robot Control (2)
Pillar 9: Embodied Foundation Models (1)
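🔬 Pillar 2: RL Algorithms & Architecture (5 papers)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs | AutoTriton: uses reinforcement learning in LLMs to automatically generate high-performance Triton programs | reinforcement learning reward design | ✅ | |
| 2 | Assuring the Safety of Reinforcement Learning Components: AMLAS-RL | Proposes the AMLAS-RL framework for assuring the safety of reinforcement learning components in cyber-physical systems | reinforcement learning | | |
| 3 | Differential Mamba | Proposes Differential Mamba to mitigate Mamba models' over-attention to irrelevant context | Mamba | ✅ | |
| 4 | MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment | Proposes MobileGUI-RL, which improves the automation capability of mobile GUI agents through online reinforcement learning | reinforcement learning | | |
| 5 | Sample-Efficient Reinforcement Learning Controller for Deep Brain Stimulation in Parkinson's Disease | Proposes SEA-DBS, a sample-efficient reinforcement learning controller for deep brain stimulation (DBS) in Parkinson's disease | reinforcement learning | | |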
🔬 Pillar 1: Robot Control (2 papers)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 6 | Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation | Proposes uncertainty-aware reinforcement learning to address safe domain randomization | quadruped domain randomization reinforcement learning | | |
| 7 | Robust Power System State Estimation using Physics-Informed Neural Networks | Proposes a physics-informed neural network method for power system state estimation that improves robustness and accuracy. | manipulation | | |
🔬 Pillar 9: Embodied Foundation Models (1 paper)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 8 | Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen Substrate | Proposes a Transformer growth method based on a frozen substrate and modular expansion, enabling efficient and scalable LLMs. | large language model | | |