cs.LG(2025-07-08)

📊 共 8 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (5 🔗2) 支柱一:机器人控制 (Robot Control) (2) 支柱九:具身大模型 (Embodied Foundation Models) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
1 AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs AutoTriton:利用强化学习在LLM中自动生成高性能Triton程序 reinforcement learning reward design
2 Assuring the Safety of Reinforcement Learning Components: AMLAS-RL 提出AMLAS-RL框架,保障强化学习组件在网络物理系统中的安全性 reinforcement learning
3 Differential Mamba 提出差分Mamba,缓解Mamba模型对无关上下文的过度关注问题 Mamba
4 MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment 提出MobileGUI-RL,通过在线强化学习提升移动GUI代理的自动化能力 reinforcement learning
5 Sample-Efficient Reinforcement Learning Controller for Deep Brain Stimulation in Parkinson's Disease 提出SEA-DBS,一种用于帕金森病DBS的样本高效强化学习控制器 reinforcement learning

🔬 支柱一:机器人控制 (Robot Control) (2 篇)

#题目一句话要点标签🔗
6 Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation 提出不确定性感知强化学习以解决安全领域随机化问题 quadruped domain randomization reinforcement learning
7 Robust Power System State Estimation using Physics-Informed Neural Networks 提出基于物理信息神经网络的电力系统状态估计方法,提升鲁棒性和准确性。 manipulation

🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)

#题目一句话要点标签🔗
8 Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen Substrate 提出一种基于冻结底层和模块化扩展的Transformer增长方法,实现高效可扩展的LLM。 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页