cs.LG(2024-09-07)

📊 共 4 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (3) 支柱九:具身大模型 (Embodied Foundation Models) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)

#题目一句话要点标签🔗
1 Reward Guidance for Reinforcement Learning Tasks Based on Large Language Models: The LMGT Framework 提出LMGT框架以解决稀疏奖励下的强化学习问题 reinforcement learning large language model
2 Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn 提出CHAIN方法,通过减少价值和策略的链式漂移来提升深度强化学习性能 reinforcement learning deep reinforcement learning DRL
3 Sample and Oracle Efficient Reinforcement Learning for MDPs with Linearly-Realizable Value Functions 针对线性可实现值函数的MDP,提出样本和Oracle高效的强化学习算法 reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)

#题目一句话要点标签🔗
4 Optimization Hyper-parameter Laws for Large Language Models 提出优化超参数定律(Opt-Laws),用于大语言模型训练中学习率调度的优化选择。 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页