cs.LG(2024-07-15)

📊 共 6 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (2) 支柱九:具身大模型 (Embodied Foundation Models) (2) 支柱一:机器人控制 (Robot Control) (2)

🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)

#题目一句话要点标签🔗
1 SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation SuperPADL:通过渐进式监督蒸馏扩展语言驱动的物理控制 reinforcement learning distillation text-to-motion
2 Learning Dynamics of LLM Finetuning 研究LLM微调的学习动态,揭示幻觉和偏好优化现象,并提出改进对齐方法。 DPO direct preference optimization large language model

🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)

#题目一句话要点标签🔗
3 Mechanistic interpretability of large language models with applications to the financial services industry 利用机制可解释性分析大型语言模型在金融服务中的应用,聚焦公平贷款合规性 large language model
4 LLM Circuit Analyses Are Consistent Across Training and Scale LLM回路分析在训练和规模扩展中保持一致性,揭示小模型分析对大模型的适用性 large language model

🔬 支柱一:机器人控制 (Robot Control) (2 篇)

#题目一句话要点标签🔗
5 Balancing the Scales: Reinforcement Learning for Fair Classification 提出基于强化学习的公平分类方法,通过调整奖励函数缓解不平衡数据集中的偏见。 manipulation reinforcement learning
6 Fast Matrix Multiplications for Lookup Table-Quantized LLMs FLUTE:加速查找表量化LLM的矩阵乘法,提升推理速度 manipulation large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页