cs.LG(2024-11-03)

📊 共 9 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (6) 支柱九:具身大模型 (Embodied Foundation Models) (3 🔗2)

🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)

#题目一句话要点标签🔗
1 GITSR: Graph Interaction Transformer-based Scene Representation for Multi Vehicle Collaborative Decision-making 提出基于图交互Transformer的场景表示框架GITSR,用于多车协同决策。 reinforcement learning interaction transformer
2 Sample-Efficient Alignment for LLMs 提出SEA算法,通过上下文决斗强盗框架实现LLM高效对齐 preference learning RLHF DPO
3 Learning World Models for Unconstrained Goal Navigation 提出MUN算法,解决无约束目标导航中世界模型泛化性问题。 reinforcement learning world model
4 Learning Hidden Subgoals under Temporal Ordering Constraints in Reinforcement Learning 提出LSTOC算法,解决强化学习中时序约束下学习隐藏子目标的问题 reinforcement learning contrastive learning
5 Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning 提出聚类边缘探索算法CE²,提升目标条件强化学习在未知环境中的探索效率 reinforcement learning
6 Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment 提出块级Logit蒸馏框架,通过隐式特征对齐提升知识蒸馏性能 distillation

🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)

#题目一句话要点标签🔗
7 Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models 提出选择性投影衰减(SPD),提升预训练模型微调的鲁棒性和泛化性 foundation model
8 Classifier-guided Gradient Modulation for Enhanced Multimodal Learning 提出分类器引导的梯度调制(CGGM)方法,用于增强多模态学习中的模态平衡。 multimodal
9 Unlocking the Theory Behind Scaling 1-Bit Neural Networks 首次从理论上证明1-bit神经网络的Scaling Law,揭示其高效扩展潜力 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页