cs.LG（2024-11-03）

📊 共 9 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

#	题目	一句话要点	标签
1	GITSR: Graph Interaction Transformer-based Scene Representation for Multi Vehicle Collaborative Decision-making	提出基于图交互Transformer的场景表示框架GITSR，用于多车协同决策。	reinforcement learning interaction transformer
2	Sample-Efficient Alignment for LLMs	提出SEA算法，通过上下文决斗强盗框架实现LLM高效对齐	preference learning RLHF DPO
3	Learning World Models for Unconstrained Goal Navigation	提出MUN算法，解决无约束目标导航中世界模型泛化性问题。	reinforcement learning world model
4	Learning Hidden Subgoals under Temporal Ordering Constraints in Reinforcement Learning	提出LSTOC算法，解决强化学习中时序约束下学习隐藏子目标的问题	reinforcement learning contrastive learning
5	Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning	提出聚类边缘探索算法CE²，提升目标条件强化学习在未知环境中的探索效率	reinforcement learning
6	Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment	提出块级Logit蒸馏框架，通过隐式特征对齐提升知识蒸馏性能	distillation

#	题目	一句话要点	标签	🔗
7	Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models	提出选择性投影衰减(SPD)，提升预训练模型微调的鲁棒性和泛化性	foundation model	✅
8	Classifier-guided Gradient Modulation for Enhanced Multimodal Learning	提出分类器引导的梯度调制（CGGM）方法，用于增强多模态学习中的模态平衡。	multimodal	✅
9	Unlocking the Theory Behind Scaling 1-Bit Neural Networks	首次从理论上证明1-bit神经网络的Scaling Law，揭示其高效扩展潜力	large language model