cs.LG（2025-06-14）

📊 共 18 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (10) 支柱九：具身大模型 (Embodied Foundation Models) (7 🔗1) 支柱八：物理动画 (Physics-based Animation) (1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (10 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Generalizable Trajectory Prediction via Inverse Reinforcement Learning with Mamba-Graph Architecture	提出基于Mamba-Graph架构的逆强化学习方法，提升轨迹预测的泛化能力	reinforcement learning inverse reinforcement learning Mamba
2	DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty	提出DR-SAC算法，增强SAC在不确定环境下强化学习的鲁棒性	reinforcement learning deep reinforcement learning SAC
3	Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning	提出SARA：一种基于相似性的奖励对齐方法，提升偏好强化学习的鲁棒性和通用性	reinforcement learning offline RL reward shaping
4	Merlin: Multi-View Representation Learning for Robust Multivariate Time Series Forecasting with Unfixed Missing Rates	提出Merlin，通过多视角表征学习增强MTSF模型在非固定缺失率下的鲁棒性。	representation learning contrastive learning distillation
5	Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis	提出基于相对熵正则化强化学习的加密策略合成方法，实现高效隐私保护。	reinforcement learning OMOMO
6	PLD: A Choice-Theoretic List-Wise Knowledge Distillation	提出基于选择理论的列表式知识蒸馏方法PLD，提升模型压缩性能	teacher-student distillation
7	Mapping Neural Signals to Agent Performance, A Step Towards Reinforcement Learning from Neural Feedback	提出NEURO-LOOP框架，探索脑信号到智能体性能的映射，为神经反馈强化学习奠定基础	reinforcement learning
8	Information fusion strategy integrating pre-trained language model and contrastive learning for materials knowledge mining	提出融合预训练语言模型与对比学习的信息融合策略，用于材料知识挖掘。	contrastive learning
9	Interpretable Causal Representation Learning for Biological Data in the Pathway Space	提出SENA-discrepancy-VAE，用于生物数据因果表征学习，提升模型可解释性。	representation learning
10	PROTOCOL: Partial Optimal Transport-enhanced Contrastive Learning for Imbalanced Multi-view Clustering	提出PROTOCOL框架，解决不平衡多视图聚类中的表征退化问题。	contrastive learning

🔬 支柱九：具身大模型 (Embodied Foundation Models) (7 篇)

#	题目	一句话要点	标签	🔗	⭐
11	Unveiling Confirmation Bias in Chain-of-Thought Reasoning	揭示思维链推理中大语言模型的确认偏差现象	large language model chain-of-thought	✅
12	Exploring the Secondary Risks of Large Language Models	探索大语言模型在良性交互下的次生风险，提出SecLens评估框架。	large language model
13	HYPER: A Foundation Model for Inductive Link Prediction with Knowledge Hypergraphs	提出HYPER模型以解决知识超图的归纳链接预测问题	foundation model
14	Beyond Frequency: The Role of Redundancy in Large Language Model Memorization	揭示冗余在大型语言模型记忆中的作用，提出基于冗余的数据预处理方法。	large language model
15	A Framework for Generating Conversational Recommendation Datasets from Behavioral Interactions	ConvRecStudio：基于行为交互生成对话式推荐数据集的框架	large language model
16	Automatic Expert Discovery in LLM Upcycling via Sparse Interpolated Mixture-of-Experts	提出SIMoE，通过稀疏插值混合专家模型实现LLM的自动专家发现与能力提升。	large language model
17	QiMeng-Attention: SOTA Attention Operator is generated by SOTA Attention Algorithm	提出QiMeng-Attention，通过LLM自动生成高性能Attention算子，解决长文本场景下的性能瓶颈。	large language model

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
18	Path-specific effects for pulse-oximetry guided decisions in critical care	利用路径特定效应，研究脉搏血氧仪偏差对重症监护决策的影响	PULSE

⬅️ 返回 cs.LG 首页 · 🏠 返回主页