cs.LG（2025-11-07）

📊 共 20 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (9 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (8) 支柱八：物理动画 (Physics-based Animation) (2 🔗1) 支柱三：空间感知与语义 (Perception & Semantics) (1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (9 篇)

#	题目	一句话要点	标签	🔗	⭐
1	An End-to-End Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drones	提出基于Transformer和Minimal Gated Unit的深度强化学习框架，解决带无人机的旅行商问题。	reinforcement learning deep reinforcement learning policy learning
2	You Need Reasoning to Learn Reasoning: The Limitations of Label-Free RL in Weak Base Models	研究表明，无监督强化学习提升小模型推理能力受限，并提出课程学习与数据筛选方法。	reinforcement learning curriculum learning large language model	✅
3	A Metamorphic Testing Perspective on Knowledge Distillation for Language Models of Code: Does the Student Deeply Mimic the Teacher?	提出MetaCompress，用于评估代码语言模型知识蒸馏的保真度。	distillation large language model
4	Multi-agent Coordination via Flow Matching	提出MAC-Flow，通过流匹配实现高效多智能体协同	reinforcement learning flow matching
5	MedFedPure: A Medical Federated Framework with MAE-based Detection and Diffusion Purification for Inference-Time Attacks	MedFedPure：一种基于MAE检测和扩散净化的医学联邦学习框架，用于防御推理时攻击。	masked autoencoder MAE
6	Sample Complexity of Distributionally Robust Off-Dynamics Reinforcement Learning with Online Interaction	提出基于在线交互的分布鲁棒离策略强化学习算法，解决探索难题。	reinforcement learning
7	SAD-Flower: Flow Matching for Safe, Admissible, and Dynamically Consistent Planning	SAD-Flower：通过流匹配实现安全、容许和动态一致的规划	flow matching
8	On Flow Matching KL Divergence	推导Flow Matching KL散度的确定性上界，提升生成模型统计效率	flow matching
9	Distributionally Robust Self Paced Curriculum Reinforcement Learning	提出DR-SPCRL，通过自步课程学习调整鲁棒性预算，提升强化学习在分布偏移下的性能。	reinforcement learning

🔬 支柱九：具身大模型 (Embodied Foundation Models) (8 篇)

#	题目	一句话要点	标签	🔗	⭐
10	Turning Adversaries into Allies: Reversing Typographic Attacks for Multimodal E-Commerce Product Retrieval	提出一种反转印刷攻击的视觉-文本压缩方法，提升电商多模态商品检索性能。	foundation model multimodal
11	Model Merging Improves Zero-Shot Generalization in Bioacoustic Foundation Models	模型融合提升生物声学基础模型零样本泛化能力	foundation model instruction following
12	Distributionally Robust Multimodal Machine Learning	提出分布鲁棒多模态学习框架，提升不确定性下的多模态融合性能。	multimodal
13	Multimodal Deep Learning for Prediction of Progression-Free Survival in Patients with Neuroendocrine Tumors Undergoing 177Lu-based Peptide Receptor Radionuclide Therapy	多模态深度学习预测177Lu-PRRT治疗神经内分泌肿瘤患者的无进展生存期	multimodal
14	Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder	提出多样化专家混合稀疏自编码器，提升大语言模型的可解释性与效率。	large language model
15	Usando LLMs para Programar Jogos de Tabuleiro e Variações	利用大型语言模型辅助棋盘游戏及其变种的程序开发	large language model
16	OvA-LP: A Simple and Efficient Framework for Federated Learning on Non-IID Data	OvA-LP：一种简单高效的联邦学习框架，用于解决非独立同分布数据下的模型漂移问题	foundation model
17	Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding	揭示LLM在概率解码下遗忘失效：提出Leak@$k$评估知识泄漏	large language model

🔬 支柱八：物理动画 (Physics-based Animation) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
18	Evaluating Spatio-Temporal Forecasting Trade-offs Between Graph Neural Networks and Foundation Models	评估图神经网络与时间序列基础模型在时空预测中的权衡，优化传感器部署。	spatiotemporal foundation model	✅
19	SSTODE: Ocean-Atmosphere Physics-Informed Neural ODEs for Sea Surface Temperature Prediction	SSTODE：基于海洋-大气物理信息的神经ODE，用于海面温度预测	spatiotemporal

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
20	Precipitation nowcasting of satellite data using physically-aligned neural networks	提出TUPANN，一种基于物理对齐神经网络的卫星降水临近预报模型	optical flow

⬅️ 返回 cs.LG 首页 · 🏠 返回主页