cs.LG（2025-05-07）

📊 共 27 篇论文 | 🔗 5 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (16 🔗3) 支柱九：具身大模型 (Embodied Foundation Models) (8 🔗1) 支柱一：机器人控制 (Robot Control) (3 🔗1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (16 篇)

#	题目	一句话要点	标签	🔗	⭐
1	ALFEE: Adaptive Large Foundation Model for EEG Representation	ALFEE：用于脑电信号表征的自适应大规模预训练模型	representation learning foundation model	✅
2	A Heuristic-Integrated DRL Approach for Phase Optimization in Large-Scale RISs	提出一种启发式集成的DRL方法，用于大规模RIS的相位优化	reinforcement learning deep reinforcement learning DRL
3	On-Device LLM for Context-Aware Wi-Fi Roaming	提出一种基于设备端LLM的上下文感知Wi-Fi漫游方法，优化无线网络连接。	DRL large language model chain-of-thought
4	InfoNCE is a Free Lunch for Semantically guided Graph Contrastive Learning	IFL-GCL：利用InfoNCE进行语义引导的图对比学习，解决负样本偏差问题	contrastive learning foundation model	✅
5	ARDNS-FN-Quantum: A Quantum-Enhanced Reinforcement Learning Framework with Cognitive-Inspired Adaptive Exploration for Dynamic Environments	提出ARDNS-FN-Quantum框架以解决动态环境中的强化学习探索问题	reinforcement learning PPO
6	ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via $α$-$β$-Divergence	ABKD：通过α-β散度实现知识蒸馏中概率质量的合理分配	teacher-student distillation	✅
7	Adaptive and Robust DBSCAN with Multi-agent Reinforcement Learning	提出AR-DBSCAN，利用多智能体强化学习自适应解决DBSCAN在多密度数据集上的聚类问题。	reinforcement learning deep reinforcement learning
8	DMRL: Data- and Model-aware Reward Learning for Data Extraction	提出DMRL：一种数据与模型感知的奖励学习方法，用于从大型语言模型中提取敏感数据。	reinforcement learning inverse reinforcement learning large language model
9	Fight Fire with Fire: Defending Against Malicious RL Fine-Tuning via Reward Neutralization	提出奖励中和方法，防御恶意RL微调对语言模型的安全攻击。	reinforcement learning large language model
10	A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance	提出PGDA-RL以解决强化学习中的数据利用与探索问题	reinforcement learning
11	Distillation-Enabled Knowledge Alignment Protocol for Semantic Communication in AI Agent Networks	提出DeKAP协议，通过知识蒸馏对齐AI Agent网络中的语义通信知识。	distillation
12	A Survey on Temporal Interaction Graph Representation Learning: Progress, Challenges, and Opportunities	时序交互图表示学习综述：进展、挑战与机遇	representation learning
13	Extending a Quantum Reinforcement Learning Exploration Policy with Flags to Connect Four	扩展量子强化学习探索策略，应用于四子棋游戏	reinforcement learning
14	Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning -- Empirical analysis based on UK COVID-19 epidemic data	基于强化学习优化传染病干预措施，以英国COVID-19疫情数据为实证分析	reinforcement learning
15	Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers	RL$^V$: 统一LLM推理器与验证器，提升强化学习测试时可扩展性	reinforcement learning PPO
16	Deep Learning Innovations for Energy Efficiency: Advances in Non-Intrusive Load Monitoring and EV Charging Optimization for a Sustainable Grid	利用深度学习创新，通过非侵入式负荷监测和电动汽车充电优化，提升能源效率。	reinforcement learning deep reinforcement learning

🔬 支柱九：具身大模型 (Embodied Foundation Models) (8 篇)

#	题目	一句话要点	标签	🔗	⭐
17	ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling	ORBIT-2：通过可扩展的视觉基础模型实现天气和气候的降尺度	foundation model
18	Lossless Compression of Large Language Model-Generated Text via Next-Token Prediction	利用下一token预测，实现对大语言模型生成文本的无损压缩	large language model
19	Nature's Insight: A Novel Framework and Comprehensive Analysis of Agentic Reasoning Through the Lens of Neuroscience	提出神经科学启发的智能体推理框架，提升AI自主性和泛化能力	foundation model multimodal chain-of-thought	✅
20	A Large Language Model for Feasible and Diverse Population Synthesis	提出基于LLM和贝叶斯网络的混合方法，用于生成可行且多样化的人口合成数据。	large language model
21	When Bad Data Leads to Good Models	通过预训练阶段引入不良数据，提升后训练阶段大语言模型的可控性与安全性。	large language model
22	Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs	利用程序执行轨迹增强代码大语言模型以提升程序修复效果	large language model
23	STRGCN: Capturing Asynchronous Spatio-Temporal Dependencies for Irregular Multivariate Time Series Forecasting	提出STRGCN，用于解决不规则多元时间序列预测中异步时空依赖建模难题。	TAMP
24	Rethinking LLM Advancement: Compute-Dependent and Independent Paths to Progress	区分计算依赖与独立创新，评估算法进步对大语言模型能力的影响	large language model

🔬 支柱一：机器人控制 (Robot Control) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
25	Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation	提出MAD算法，通过融合与解耦多视角信息，提升机器人操作视觉强化学习的效率与鲁棒性。	manipulation reinforcement learning	✅
26	Trajectory Entropy Reinforcement Learning for Predictable and Robust Control	提出轨迹熵强化学习，提升控制策略的预测性与鲁棒性	locomotion reinforcement learning deep reinforcement learning
27	Robust ML Auditing using Prior Knowledge	提出一种利用先验知识的鲁棒机器学习审计方法，防止平台操纵审计结果。	manipulation

⬅️ 返回 cs.LG 首页 · 🏠 返回主页