cs.LG（2025-05-04）

📊 共 14 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (7 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (6 🔗2) 支柱一：机器人控制 (Robot Control) (1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (7 篇)

#	题目	一句话要点	标签	🔗	⭐
1	SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy Demonstrations	SkillMimic-V2：从稀疏噪声交互演示中学习鲁棒且泛化的技能	reinforcement learning
2	Learning Local Causal World Models with State Space Models and Attention	利用状态空间模型和注意力机制学习局部因果世界模型	world model SSM state space model
3	Efficient Multivariate Time Series Forecasting via Calibrated Language Models with Privileged Knowledge Distillation	TimeKD：利用校准语言模型和特权知识蒸馏的高效多元时间序列预测框架	distillation privileged information large language model
4	Deep Representation Learning for Electronic Design Automation	利用深度表征学习提升电子设计自动化（EDA）的效率与准确性	representation learning multimodal
5	D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection	提出D3HRL，通过因果发现与伪相关检测解决分层强化学习中的延迟效应和伪相关问题	reinforcement learning
6	Universal Approximation Theorem of Deep Q-Networks	通过随机控制和FBSDE，证明DQN在连续时间MDP中的通用逼近定理	reinforcement learning deep reinforcement learning
7	Meta-Black-Box-Optimization through Offline Q-function Learning	提出基于离线Q函数学习的元黑盒优化框架Q-Mamba，提升算法配置效率。	conservative q-learning Mamba	✅

🔬 支柱九：具身大模型 (Embodied Foundation Models) (6 篇)

#	题目	一句话要点	标签	🔗	⭐
8	Restoring Calibration for Aligned Large Language Models: A Calibration-Aware Fine-Tuning Approach	提出校准感知微调方法，恢复对齐后大语言模型的校准性。	large language model
9	From Biometrics to Environmental Control: AI-Enhanced Digital Twins for Personalized Health Interventions in Healing Landscapes	提出AI增强的数字孪生框架，用于个性化健康干预和疗愈环境控制。	multimodal
10	An Empirical Study of Qwen3 Quantization	针对Qwen3大语言模型，系统性研究了不同量化方案对其性能的影响。	large language model	✅
11	GRAIL: Graph Edit Distance and Node Alignment Using LLM-Generated Code	GRAIL：利用LLM生成代码进行图编辑距离计算和节点对齐	large language model
12	Lightweight Defense Against Adversarial Attacks in Time Series Classification	提出基于数据增强的时间序列分类对抗攻击轻量级防御方法	foundation model
13	Wide & Deep Learning for Node Classification	GCNIII：结合Wide & Deep架构提升节点分类性能，关注节点特征利用	large language model	✅

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
14	Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning	提出耦合分布随机专家蒸馏，用于世界模型在线模仿学习	locomotion manipulation imitation learning

⬅️ 返回 cs.LG 首页 · 🏠 返回主页