cs.LG（2025-04-24）

📊 共 20 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (9) 支柱二：RL算法与架构 (RL & Architecture) (8 🔗2) 支柱一：机器人控制 (Robot Control) (2) 支柱八：物理动画 (Physics-based Animation) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (9 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Effortless, Simulation-Efficient Bayesian Inference using Tabular Foundation Models	提出NPE-PFN，利用表格型预训练模型高效解决模拟推理中的样本效率问题	foundation model
2	A Simple Review of EEG Foundation Models: Datasets, Advancements and Future Perspectives	综述脑电图(EEG)基础模型：数据集、进展与未来展望	foundation model
3	Class-Conditional Distribution Balancing for Group Robust Classification	提出类条件分布平衡方法，无需偏见标注实现群体鲁棒分类	foundation model
4	Statistical Runtime Verification for LLMs via Robustness Estimation	提出基于鲁棒性估计的LLM统计运行时验证方法RoMA	large language model
5	L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference	提出L3架构以解决长文本序列推理中的内存瓶颈问题	large language model
6	Towards Harnessing the Collaborative Power of Large and Small Models for Domain Tasks	探索大模型与小模型协同，加速领域任务自适应	large language model
7	On-Device Qwen2.5: Efficient LLM Inference with Model Compression and Hardware Acceleration	提出基于AWQ量化与FPGA加速的Qwen2.5模型端侧高效推理框架	large language model
8	Symbolic Representation for Any-to-Any Generative Tasks	提出一种基于符号表示的通用生成框架，无需训练即可完成多模态任务。	multimodal
9	High-Fidelity And Complex Test Data Generation For Google SQL Code Generation Services	利用LLM生成高保真复杂测试数据，用于Google SQL代码生成服务	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (8 篇)

#	题目	一句话要点	标签	🔗	⭐
10	Training Large Language Models to Reason via EM Policy Gradient	提出EM策略梯度算法，提升LLM在复杂推理任务中的性能与可解释性	reinforcement learning PPO large language model
11	Cooperative Task Offloading through Asynchronous Deep Reinforcement Learning in Mobile Edge Computing for Future Networks	提出基于异步深度强化学习的协同任务卸载框架CTO-TP，优化未来网络MEC中的延迟和能耗。	reinforcement learning deep reinforcement learning
12	Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning	Plasticine：加速塑性驱动的深度强化学习研究的开源框架	reinforcement learning deep reinforcement learning	✅
13	RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning	RAGEN：通过多轮强化学习理解LLM Agent的自我进化	reinforcement learning large language model	✅
14	CaRL: Learning Scalable Planning Policies with Simple Rewards	CaRL：通过简单奖励学习可扩展的规划策略，应用于自动驾驶。	reinforcement learning PPO imitation learning
15	ExOSITO: Explainable Off-Policy Learning with Side Information for Intensive Care Unit Blood Test Orders	ExOSITO：结合辅助信息的ICU血检医嘱可解释离线策略学习	policy learning privileged information
16	Do We Need Transformers to Play FPS Video Games?	在第一人称射击游戏中使用Transformer不如传统方法	reinforcement learning offline reinforcement learning decision transformer
17	Collaborative Multi-Agent Reinforcement Learning for Automated Feature Transformation with Graph-Driven Path Optimization	提出TCTO框架，通过图驱动路径优化实现自动化特征工程，提升下游任务性能。	reinforcement learning

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
18	High-Performance Reinforcement Learning on Spot: Optimizing Simulation Parameters with Distributional Measures	在Spot机器人上实现高性能强化学习：利用分布度量优化仿真参数	locomotion sim2real reinforcement learning
19	The effects of Hessian eigenvalue spectral density type on the applicability of Hessian analysis to generalization capability assessment of neural networks	研究Hessian特征值谱密度类型对神经网络泛化能力评估的影响	manipulation

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
20	OmegAMP: Targeted AMP Discovery through Biologically Informed Generation	OmegAMP：通过生物信息指导的生成方法实现靶向抗菌肽发现	AMP

⬅️ 返回 cs.LG 首页 · 🏠 返回主页