| 1 |
Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving |
Pimba: a processing-in-memory acceleration scheme for serving post-Transformer large language models. |
SSM state space model linear attention |
|
|
| 2 |
Offline Reinforcement Learning with Wasserstein Regularization via Optimal Transport Maps |
Proposes an offline reinforcement learning method based on optimal transport maps and Wasserstein regularization to address distribution shift. |
reinforcement learning offline RL offline reinforcement learning |
✅ |
|
| 3 |
GHPO: Adaptive Guidance for Stable and Efficient LLM Reinforcement Learning |
Proposes GHPO, an adaptive-guidance framework for stable and efficient LLM reinforcement learning. |
reinforcement learning imitation learning curriculum learning |
|
|
| 4 |
Graph World Model |
Proposes the Graph World Model (GWM), which handles unstructured and graph-structured data in a unified way and supports multi-modal tasks. |
world model foundation model |
✅ |
|
| 5 |
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination |
Reveals how data contamination skews the evaluation of mathematical reasoning in RL-fine-tuned large models, and introduces RandomCalculation, a clean dataset. |
reinforcement learning large language model |
|
|
| 6 |
MoCap-Impute: A Comprehensive Benchmark and Comparative Analysis of Imputation Methods for IMU-based Motion Capture Data |
MoCap-Impute: a comprehensive benchmark and comparative analysis of imputation methods for missing values in IMU-based motion capture data. |
MAE IMU-based motion |
|
|
| 7 |
Recognizing Dementia from Neuropsychological Tests with State Space Models |
Proposes Demenba, a state-space-model-based framework for automatic dementia recognition from neuropsychological tests. |
state space model large language model |
|
|
| 8 |
A Generalizable Physics-Enhanced State Space Model for Long-Term Dynamics Forecasting in Complex Environments |
Proposes Phy-SSM, a state space model that incorporates physics knowledge for long-term dynamics forecasting in complex environments. |
SSM state space model |
✅ |
|
| 9 |
Compression Method for Deep Diagonal State Space Model Based on $H^2$ Optimal Reduction |
Proposes a compression method for deep diagonal state space models based on $H^2$ optimal reduction. |
SSM state space model |
|
|
| 10 |
FusionFactory: Fusing LLM Capabilities with Multi-LLM Log Data |
FusionFactory: fuses multi-LLM log data to improve LLM performance across diverse tasks. |
distillation large language model |
|
|
| 11 |
Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning |
Proposes FedFD, a feature-distillation-based approach to model-heterogeneous federated learning. |
distillation |
|
|
| 12 |
Text-Driven Causal Representation Learning for Source-Free Domain Generalization |
Proposes TDCRL, which tackles source-free domain generalization via text-driven causal representation learning. |
representation learning |
|
|
| 13 |
Multi-Armed Sampling Problem and the End of Exploration |
Introduces the multi-armed sampling framework, proves that sampling requires no exploration, and provides a theoretical foundation for entropy-regularized reinforcement learning and related settings. |
reinforcement learning RLHF |
|
|