cs.LG (2025-12-19)

📊 19 papers in total

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (9) · Pillar 2: RL & Architecture (6) · Pillar 8: Physics-based Animation (3) · Pillar 1: Robot Control (1)

🔬 Pillar 9: Embodied Foundation Models (9 papers)

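#	Title	One-Sentence Summary	Tags	🔗	⭐
1	Wireless Traffic Prediction with Large Language Model	Proposes the TIDES framework, which uses large language models for urban wireless traffic prediction and significantly improves prediction accuracy and robustness.	large language model, foundation model
2	Towards Benchmarking Privacy Vulnerabilities in Selective Forgetting with Large Language Models	Builds a benchmark for privacy vulnerabilities in selective forgetting to assess the privacy leakage risks of large language models.	large language model
3	Sampling from multimodal distributions with warm starts: Non-asymptotic bounds for the Reweighted Annealed Leap-Point Sampler	Proposes the Re-ALPS algorithm, which speeds up sampling from multimodal distributions without requiring Gaussian approximations.	multimodal
4	Disentangling Fact from Sentiment: A Dynamic Conflict-Consensus Framework for Multimodal Fake News Detection	Proposes DCCF, a dynamic conflict-consensus framework that makes better use of contradictory information in multimodal fake news detection.	multimodal
5	Enabling Disaggregated Multi-Stage MLLM Inference via GPU-Internal Scheduling and Resource Sharing	Proposes FlashCodec and UnifiedServe, which accelerate multi-stage MLLM inference through GPU-internal scheduling and resource sharing.	large language model, multimodal
6	Graph-based Nearest Neighbors with Dynamic Updates via Random Walks	Proposes a random-walk-based graph nearest-neighbor search algorithm with dynamic updates that supports efficient deletions.	large language model
7	A Dataset and Benchmarks for Atrial Fibrillation Detection from Electrocardiograms of Intensive Care Unit Patients	Releases a dataset and benchmarks for atrial fibrillation detection from ICU electrocardiograms and validates the effectiveness of ECG foundation models.	foundation model
8	Weighted Stochastic Differential Equation to Implement Wasserstein-Fisher-Rao Gradient Flow	Proposes a weighted stochastic differential equation method for the Wasserstein-Fisher-Rao gradient flow, improving sampling efficiency for generative models.	multimodal
9	Hierarchical Sparse Plus Low Rank Compression of LLM	Proposes a hierarchical sparse-plus-low-rank compression method (HSS) for compressing LLMs while preserving performance.	large language model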

🔬 Pillar 2: RL & Architecture (6 papers)

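#	Title	One-Sentence Summary	Tags	🔗	⭐
10	Shuttling Compiler for Trapped-Ion Quantum Computers Based on Large Language Models	Proposes an LLM-based shuttling compiler that optimizes qubit routing for trapped-ion quantum computers.	DPO, direct preference optimization, large language model
11	Probabilistic Digital Twins of Users: Latent Representation Learning with Statistically Validated Semantics	Proposes a user modeling framework based on probabilistic digital twins for interpretable user representation learning.	predictive model, representation learning
12	Trust-Region Adaptive Policy Optimization	Proposes the TRAPO framework, which interleaves SFT and RL to optimize LLM reasoning, improving exploration and stability.	reinforcement learning, large language model
13	Assessing Long-Term Electricity Market Design for Ambitious Decarbonization Targets using Multi-Agent Reinforcement Learning	Proposes a multi-agent reinforcement learning framework for assessing long-term electricity market design in support of deep decarbonization targets.	reinforcement learning
14	AdvJudge-Zero: Binary Decision Flips in LLM-as-a-Judge via Adversarial Control Tokens	AdvJudge-Zero flips the binary decisions of LLM judges via adversarial control tokens.	RLHF, DPO
15	A Theoretical Analysis of State Similarity Between Markov Decision Processes	Proposes a generalized bisimulation metric to measure state similarity across multiple Markov decision processes.	reinforcement learning, representation learning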

🔬 Pillar 8: Physics-based Animation (3 papers)

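#	Title	One-Sentence Summary	Tags	🔗	⭐
16	MINPO: Memory-Informed Neural Pseudo-Operator to Resolve Nonlocal Spatiotemporal Dynamics	Proposes MINPO, a memory-informed neural pseudo-operator for resolving nonlocal spatiotemporal dynamics.	spatiotemporal
17	Perfect reconstruction of sparse signals using nonconvexity control and one-step RSB message passing	Proposes a method for perfect reconstruction of sparse signals based on nonconvexity control and one-step RSB message passing.	AMP
18	Learning solution operator of dynamical systems with diffusion maps kernel ridge regression	Proposes a method for learning solution operators of dynamical systems based on diffusion maps kernel ridge regression, enabling long-term prediction.	spatiotemporal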

🔬 Pillar 1: Robot Control (1 paper)

#	Title	One-Sentence Summary	Tags	🔗	⭐
19	Biosecurity-Aware AI: Agentic Risk Auditing of Soft Prompt Attacks on ESM-Based Variant Predictors	Proposes the SAGE framework for assessing the security of genomic models such as ESM under soft prompt attacks.	manipulation, large language model, foundation model
