cs.LG (2025-12-05)

📊 23 papers in total

🎯 Interest area navigation

Pillar 9: Embodied Foundation Models (14) · Pillar 2: RL & Architecture (6) · Pillar 1: Robot Control (1) · Pillar 3: Perception & Semantics (1) · Pillar 8: Physics-based Animation (1)

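🔬 Pillar 9: Embodied Foundation Models (14 papers)

#	Title	One-sentence summary	Tags	🔗	⭐
1	TS-HINT: Enhancing Semiconductor Time Series Regression Using Attention Hints From Large Language Model Reasoning	TS-HINT: uses reasoning hints from large language models to enhance semiconductor time-series regression.	large language model foundation model chain-of-thought
2	Taxonomy-Adaptive Moderation Model with Robust Guardrails for Large Language Models	Proposes Roblox Guard 1.0, a taxonomy-adaptive moderation model that strengthens input and output safety for LLM systems.	large language model chain-of-thought
3	Scaling and Transferability of Annealing Strategies in Large Language Model Training	Proposes a transferable framework for optimizing learning-rate annealing strategies to improve the training efficiency of large language models.	large language model
4	Poodle: Seamlessly Scaling Down Large Language Models with Just-in-Time Model Replacement	Poodle: just-in-time model replacement for seamlessly scaling down large language models.	large language model
5	The Forgotten Shield: Safety Grafting in Parameter-Space for Medical MLLMs	Proposes a parameter-space safety grafting method to improve the safety of medical multimodal large language models.	large language model multimodal
6	Physics-Informed Neural Koopman Machine for Interpretable Longitudinal Personalized Alzheimer's Disease Forecasting	Proposes the Neural Koopman Machine (NKM) for interpretable, longitudinal, personalized forecasting of Alzheimer's disease.	multimodal
7	MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution	Proposes the MaxShapley algorithm for incentive-compatible and fair context attribution in retrieval-augmented generative search.	large language model
8	Impugan: Learning Conditional Generative Models for Robust Data Imputation	Impugan: a conditional generative adversarial network model for robust data imputation.	multimodal
9	KQ-SVD: Compressing the KV Cache with Provable Guarantees on Attention Fidelity	KQ-SVD: compresses the KV cache through an optimized low-rank decomposition of the attention matrix, improving LLM inference efficiency.	large language model
10	Mitigating Catastrophic Forgetting in Mathematical Reasoning Finetuning through Mixed Training	Proposes a mixed training strategy to mitigate catastrophic forgetting in mathematical reasoning fine-tuning.	large language model
11	Bootstrapping Fuzzers for Compilers of Low-Resource Language Dialects Using Language Models	Germinator uses language models to automatically generate fuzzers for compilers of low-resource languages, improving testing efficiency.	large language model
12	Feasibility of AI-Assisted Programming for End-User Development	Explores the feasibility of AI-assisted programming in end-user development as a replacement for or complement to low-code/no-code platforms.	large language model
13	RevoNAD: Reflective Evolutionary Exploration for Neural Architecture Design	RevoNAD: a reflective evolutionary exploration method for neural architecture design that improves the reliability and deployability of architecture search.	large language model
14	When Forgetting Builds Reliability: LLM Unlearning for Reliable Hardware Code Generation	Proposes an LLM unlearning framework for hardware code generation to improve code reliability.	large language model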

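🔬 Pillar 2: RL & Architecture (6 papers)

#	Title	One-sentence summary	Tags	🔗	⭐
15	Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning	Proposes the Entropy Ratio Clipping (ERC) mechanism to stabilize reinforcement learning training and improve post-training of large language models.	reinforcement learning PPO large language model
16	Quantifying Memory Use in Reinforcement Learning with Temporal Range	Proposes the Temporal Range metric to quantify how much a reinforcement learning policy depends on memory of past observations.	reinforcement learning SSM
17	Average-reward reinforcement learning in semi-Markov decision processes via relative value iteration	Proposes an average-reward reinforcement learning algorithm for SMDPs based on relative value iteration and proves its convergence.	reinforcement learning
18	JaxWildfire: A GPU-Accelerated Wildfire Simulator for Reinforcement Learning	Proposes JaxWildfire, a GPU-accelerated wildfire simulator for reinforcement learning.	reinforcement learning
19	Utility Boundary of Dataset Distillation: Scaling and Configuration-Coverage Laws	Proposes a configuration-dynamics-error analysis framework that reveals scaling and configuration-coverage laws for dataset distillation.	distillation
20	FieldSeer I: Physics-Guided World Models for Long-Horizon Electromagnetic Dynamics under Partial Observability	FieldSeer I: a physics-guided world model for long-horizon electromagnetic dynamics prediction under partial observability.	world model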

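🔬 Pillar 1: Robot Control (1 paper)

#	Title	One-sentence summary	Tags	🔗	⭐
21	Hierarchical Task Offloading and Trajectory Optimization in Low-Altitude Intelligent Networks Via Auction and Diffusion-based MARL	Proposes a hierarchical task offloading and trajectory optimization scheme for low-altitude intelligent networks based on auctions and diffusion-based MARL.	trajectory optimization reinforcement learning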

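🔬 Pillar 3: Perception & Semantics (1 paper)

#	Title	One-sentence summary	Tags	🔗	⭐
22	Over-the-Air Semantic Alignment with Stacked Intelligent Metasurfaces	Proposes an over-the-air semantic alignment framework based on stacked intelligent metasurfaces to address misaligned latent representations across heterogeneous models.	semantic mapping semantic map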

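🔬 Pillar 8: Physics-based Animation (1 paper)

#	Title	One-sentence summary	Tags	🔗	⭐
23	Designing an Optimal Sensor Network via Minimizing Information Loss	Proposes a sensor network optimization method based on physical simulation and Bayesian experimental design that minimizes information loss.	spatiotemporal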
