cs.LG（2026-02-17）

📊 共 17 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (10) 支柱二：RL算法与架构 (RL & Architecture) (4 🔗1) 支柱八：物理动画 (Physics-based Animation) (2) 支柱一：机器人控制 (Robot Control) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (10 篇)

#	题目	一句话要点	标签	🔗	⭐
1	On the Out-of-Distribution Generalization of Reasoning in Multimodal LLMs for Simple Visual Planning Tasks	评估多模态LLM在简单视觉规划任务中的推理泛化能力	large language model multimodal chain-of-thought
2	MRC-GAT: A Meta-Relational Copula-Based Graph Attention Network for Interpretable Multimodal Alzheimer's Disease Diagnosis	提出基于Meta关系Copula图注意力网络(MRC-GAT)，用于可解释的多模态阿尔茨海默病诊断。	multimodal
3	ER-MIA: Black-Box Adversarial Memory Injection Attacks on Long-Term Memory-Augmented Large Language Models	提出ER-MIA框架，针对长期记忆增强的大语言模型进行黑盒对抗性记忆注入攻击。	large language model
4	Discovering Implicit Large Language Model Alignment Objectives	提出Obj-Disco框架以解决LLM对齐目标不明确问题	large language model
5	Operationalising the Superficial Alignment Hypothesis via Task Complexity	通过任务复杂度量化，揭示大语言模型中的表层对齐假设	large language model instruction following
6	Neural Scaling Laws for Boosted Jet Tagging	研究喷注标记任务的神经标度律，揭示算力、数据与性能间的关系。	large language model foundation model
7	CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing	提出CrispEdit以解决大语言模型编辑中的能力保持问题	large language model
8	LLM-as-Judge on a Budget	提出基于多臂赌博机理论的LLM评估方法以优化查询分配	large language model
9	Prescriptive Scaling Reveals the Evolution of Language Model Capabilities	提出Prescriptive Scaling方法，揭示语言模型能力随算力演进规律，并评估其稳定性。	foundation model
10	On Surprising Effectiveness of Masking Updates in Adaptive Optimizers	提出Magma，通过掩码更新优化LLM训练，显著提升性能。	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (4 篇)

#	题目	一句话要点	标签	🔗	⭐
11	Latency-aware Human-in-the-Loop Reinforcement Learning for Semantic Communications	提出时延感知的人在环强化学习框架，用于语义通信中的资源调度。	reinforcement learning PPO reward shaping
12	Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning	提出可行性引导探索(FGE)算法，解决参数鲁棒避障问题中未知可行域的挑战。	reinforcement learning deep reinforcement learning
13	CDRL: A Reinforcement Learning Framework Inspired by Cerebellar Circuits and Dendritic Computational Strategies	提出CDRL：一种受小脑电路和树突计算策略启发的强化学习框架，提升样本效率和泛化能力。	reinforcement learning representation learning
14	GLM-5: from Vibe Coding to Agentic Engineering	GLM-5：从Vibe Coding到Agentic Engineering的范式转变，提升软件工程能力	reinforcement learning foundation model	✅

🔬 支柱八：物理动画 (Physics-based Animation) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
15	Guided Diffusion by Optimized Loss Functions on Relaxed Parameters for Inverse Material Design	提出基于优化损失函数和松弛参数的引导扩散方法，用于逆向材料设计。	differentiable simulation
16	Neural-POD: A Plug-and-Play Neural Operator Framework for Infinite-Dimensional Functional Nonlinear Proper Orthogonal Decomposition	提出Neural-POD，一种即插即用的神经算子框架，用于无限维函数非线性本征正交分解。	spatiotemporal

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
17	The Information Geometry of Softmax: Probing and Steering	利用信息几何探究Softmax表征，提出双重引导方法实现概念操控	manipulation

⬅️ 返回 cs.LG 首页 · 🏠 返回主页