cs.LG(2025-08-12)

📊 共 25 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (13 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (10) 支柱八:物理动画 (Physics-based Animation) (2)

🔬 支柱二:RL算法与架构 (RL & Architecture) (13 篇)

#题目一句话要点标签🔗
1 $\text{M}^{2}$LLM: Multi-view Molecular Representation Learning with Large Language Models 提出M²LLM以解决分子属性预测的多视角问题 representation learning large language model
2 Bridging Formal Language with Chain-of-Thought Reasoning to Geometry Problem Solving 提出GF-Reasoner以解决几何问题求解中的推理不足 reinforcement learning chain-of-thought
3 Scaling Up Active Testing to Large Language Models 提出高效的主动测试方法以评估大型语言模型 predictive model large language model
4 Generative Modeling for Robust Deep Reinforcement Learning on the Traveling Salesman Problem 提出COGS以解决旅行商问题的分布鲁棒性挑战 reinforcement learning deep reinforcement learning
5 Distilling Reinforcement Learning into Single-Batch Datasets 提出强化学习蒸馏方法以生成单批次数据集 reinforcement learning distillation
6 Interpretable Reward Model via Sparse Autoencoder 提出稀疏自编码器增强的奖励模型以解决传统模型可解释性不足问题 reinforcement learning RLHF large language model
7 Multi-level Collaborative Distillation Meets Global Workspace Model: A Unified Framework for OCIL 提出多层协作蒸馏以解决在线增量学习中的稳定性与适应性问题 distillation
8 A Personalized Exercise Assistant using Reinforcement Learning (PEARL): Results from a four-arm Randomized-controlled Trial 提出个性化运动助手PEARL以解决身体活动不足问题 reinforcement learning
9 Pattern-based Knowledge Component Extraction from Student Code Using Representation Learning 提出基于模式的知识组件提取框架以解决编程教育中的自动化问题 representation learning
10 Constrained Black-Box Attacks Against Multi-Agent Reinforcement Learning 提出约束黑箱攻击方法以解决多智能体强化学习的脆弱性问题 reinforcement learning
11 PersRM-R1: Enhance Personalized Reward Modeling with Reinforcement Learning 提出PersRM-R1以解决个性化奖励建模中的数据稀缺问题 reinforcement learning
12 GRAVITY: A Controversial Graph Representation Learning for Vertex Classification 提出GRAVITY以解决图节点分类中的动态聚合问题 representation learning
13 MCLPD:Multi-view Contrastive Learning for EEG-based PD Detection Across Datasets 提出MCLPD以解决跨数据集的帕金森病检测问题 contrastive learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)

#题目一句话要点标签🔗
14 KnowDR-REC: A Benchmark for Referring Expression Comprehension with Real-World Knowledge 提出KnowDR-REC以解决多模态推理能力不足问题 large language model multimodal visual grounding
15 A Generative Imputation Method for Multimodal Alzheimer's Disease Diagnosis 提出生成填补方法以解决阿尔茨海默病多模态数据缺失问题 multimodal
16 Oblivionis: A Lightweight Learning and Unlearning Framework for Federated Large Language Models 提出Oblivionis框架以解决联邦大语言模型的遗忘问题 large language model
17 Resurrecting the Salmon: Rethinking Mechanistic Interpretability with Domain-Specific Sparse Autoencoders 提出领域特定稀疏自编码器以提升语言模型的可解释性 large language model foundation model
18 Teaching Code Refactoring Using LLMs 利用大型语言模型提升代码重构教学效果 large language model
19 xRFM: Accurate, scalable, and interpretable feature learning models for tabular data 提出xRFM以解决表格数据特征学习问题 foundation model
20 LLM Empowered Prototype Learning for Zero and Few-Shot Tasks on Tabular Data 提出基于LLM的原型学习框架以解决表格数据的零样本和少样本问题 large language model
21 Differentiated Information Mining: A Semi-supervised Learning Framework for GNNs 提出差异化因子一致性半监督框架以解决GNN伪标签偏差问题 multimodal
22 MiGrATe: Mixed-Policy GRPO for Adaptation at Test-Time 提出MiGrATe以解决黑箱优化任务中的适应性问题 large language model
23 Classifier Language Models: Unifying Sparse Finetuning and Adaptive Tokenization for Specialized Classification Tasks 提出稀疏微调与自适应标记化结合的方法以解决专业分类任务问题 large language model

🔬 支柱八:物理动画 (Physics-based Animation) (2 篇)

#题目一句话要点标签🔗
24 GSMT: Graph Fusion and Spatiotemporal TaskCorrection for Multi-Bus Trajectory Prediction 提出GSMT以解决城市公交轨迹预测问题 spatiotemporal multimodal
25 UQGNN: Uncertainty Quantification of Graph Neural Networks for Multivariate Spatiotemporal Prediction 提出UQGNN以解决多变量时空预测中的不确定性量化问题 spatiotemporal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页