cs.LG(2025-04-29)
📊 共 17 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (9)
支柱二:RL算法与架构 (RL & Architecture) (7 🔗1)
支柱八:物理动画 (Physics-based Animation) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (9 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (7 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 10 | Reinforcement Learning for Reasoning in Large Language Models with One Training Example | 提出单样本强化学习与可验证奖励(1-shot RLVR),提升大语言模型数学推理能力。 | reinforcement learning PPO large language model | ✅ | |
| 11 | Toward Efficient Exploration by Large Language Model Agents | 提出基于LLM的后验采样强化学习方法,提升自然语言任务中的探索效率 | reinforcement learning large language model | ||
| 12 | Token-Efficient RL for LLM Reasoning | 提出Token高效强化学习方法,解决LLM推理中内存和计算资源限制问题 | reinforcement learning large language model | ||
| 13 | Q-Fusion: Diffusing Quantum Circuits | Q-Fusion:提出基于扩散模型的量子电路生成方法,解决量子架构搜索难题。 | reinforcement learning large language model | ||
| 14 | Quantum-Enhanced Hybrid Reinforcement Learning Framework for Dynamic Path Planning in Autonomous Systems | 提出量子增强混合强化学习框架,用于自主系统动态路径规划 | reinforcement learning | ||
| 15 | Representation Learning Preserving Ignorability and Covariate Matching for Treatment Effects | 提出一种新的表征学习方法,同时解决因果效应估计中的混淆偏差和协变量失配问题。 | representation learning | ||
| 16 | Group Relative Knowledge Distillation: Learning from Teacher's Relational Inductive Bias | 提出组相对知识蒸馏(GRKD),利用教师模型的相对关系归纳偏置提升学生模型泛化能力。 | distillation |
🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | Efficient LLMs with AMP: Attention Heads and MLP Pruning | 提出AMP:一种高效的LLM剪枝方法,用于加速推理并降低资源消耗 | AMP large language model |