cs.LG(2025-09-10)
📊 共 4 篇论文
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (2)
支柱九:具身大模型 (Embodied Foundation Models) (1)
支柱八:物理动画 (Physics-based Animation) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Merge-of-Thought Distillation | 提出Merge-of-Thought Distillation,解决长链思维模型蒸馏中多教师冲突问题。 | distillation chain-of-thought | ||
| 2 | AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning | AgentGym-RL:通过多轮强化学习训练LLM智能体,解决长程决策问题 | reinforcement learning |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics | 利用视觉-语言模型进行高能物理中微子事件分类 | large language model multimodal |
🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | Energy-convergence trade off for the training of neural networks on bio-inspired hardware | 针对神经形态硬件,提出能量-收敛权衡方法,优化神经网络训练。 | PULSE |