cs.LG (2025-05-18)
📊 2 papers total | 🔗 1 with code
🎯 Interest Area Navigation
🔬 Pillar 2: RL Algorithms & Architecture (1 paper)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Observe-R1: Unlocking Reasoning Abilities of MLLMs with Dynamic Progressive Reinforcement Learning | Observe-R1 improves the reasoning abilities of multimodal large language models through dynamic progressive reinforcement learning. | reinforcement learning, large language model, multimodal | ✅ | |
🔬 Pillar 9: Embodied Foundation Models (1 paper)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 2 | STAR: Stage-Wise Attention-Guided Token Reduction for Efficient Large Vision-Language Models Inference | Proposes STAR, a stage-wise attention-guided token reduction method for efficient inference in large vision-language models. | multimodal | | |