cs.LG(2025-01-01)

📊 共 2 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (1) 支柱九:具身大模型 (Embodied Foundation Models) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)

#题目一句话要点标签🔗
1 Adjoint sharding for very long context training of state space models 提出 adjoint sharding 方法,解决超长上下文状态空间模型训练中的内存瓶颈问题。 state space model large language model

🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)

#题目一句话要点标签🔗
2 IGC: Integrating a Gated Calculator into an LLM to Solve Arithmetic Tasks Reliably and Efficiently 提出IGC:将门控计算器集成到LLM中,以可靠高效地解决算术任务 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页