cs.CL(2026-03-23)
📊 共 10 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (6 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 7 | Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch | 提出DSKD-CMA-GA,通过生成对抗学习解决LLM蒸馏中词表不匹配问题。 | distillation large language model | ||
| 8 | DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation | DRTriton:利用大规模合成数据强化学习生成Triton内核,显著提升CUDA内核效率。 | reinforcement learning large language model | ||
| 9 | TAMTRL: Teacher-Aligned Reward Reshaping for Multi-Turn Reinforcement Learning in Long-Context Compression | 提出TAMTRL,通过教师对齐奖励重塑解决长文本压缩中的多轮强化学习问题。 | reinforcement learning large language model | ||
| 10 | Gumbel Distillation for Parallel Text Generation | 提出Gumbel蒸馏,提升并行文本生成模型质量,缩小与自回归模型的差距 | distillation | ✅ |