cs.CL(2025-05-10)
📊 共 12 篇论文 | 🔗 4 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (8 🔗3)
支柱二:RL算法与架构 (RL & Architecture) (3 🔗1)
支柱六:视频提取与匹配 (Video Extraction) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (8 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free | 提出门控注意力机制,提升大语言模型非线性、稀疏性和长文本外推能力。 | state space model linear attention large language model | ✅ | |
| 10 | REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback | 提出REFINE-AF框架,通过自生成指令和强化学习对小型语言模型进行任务无关对齐。 | reinforcement learning large language model | ||
| 11 | xGen-small Technical Report | xGen-small:面向长文本应用的4B/9B Transformer解码器模型 | reinforcement learning preference learning |
🔬 支柱六:视频提取与匹配 (Video Extraction) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | Boosting Neural Language Inference via Cascaded Interactive Reasoning | 提出级联交互推理网络CIRN,通过多层级交互提升自然语言推理性能。 | feature matching |