cs.CL(2025-04-18)
📊 共 16 篇论文 | 🔗 3 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (10 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (5 🔗1)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models | 针对大语言模型,提出特征对齐与表征迁移的知识蒸馏方法 | distillation large language model | ||
| 12 | Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning | 提出过程预判推理,提升大语言模型在测试时的复杂推理能力 | reinforcement learning large language model | ✅ | |
| 13 | From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs | 提出三阶段端到端优化方法,实现高性价比超小型LLM部署 | reinforcement learning distillation large language model | ||
| 14 | Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning | 提出UDP框架,通过构建用户世界模型实现用户定制的对话策略规划 | world model | ||
| 15 | Improving Generalization in Intent Detection: GRPO with Reward-Based Curriculum Sampling | 提出基于奖励的课程采样的GRPO方法,提升意图检测的泛化性能 | reinforcement learning chain-of-thought |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 16 | Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models | 提出Thought Manipulation方法,通过外部CoT引导,提升大模型推理效率并降低计算成本。 | manipulation |