cs.AI(2025-12-17)
📊 共 16 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (10 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (5)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Beyond Fast and Slow: Cognitive-Inspired Elastic Reasoning for Large Language Models | 提出CogER框架,通过认知启发的弹性推理提升大语言模型在不同难度问题上的效率与准确性。 | reinforcement learning large language model chain-of-thought | ||
| 12 | LADY: Linear Attention for Autonomous Driving Efficiency without Transformers | 提出LADY:一种基于线性注意力的高效自动驾驶模型,无需Transformer。 | linear attention spatiotemporal | ||
| 13 | Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning | 提出Stepwise Think-Critique框架,提升LLM推理能力和可解释性 | reinforcement learning large language model | ||
| 14 | Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision | Nemotron-Math:通过多模式监督,高效地进行数学推理长文本蒸馏。 | distillation | ||
| 15 | Graph Contextual Reinforcement Learning for Efficient Directed Controller Synthesis | 提出GCRL,利用图上下文强化学习高效合成有向控制器 | reinforcement learning |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 16 | QoS-Aware Hierarchical Reinforcement Learning for Joint Link Selection and Trajectory Optimization in SAGIN-Supported UAV Mobility Management | 提出基于QoS感知的分层强化学习方法,解决SAGIN支持的UAV移动性管理中的联合链路选择和轨迹优化问题。 | trajectory optimization reinforcement learning deep reinforcement learning |