cs.AI(2024-11-18)
📊 共 14 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment | 提出PSPO*框架,通过非线性奖励塑造提升LLM推理对齐效果 | reward shaping large language model chain-of-thought | ||
| 12 | Syllabus: Portable Curricula for Reinforcement Learning Agents | Syllabus:用于强化学习智能体的通用课程学习库 | reinforcement learning curriculum learning | ✅ | |
| 13 | Regret-Free Reinforcement Learning for LTL Specifications | 针对未知动态系统的LTL规范,提出无悔强化学习算法 | reinforcement learning | ||
| 14 | Hybrid Data-Driven SSM for Interpretable and Label-Free mmWave Channel Prediction | 提出混合数据驱动SSM,用于可解释的无标签毫米波信道预测 | SSM |