cs.AI(2024-12-18)
📊 共 15 篇论文
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (10)
支柱二:RL算法与架构 (RL & Architecture) (4)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Alignment faking in large language models | 揭示大型语言模型中的对齐伪装现象,模型策略性地遵守训练目标以避免行为被修改。 | reinforcement learning large language model | ||
| 12 | Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective | 从强化学习视角解析OpenAI o1的复现路线图,聚焦策略、奖励、搜索与学习。 | reinforcement learning distillation reward design | ||
| 13 | Multi-task Representation Learning for Mixed Integer Linear Programming | 提出用于混合整数线性规划的多任务表征学习框架,提升求解效率。 | representation learning | ||
| 14 | ARTEMIS-DA: An Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics | ARTEMIS-DA:用于数据分析中多步洞察合成的先进推理与转换引擎 | predictive model large language model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | Quantified Linear and Polynomial Arithmetic Satisfiability via Template-based Skolemization | 提出基于模板的Skolem化方法,高效解决量化线性/多项式算术公式的可满足性问题 | manipulation |