cs.AI(2026-02-08)
📊 共 23 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (16 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (6)
支柱三:空间感知与语义 (Perception & Semantics) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (16 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | Graph-Enhanced Deep Reinforcement Learning for Multi-Objective Unrelated Parallel Machine Scheduling | 提出基于图增强深度强化学习的多目标不相关并行机调度方法 | reinforcement learning deep reinforcement learning PPO | ||
| 18 | Time Series Reasoning via Process-Verifiable Thinking Data Synthesis and Scheduling for Tailored LLM Reasoning | VeriTime:通过可验证过程的思维数据合成与调度,为时序推理定制LLM | reinforcement learning large language model multimodal | ||
| 19 | Generative Reasoning Re-ranker | 提出生成式推理重排序器(GR2),利用强化学习提升LLM在推荐系统中的重排序性能。 | reinforcement learning reward design large language model | ||
| 20 | Objective Decoupling in Social Reinforcement Learning: Recovering Ground Truth from Sycophantic Majorities | 提出Epistemic Source Alignment解决社交强化学习中因谄媚导致的客观目标解耦问题 | reinforcement learning | ||
| 21 | ToolSelf: Unifying Task Execution and Self-Reconfiguration via Tool-Driven Intrinsic Adaptation | ToolSelf:通过工具驱动的内在适应统一任务执行和自我重配置 | reinforcement learning large language model | ||
| 22 | SAGE: Scalable AI Governance & Evaluation | SAGE:可扩展的AI治理与评估框架,提升大规模搜索系统相关性。 | teacher-student distillation |
🔬 支柱三:空间感知与语义 (Perception & Semantics) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 23 | Transforming Science Learning Materials in the Era of Artificial Intelligence | AI赋能科学教育:革新学习材料设计,实现个性化、真实化与可访问性 | affordance multimodal |