cs.AI(2026-04-22)
📊 共 17 篇论文 | 🔗 3 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (12 🔗3)
支柱二:RL算法与架构 (RL & Architecture) (4)
支柱八:物理动画 (Physics-based Animation) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (12 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | V-tableR1: Process-Supervised Multimodal Table Reasoning with Critic-Guided Policy Optimization | V-tableR1:提出流程监督的多模态表格推理框架,通过评论家引导的策略优化实现可验证的推理。 | reinforcement learning large language model multimodal | ||
| 14 | Self-Guided Plan Extraction for Instruction-Following Tasks with Goal-Conditional Reinforcement Learning | SuperIgor:基于自学习计划提取的指令跟随任务框架 | reinforcement learning instruction following | ||
| 15 | HiPO: Hierarchical Preference Optimization for Adaptive Reasoning in LLMs | HiPO:分层偏好优化提升LLM在复杂推理任务中的自适应推理能力 | preference learning DPO direct preference optimization | ||
| 16 | Vibrotactile Preference Learning: Uncertainty-Aware Preference Learning for Personalized Vibration Feedback | 提出VPL:基于高斯过程不确定性感知偏好学习的个性化振动反馈系统 | preference learning |
🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | AI models of unstable flow exhibit hallucination | 提出DeepFingers框架,解决AI流体模型中不稳定性流动模拟的幻觉问题 | spatiotemporal large language model |