cs.AI(2025-09-10)
📊 共 5 篇论文
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | PianoVAM: A Multimodal Piano Performance Dataset | PianoVAM:一个包含视频、音频、MIDI、手部关键点等多模态钢琴演奏数据集 | multimodal | ||
| 2 | GAUSS: Benchmarking Structured Mathematical Skills for Large Language Models | GAUSS:构建结构化数学能力基准,评估大语言模型 | large language model | ||
| 3 | Scaling Truth: The Confidence Paradox in AI Fact-Checking | 提出多语言基准以解决AI事实核查中的信心悖论问题 | large language model | ||
| 4 | Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations | 构建弹性LLM Agent:安全Plan-then-Execute实现指南 | large language model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 5 | HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants | 提出HumanAgencyBench,用于评估AI助手对人类自主性的支持程度 | manipulation RLHF large language model |