cs.AI(2025-06-14)

📊 共 5 篇论文

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (5)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (5 篇)

#题目一句话要点标签🔗
1 CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following 提出CMI-Bench以解决音乐指令跟随评估问题 large language model instruction following
2 The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries 提出全面研究以解决LLM库中的缺陷与测试实践问题 large language model
3 The Budget AI Researcher and the Power of RAG Chains 提出预算AI研究者以解决科研创意生成难题 large language model
4 QGuard:Question-based Zero-shot Guard for Multi-modal LLM Safety 提出QGuard以解决多模态LLM安全问题 large language model
5 The SWE-Bench Illusion: When State-of-the-Art LLMs Remember Instead of Reason 提出新诊断任务以揭示LLMs在编码能力评估中的记忆偏差 large language model

⬅️ 返回 cs.AI 首页 · 🏠 返回主页