cs.CV (2026-06-03)
📊 4 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
Pillar 2: RL Algorithms & Architecture (2)
Pillar 9: Embodied Foundation Models (1 🔗1)
Pillar 4: Generative Motion (1)
🔬 Pillar 2: RL Algorithms & Architecture (2 papers)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Continual Visual and Verbal Learning Through a Child's Egocentric Input | Proposes the BabyCL framework to address data-processing problems in children's language learning | representation learning, egocentric, multimodal | | |
| 2 | MusaCoder: Native GPU Kernel Generation with Full-Stack Training on Moore Threads GPU | Proposes MusaCoder to address inefficient GPU kernel generation | reinforcement learning, large language model | | |
🔬 Pillar 9: Embodied Foundation Models (1 paper)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | BreastGPT: A Multimodal Large Language Model for the Full Spectrum of Breast Cancer Clinical Routine | Proposes BreastGPT to address multimodal reasoning in breast cancer clinical management | large language model, multimodal, instruction following | ✅ | |
🔬 Pillar 4: Generative Motion (1 paper)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | NextMotionQA: Benchmarking and Judging Human Motion Understanding with Vision-Language Models | Proposes NextMotionQA to address the evaluation of human motion understanding | text-to-motion, human motion, embodied AI | | |