cs.AI(2024-07-18)
📊 共 16 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (8 🔗1)
支柱九:具身大模型 (Embodied Foundation Models) (7 🔗1)
支柱六:视频提取与匹配 (Video Extraction) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (8 篇)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (7 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law | 将大模型训练视为数据压缩,探讨信息论、模型权重与版权法问题 | foundation model | ||
| 10 | Handling Numeric Expressions in Automatic Speech Recognition | 提出一种结合数据生成策略的端到端方法,用于自动语音识别中数值表达式的正确格式化。 | large language model TAMP | ||
| 11 | Generative AI Augmented Induction-based Formal Verification | 利用生成式AI增强基于归纳的硬件形式化验证,提升验证效率 | large language model | ||
| 12 | CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications | CellularLint:利用自然语言处理技术系统性识别蜂窝网络规范中的不一致性 | large language model | ||
| 13 | CoDefeater: Using LLMs To Find Defeaters in Assurance Cases | CoDefeater:利用大型语言模型自动发现保障案例中的反驳论证 | large language model | ||
| 14 | DISCOVER: A Data-driven Interactive System for Comprehensive Observation, Visualization, and ExploRation of Human Behaviour | DISCOVER:一个数据驱动的交互式系统,用于全面观察、可视化和探索人类行为 | multimodal | ||
| 15 | MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains | 提出MMAU:一个综合性的多领域Agent能力评估基准 | large language model | ✅ |
🔬 支柱六:视频提取与匹配 (Video Extraction) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 16 | Visuospatial navigation from the bottom-up: without vestibular integration, distance prediction, or maps | 提出一种无需前庭整合、距离预测或地图构建的自下而上视觉空间导航方法 | egocentric |