cs.AI(2025-12-07)
📊 共 24 篇论文 | 🔗 4 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (17 🔗3)
支柱二:RL算法与架构 (RL & Architecture) (6 🔗1)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (17 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 18 | JT-DA: Enhancing Data Analysis with Tool-Integrated Table Reasoning Large Language Models | JT-DA:通过工具集成表格推理大语言模型增强数据分析能力 | reinforcement learning large language model foundation model | ||
| 19 | Decouple to Generalize: Context-First Self-Evolving Learning for Data-Scarce Vision-Language Reasoning | 提出DoGe框架,通过解耦学习上下文和问题求解,提升数据稀缺场景下VLM的泛化能力 | reinforcement learning curriculum learning multimodal | ||
| 20 | On Memory: A comparison of memory mechanisms in world models | 研究Transformer世界模型中的记忆机制,提升长时规划能力 | world model | ||
| 21 | Predictive Modeling of I/O Performance for Machine Learning Training Pipelines: A Data-Driven Approach to Storage Optimization | 提出基于机器学习的I/O性能预测模型,优化机器学习训练pipeline的存储配置。 | predictive model | ✅ | |
| 22 | Towards Small Language Models for Security Query Generation in SOC Workflows | 提出面向安全运营中心工作流的小型语言模型KQL查询生成方法,降低查询成本。 | distillation chain-of-thought | ||
| 23 | LightSearcher: Efficient DeepSearch via Experiential Memory | LightSearcher:通过经验记忆实现高效的深度搜索 | reinforcement learning reward shaping |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 24 | PrivLLMSwarm: Privacy-Preserving LLM-Driven UAV Swarms for Secure IoT Surveillance | PrivLLMSwarm:面向安全物联网监控的隐私保护LLM驱动无人机群 | MPC reinforcement learning large language model |