| 1 |
Advancing Multimodal Agent Reasoning with Long-Term Neuro-Symbolic Memory |
提出NS-Mem神经符号记忆框架,提升多模态Agent在复杂环境下的推理能力。 |
large language model multimodal |
|
|
| 2 |
VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining |
VTC-Bench:通过组合式视觉工具链评估Agentic多模态模型 |
large language model multimodal |
|
|
| 3 |
BrainBench: Exposing the Commonsense Reasoning Gap in Large Language Models |
BrainBench:揭示大型语言模型中常识推理的差距 |
large language model |
|
|
| 4 |
Brain-Inspired Graph Multi-Agent Systems for LLM Reasoning |
提出脑启发图多智能体系统BIGMAS,提升LLM复杂推理能力 |
large language model chain-of-thought |
|
|
| 5 |
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data |
OpenSeeker:通过完全开源训练数据,实现前沿搜索Agent的普及化。 |
large language model |
|
|
| 6 |
InterveneBench: Benchmarking LLMs for Intervention Reasoning and Causal Study Design in Real Social Systems |
InterveneBench:评估LLM在真实社会系统中干预推理和因果研究设计能力 |
large language model |
✅ |
|
| 7 |
Unlocking the Value of Text: Event-Driven Reasoning and Multi-Level Alignment for Time Series Forecasting |
提出VoT,利用事件驱动推理和多层次对齐,提升文本增强时间序列预测性能。 |
multimodal |
✅ |
|
| 8 |
SKILLS: Structured Knowledge Injection for LLM-Driven Telecommunications Operations |
SKILLS:通过结构化知识注入提升LLM在电信运营中的可靠性 |
large language model |
|
|
| 9 |
PMAx: An Agentic Framework for AI-Driven Process Mining |
PMAx:一个用于AI驱动的过程挖掘的Agentic框架,解决LLM直接应用于过程挖掘的局限性。 |
large language model |
|
|
| 10 |
Why the Valuable Capabilities of LLMs Are Precisely the Unexplainable Ones |
论证大语言模型最有价值的能力恰恰是那些无法解释的部分 |
large language model |
|
|
| 11 |
To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation |
PriCoder:通过数据合成提升LLM在私有库API代码生成中的能力 |
large language model |
✅ |
|
| 12 |
Why Agents Compromise Safety Under Pressure |
揭示Agentic Pressure:压力下大语言模型Agent的安全妥协现象 |
large language model |
|
|
| 13 |
$p^2$RAG: Privacy-Preserving RAG Service Supporting Arbitrary Top-$k$ Retrieval |
提出$p^2$RAG,一种支持任意Top-$k$检索的隐私保护RAG服务。 |
large language model |
|
|
| 14 |
Beyond Local Code Optimization: Multi-Agent Reasoning for Software System Optimization |
提出基于多智能体推理的软件系统优化框架,提升微服务性能。 |
large language model |
|
|