| 1 |
Placenta Accreta Spectrum Detection using Multimodal Deep Learning |
提出基于多模态深度学习的胎盘植入谱检测方法,提升诊断精度。 |
multimodal |
|
|
| 2 |
Islamic Chatbots in the Age of Large Language Models |
分析LLM驱动的伊斯兰聊天机器人对宗教实践的影响与挑战 |
large language model |
|
|
| 3 |
Developmental trajectories of decision making and affective dynamics in large language models |
通过赌博任务和情感评估,揭示大型语言模型决策和情感发展轨迹 |
large language model |
|
|
| 4 |
SynRAG: A Large Language Model Framework for Executable Query Generation in Heterogeneous SIEM System |
SynRAG:用于异构SIEM系统中可执行查询生成的大语言模型框架 |
large language model |
|
|
| 5 |
RAIR: A Rule-Aware Benchmark Uniting Challenging Long-Tail and Visual Salience Subset for E-commerce Relevance Assessment |
提出RAIR:一个面向电商相关性评估的规则感知、长尾和视觉显著性基准 |
large language model multimodal |
|
|
| 6 |
GenZ: Foundational models as latent variable generators within traditional statistical models |
GenZ:利用统计模型中的潜在变量生成器作为基础模型,弥合领域知识与数据集特定模式。 |
large language model multimodal |
|
|
| 7 |
LeanCat: A Benchmark Suite for Formal Category Theory in Lean (Part I: 1-Categories) |
提出LeanCat基准测试集,用于评估LLM在范畴论形式化证明中的能力。 |
large language model |
|
|
| 8 |
Constructing a Neuro-Symbolic Mathematician from First Principles |
提出Mathesis神经符号架构,解决大语言模型在复杂推理中缺乏公理框架的问题。 |
large language model |
|
|
| 9 |
Ask, Clarify, Optimize: Human-LLM Agent Collaboration for Smarter Inventory Control |
提出人机协同框架,利用LLM优化库存控制,降低企业成本。 |
large language model |
|
|
| 10 |
Mortar: Evolving Mechanics for Automatic Game Design |
Mortar:一种用于自动游戏设计的演化机制 |
large language model |
|
|
| 11 |
The Agentic Leash: Extracting Causal Feedback Fuzzy Cognitive Maps with LLMs |
提出Agentic Leash框架,利用LLM提取因果反馈模糊认知地图 |
large language model |
|
|
| 12 |
Vulcan: Instance-Optimal Systems Heuristics Through LLM-Driven Search |
Vulcan:通过LLM驱动搜索合成实例最优的系统启发式算法 |
large language model |
|
|
| 13 |
Context-aware LLM-based AI Agents for Human-centered Energy Management Systems in Smart Buildings |
提出基于LLM的智能建筑能源管理系统,实现情境感知和自然语言交互 |
large language model |
|
|
| 14 |
AMAP Agentic Planning Technical Report |
提出STAgent,一个用于时空理解的Agentic大语言模型,解决复杂POI发现和行程规划任务。 |
large language model |
|
|
| 15 |
Semi-Automated Data Annotation in Multisensor Datasets for Autonomous Vehicle Testing |
提出一种半自动标注流水线,加速自动驾驶多传感器数据标注 |
multimodal |
|
|
| 16 |
Enhancing Retrieval-Augmented Generation with Topic-Enriched Embeddings: A Hybrid Approach Integrating Traditional NLP Techniques |
提出主题增强嵌入,融合传统NLP技术,提升检索增强生成效果 |
large language model |
|
|
| 17 |
DynaFix: Iterative Automated Program Repair Driven by Execution-Level Dynamic Information |
DynaFix:一种执行级动态信息驱动的迭代式自动程序修复方法 |
large language model |
|
|
| 18 |
Chat-Driven Optimal Management for Virtual Network Services |
提出聊天驱动的网络管理框架,实现虚拟网络服务的优化管理 |
large language model |
|
|
| 19 |
Group Deliberation Oriented Multi-Agent Conversational Model for Complex Reasoning |
提出面向群体审议的多智能体对话模型,用于复杂推理任务 |
large language model |
|
|
| 20 |
Recursive Language Models |
提出递归语言模型(RLM),通过推理时扩展处理超长上下文,显著提升长文本任务性能。 |
large language model |
✅ |
|
| 21 |
MCPAgentBench: A Real-world Task Benchmark for Evaluating LLM Agent MCP Tool Use |
提出MCPAgentBench,用于评估LLM Agent在真实MCP工具使用中的能力。 |
large language model |
|
|
| 22 |
Localized Calibrated Uncertainty in Code Language Models |
提出局部校准不确定性方法,定位代码语言模型生成中的错误 |
large language model |
|
|