| 1 |
Holmes: Automated Fact Check with Large Language Models |
提出Holmes框架以解决多模态虚假信息检测问题 |
large language model multimodal |
|
|
| 2 |
Can Large Language Models Predict Parallel Code Performance? |
利用大型语言模型预测并行代码性能以解决GPU性能评估问题 |
large language model |
|
|
| 3 |
LogiDebrief: A Signal-Temporal Logic based Automated Debriefing Approach with Large Language Models Integration |
提出LogiDebrief以解决911呼叫评估效率低下问题 |
large language model |
|
|
| 4 |
OSUniverse: Benchmark for Multimodal GUI-navigation AI Agents |
提出OSUniverse基准以评估多模态GUI导航AI代理的能力 |
multimodal |
✅ |
|
| 5 |
Validating the Effectiveness of a Large Language Model-based Approach for Identifying Children's Development across Various Free Play Settings in Kindergarten |
提出基于大型语言模型的方法以评估幼儿园儿童发展 |
large language model |
|
|
| 6 |
Synthline: A Product Line Approach for Synthetic Requirements Engineering Data Generation using Large Language Models |
提出Synthline以解决需求工程数据稀缺问题 |
large language model |
|
|
| 7 |
LlamaFirewall: An open source guardrail system for building secure AI agents |
提出LlamaFirewall以解决AI代理安全风险问题 |
large language model chain-of-thought |
|
|
| 8 |
AI-Driven Scholarly Peer Review via Persistent Workflow Prompting, Meta-Prompting, and Meta-Reasoning |
提出持久工作流提示以解决科学论文同行评审问题 |
large language model multimodal |
|
|
| 9 |
Capability-Driven Skill Generation with LLMs: A RAG-Based Approach for Reusing Existing Libraries and Interfaces |
提出基于RAG的能力驱动技能生成方法以提升自动化系统开发效率 |
large language model |
|
|
| 10 |
Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving |
提出Prism以解决多LLM服务中的GPU共享效率问题 |
large language model |
|
|
| 11 |
Binding threshold units with artificial oscillatory neurons |
提出人工振荡神经元耦合阈值单元以提升神经编码能力 |
large language model |
|
|
| 12 |
am-ELO: A Stable Framework for Arena-based LLM Evaluation |
提出am-ELO以解决ELO评分系统不稳定问题 |
large language model |
|
|
| 13 |
Procedural Memory Is Not All You Need: Bridging Cognitive Gaps in LLM-Based Agents |
提出语义记忆与联想学习系统以增强LLM智能 |
large language model |
|
|
| 14 |
Graph Drawing for LLMs: An Empirical Evaluation |
通过图形绘制提升LLM在图相关任务中的表现 |
large language model |
|
|
| 15 |
A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning |
提出一种Hashgraph启发的共识机制以解决多模型推理中的不一致问题 |
large language model |
|
|
| 16 |
STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game |
提出STORY2GAME以生成互动小说游戏的所有元素 |
large language model |
|
|
| 17 |
Domain Adversarial Training for Mitigating Gender Bias in Speech-based Mental Health Detection |
提出领域对抗训练以缓解语音心理健康检测中的性别偏见 |
foundation model |
|
|
| 18 |
RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation |
提出RAG-MCP以解决大型语言模型工具选择中的提示膨胀问题 |
large language model |
|
|
| 19 |
Accelerating Evolution: Integrating PSO Principles into Real-Coded Genetic Algorithm Crossover |
提出PSOX交叉操作以加速实数编码遗传算法的收敛 |
multimodal |
|
|
| 20 |
DocSpiral: A Platform for Integrated Assistive Document Annotation through Human-in-the-Spiral |
提出DocSpiral以解决图像文档结构化数据提取问题 |
large language model |
|
|
| 21 |
Patterns and Mechanisms of Contrastive Activation Engineering |
提出对比激活工程以优化大型语言模型输出控制 |
large language model |
|
|
| 22 |
Assessing and Enhancing the Robustness of LLM-based Multi-Agent Systems Through Chaos Engineering |
通过混沌工程提升LLM多智能体系统的鲁棒性 |
large language model |
|
|