| 1 |
Benchmarking Multimodal LLMs on Recognition and Understanding over Chemical Tables |
提出ChemTable基准以解决化学表格理解与识别问题 |
large language model multimodal |
|
|
| 2 |
MRI-CORE: A Foundation Model for Magnetic Resonance Imaging |
提出MRI-CORE以解决MRI数据标注不足问题 |
foundation model |
|
|
| 3 |
DIVER-0 : A Fully Channel Equivariant EEG Foundation Model |
提出DIVER-0以解决EEG模型通道等变性不足问题 |
foundation model |
|
|
| 4 |
Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study |
提出基于大语言模型的路由器多智能体架构以自动化基础设计计算 |
large language model |
|
|
| 5 |
Dr. GPT Will See You Now, but Should It? Exploring the Benefits and Harms of Large Language Models in Medical Diagnosis using Crowdsourced Clinical Cases |
提出众包评估方法以解决大语言模型在医疗诊断中的有效性问题 |
large language model |
|
|
| 6 |
FAA Framework: A Large Language Model-Based Approach for Credit Card Fraud Investigations |
提出FAA框架以解决信用卡欺诈调查中的分析师疲劳问题 |
large language model |
|
|
| 7 |
Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning |
提出基于大语言模型的对话代理以解决家庭照顾者心理健康问题 |
large language model |
|
|
| 8 |
Cloud Infrastructure Management in the Age of AI Agents |
提出基于AI代理的云基础设施管理自动化解决方案 |
large language model |
|
|
| 9 |
SSLAM: Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes |
提出SSLAM以解决多音频场景下自监督模型的性能不足问题 |
large language model |
|
|
| 10 |
LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming? |
提出LiveCodeBench Pro以评估LLMs在竞赛编程中的表现 |
large language model |
|
|
| 11 |
On the Performance of LLMs for Real Estate Appraisal |
利用大语言模型提升房地产评估的透明度与可解释性 |
large language model |
|
|
| 12 |
Because we have LLMs, we Can and Should Pursue Agentic Interpretability |
提出代理可解释性以提升人类对LLM的理解 |
large language model |
|
|
| 13 |
Tracing LLM Reasoning Processes with Strategic Games: A Framework for Planning, Revision, and Resource-Constrained Decision Making |
提出战略游戏框架以评估LLM推理过程 |
large language model |
|
|
| 14 |
Addressing Bias in LLMs: Strategies and Application to Fair AI-based Recruitment |
提出隐私增强框架以解决LLMs中的性别偏见问题 |
large language model |
|
|
| 15 |
Revealing Political Bias in LLMs through Structured Multi-Agent Debate |
通过结构化多智能体辩论揭示大型语言模型的政治偏见 |
large language model |
|
|
| 16 |
Semantic Preprocessing for LLM-based Malware Analysis |
提出基于专家知识的预处理方法以提升恶意软件分析 |
large language model |
|
|
| 17 |
LLMs on support of privacy and security of mobile apps: state of the art and research directions |
利用大型语言模型提升移动应用的隐私与安全性 |
large language model |
|
|
| 18 |
GraphRAG-Causal: A novel graph-augmented framework for causal reasoning and annotation in news |
提出GraphRAG-Causal框架以增强新闻因果推理能力 |
large language model |
|
|
| 19 |
Efficient LLM Collaboration via Planning |
提出COPE框架以实现小大模型高效协作 |
large language model |
|
|
| 20 |
Identifying Helpful Context for LLM-based Vulnerability Repair: A Preliminary Study |
探讨GPT-4o在Java漏洞修复中的上下文影响 |
large language model |
|
|
| 21 |
Leveraging GPT-4 for Vulnerability-Witnessing Unit Test Generation |
利用GPT-4生成漏洞见证单元测试以提升软件安全性 |
large language model |
|
|
| 22 |
DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents |
提出DRIFT以解决大语言模型代理系统的安全性问题 |
large language model |
✅ |
|