| 1 |
Spilled Energy in Large Language Models |
提出基于能量模型的LLM推理方法,无需训练即可检测幻觉。 |
large language model |
|
|
| 2 |
Beyond Description: A Multimodal Agent Framework for Insightful Chart Summarization |
提出Chart Insight Agent Flow框架,提升多模态大语言模型在图表总结中洞察力提取能力 |
large language model multimodal |
|
|
| 3 |
Adaptive Collaboration of Arena-Based Argumentative LLMs for Explainable and Contestable Legal Reasoning |
提出ACAL框架,结合论辩LLM与人机交互,提升法律推理的可解释性和可辩论性 |
large language model chain-of-thought |
✅ |
|
| 4 |
Early Evidence of Vibe-Proving with Consumer LLMs: A Case Study on Spectral Region Characterization with ChatGPT-5.2 (Thinking) |
利用消费级LLM验证数学猜想:ChatGPT-5.2在谱区域刻画中的案例研究 |
large language model |
|
|
| 5 |
Operational Robustness of LLMs on Code Generation |
提出场景域分析方法以评估LLMs在代码生成中的鲁棒性 |
large language model |
|
|
| 6 |
Many AI Analysts, One Dataset: Navigating the Agentic Data Science Multiverse |
利用AI分析师群体解决数据分析结果依赖分析决策的问题 |
large language model |
|
|
| 7 |
Give Users the Wheel: Towards Promptable Recommendation Paradigm |
提出解耦可提示序列推荐(DPR)框架,利用自然语言提示动态引导推荐过程。 |
large language model |
|
|
| 8 |
Orchestrating LLM Agents for Scientific Research: A Pilot Study of Multiple Choice Question (MCQ) Generation and Evaluation |
探索LLM智能体在科学研究中的应用:以多项选择题生成与评估为例 |
large language model |
|
|