| 1 |
Do Large Language Models Reason Causally Like Us? Even Better? |
评估大语言模型因果推理能力:部分模型超越人类,但仍有局限 |
large language model |
|
|
| 2 |
Decision Information Meets Large Language Models: The Future of Explainable Operations Research |
提出可解释的运筹学框架以解决决策透明性问题 |
large language model |
|
|
| 3 |
Has My System Prompt Been Used? Large Language Model Prompt Membership Inference |
Prompt Detective:基于输出分布差异的大语言模型提示词成员推断方法 |
large language model |
|
|
| 4 |
MuDoC: An Interactive Multimodal Document-grounded Conversational AI System |
提出MuDoC:一个交互式多模态文档对话AI系统,支持图文混合的文档内容理解与交互。 |
multimodal |
|
|
| 5 |
Automatic Evaluation Metrics for Artificially Generated Scientific Research |
提出基于引用预测和评审评分预测的自动评估指标,用于评估AI生成的科学研究 |
large language model foundation model |
|
|
| 6 |
Artificial Intelligence in Spectroscopy: Advancing Chemistry from Prediction to Generation and Beyond |
综述光谱机器学习:从预测到生成,推进化学领域AI应用 |
foundation model multimodal |
✅ |
|
| 7 |
AutoS$^2$earch: Unlocking the Reasoning Potential of Large Models for Web-based Source Search |
AutoS$^2$earch:利用大模型进行Web环境下的零样本源搜索 |
large language model chain-of-thought |
|
|
| 8 |
MEADOW: Memory-efficient Dataflow and Data Packing for Low Power Edge LLMs |
MEADOW:面向低功耗边缘LLM的内存高效数据流和数据打包 |
large language model |
|
|
| 9 |
GraphiT: Efficient Node Classification on Text-Attributed Graphs with Prompt Optimized LLMs |
GraphiT:利用提示优化LLM实现文本属性图上的高效节点分类 |
large language model |
|
|
| 10 |
LLM-Powered Preference Elicitation in Combinatorial Assignment |
提出基于LLM的偏好启发框架,提升组合分配中的资源分配效率 |
large language model |
|
|
| 11 |
MIR-Bench: Can Your LLM Recognize Complicated Patterns via Many-Shot In-Context Reasoning? |
MIR-Bench:提出多示例上下文推理基准,评估LLM在复杂模式识别中的能力 |
large language model |
|
|
| 12 |
MathConstruct: Challenging LLM Reasoning with Constructive Proofs |
MathConstruct:提出构造性证明数学基准,挑战LLM推理能力 |
large language model |
|
|
| 13 |
The Ann Arbor Architecture for Agent-Oriented Programming |
提出Ann Arbor架构,用于面向Agent的大语言模型编程,优化上下文学习。 |
large language model |
|
|
| 14 |
ArchRAG: Attributed Community-based Hierarchical Retrieval-Augmented Generation |
提出ArchRAG,利用属性社区分层检索增强生成,提升图数据问答准确率并降低token成本。 |
large language model |
|
|