| 1 |
Inference-Cost-Aware Dynamic Tree Construction for Efficient Inference in Large Language Models |
提出CAST:一种推理成本感知的动态树构建方法,提升大语言模型推理效率 |
large language model |
|
|
| 2 |
The Structure of Relation Decoding Linear Operators in Large Language Models |
揭示大语言模型关系解码线性算子的结构,发现其主要基于语义属性而非特定关系。 |
large language model |
|
|
| 3 |
Evontree: Ontology Rule-Guided Self-Evolution of Large Language Models |
Evontree:利用本体规则引导大语言模型自进化,提升领域知识 |
large language model |
|
|
| 4 |
Bayesian Network Fusion of Large Language Models for Sentiment Analysis |
提出基于贝叶斯网络的大语言模型融合框架,提升情感分析性能。 |
large language model |
|
|
| 5 |
Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model |
重新审视Encoder-Decoder大语言模型,探索其在效率和性能上的潜力 |
large language model |
|
|
| 6 |
A Multi-agent Large Language Model Framework to Automatically Assess Performance of a Clinical AI Triage Tool |
提出多Agent LLM框架,自动评估临床AI分诊工具的性能 |
large language model |
|
|
| 7 |
1+1>2: A Synergistic Sparse and Low-Rank Compression Method for Large Language Models |
提出协同稀疏与低秩压缩方法SSLC,高效压缩大型语言模型。 |
large language model |
|
|
| 8 |
OmniEduBench: A Comprehensive Chinese Benchmark for Evaluating Large Language Models in Education |
提出OmniEduBench,用于全面评估中文教育领域大语言模型能力 |
large language model |
|
|
| 9 |
RCScore: Quantifying Response Consistency in Large Language Models |
RCScore:量化大语言模型对指令形式的响应一致性,评估模型鲁棒性 |
large language model |
|
|
| 10 |
MisSynth: Improving MISSCI Logical Fallacies Classification with Synthetic Data |
MisSynth:利用合成数据提升MISSCI谬误逻辑分类性能 |
large language model |
✅ |
|
| 11 |
Unravelling the Mechanisms of Manipulating Numbers in Language Models |
揭示语言模型中数字处理机制,探究其误差根源与精度下限 |
large language model |
|
|
| 12 |
PVMark: Enabling Public Verifiability for LLM Watermarking Schemes |
PVMark:一种支持LLM水印方案公开可验证性的框架 |
large language model |
|
|
| 13 |
Detecting Data Contamination in LLMs via In-Context Learning |
提出CoDeC,通过上下文学习检测LLM中的数据污染 |
large language model |
|
|
| 14 |
Chopping Trees: Semantic Similarity Based Dynamic Pruning for Tree-of-Thought Reasoning |
提出基于语义相似性的动态剪枝方法,加速思维树推理。 |
large language model |
✅ |
|
| 15 |
The Geometry of Dialogue: Graphing Language Models to Reveal Synergistic Teams for Multi-Agent Collaboration |
提出基于语言模型图的协同团队构建方法,解决多智能体LLM协作中的团队优化问题。 |
large language model |
|
|
| 16 |
Towards Global Retrieval Augmented Generation: A Benchmark for Corpus-Level Reasoning |
提出GlobalRAG框架,解决现有RAG方法在语料库级别推理任务中的不足。 |
large language model |
|
|
| 17 |
VISTA Score: Verification In Sequential Turn-based Assessment |
VISTA:提出一种用于评估对话系统中事实性幻觉的序列轮次验证框架 |
large language model |
|
|
| 18 |
Kad: A Framework for Proxy-based Test-time Alignment with Knapsack Approximation Deferral |
Kad框架:基于代理模型的测试时对齐,利用背包近似延迟解决LLM对齐计算成本高昂问题。 |
large language model |
|
|
| 19 |
Semantically-Aware LLM Agent to Enhance Privacy in Conversational AI Services |
提出LOPSIDED框架,增强会话AI中LLM的隐私保护能力 |
large language model |
|
|
| 20 |
SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding |
提出SlideAgent,用于多页视觉文档理解的分层Agent框架。 |
large language model |
|
|
| 21 |
On the Role of Context for Discourse Relation Classification in Scientific Writing |
研究科学写作中篇章关系分类任务,探讨上下文信息对提升性能的作用 |
large language model |
|
|
| 22 |
Do LLMs Signal When They're Right? Evidence from Neuron Agreement |
提出神经元一致性解码(NAD),利用LLM内部神经元信号提升无标签集成解码效果。 |
large language model |
|
|
| 23 |
Language Models Are Borrowing-Blind: A Multilingual Evaluation of Loanword Identification across 10 Languages |
大型语言模型在跨语种外来语识别任务中表现不佳,揭示其“借用盲区”问题 |
large language model |
|
|
| 24 |
Pragmatic Theories Enhance Understanding of Implied Meanings in LLMs |
利用语用理论提示提升LLM对隐含意义的理解能力 |
chain-of-thought |
|
|
| 25 |
Similarity-Distance-Magnitude Language Models |
提出基于相似度-距离-幅度(SDM)激活的语言模型,提升指令跟随任务的统计效率。 |
instruction following |
|
|
| 26 |
On the Influence of Discourse Relations in Persuasive Texts |
利用大型语言模型分析说服文本中论述关系对说服技巧的影响 |
large language model |
|
|
| 27 |
QCoder Benchmark: Bridging Language Generation and Quantum Hardware through Simulator-Based Feedback |
提出QCoder Benchmark,通过模拟器反馈评估LLM在量子编程中的代码生成能力 |
large language model |
|
|