| 1 |
LMFusion: Adapting Pretrained Language Models for Multimodal Generation |
LMFusion:通过适配预训练语言模型实现多模态生成 |
large language model multimodal |
|
|
| 2 |
Progressive Multimodal Reasoning via Active Retrieval |
提出AR-MCTS框架,通过主动检索和蒙特卡洛树搜索提升多模态大语言模型的多步推理能力。 |
large language model multimodal |
|
|
| 3 |
Conceptual In-Context Learning and Chain of Concepts: Solving Complex Conceptual Problems Using Large Language Models |
提出概念内上下文学习与概念链,增强LLM解决复杂概念问题的能力 |
large language model chain-of-thought |
|
|
| 4 |
PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children |
PsyDraw:一种多智能体多模态系统,用于留守儿童的心理健康筛查。 |
large language model multimodal |
|
|
| 5 |
Adaptive Pruning for Large Language Models with Structural Importance Awareness |
提出结构感知自适应剪枝方法SAAP,用于压缩LLM并在资源受限设备上部署。 |
large language model |
|
|
| 6 |
A Comparative Study of DSPy Teleprompter Algorithms for Aligning Large Language Models Evaluation Metrics to Human Evaluation |
对比DSPy Teleprompter算法,优化LLM提示以对齐人类评估标准 |
large language model |
|
|
| 7 |
Confidence in the Reasoning of Large Language Models |
评估大语言模型推理置信度:定性分析与量化指标相结合 |
large language model |
|
|
| 8 |
Eliciting Causal Abilities in Large Language Models for Reasoning Tasks |
提出SCIE方法,通过诱导大语言模型的因果推理能力提升其在推理任务中的表现。 |
large language model |
|
|
| 9 |
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response |
提出RobustFT框架,解决大语言模型在噪声响应下的鲁棒微调问题 |
large language model |
|
|
| 10 |
Each Fake News is Fake in its Own Way: An Attribution Multi-Granularity Benchmark for Multimodal Fake News Detection |
构建多粒度属性基准数据集AMG,并提出多粒度线索对齐模型MGCM,用于多模态假新闻检测与溯源。 |
multimodal |
|
|
| 11 |
Analysis and Visualization of Linguistic Structures in Large Language Models: Neural Representations of Verb-Particle Constructions in BERT |
分析大型语言模型中动词-小品词结构的神经表征:以BERT为例 |
large language model |
|
|
| 12 |
ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study |
ORBIT:一种低成本的大语言模型领域自适应数据集构建方法 |
large language model |
✅ |
|
| 13 |
Why Do Large Language Models (LLMs) Struggle to Count Letters? |
研究揭示大语言模型在字母计数任务上的困难,并分析其与词频、复杂度的关系 |
large language model |
|
|
| 14 |
Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering |
提出一种融合命名实体识别和LLM嵌入的图卷积网络文档聚类方法 |
large language model |
|
|
| 15 |
ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis |
ResoFilter:通过数据-参数共振分析实现大语言模型精细化合成数据过滤 |
large language model |
|
|
| 16 |
A Large-Scale Simulation on Large Language Models for Decision-Making in Political Science |
提出基于大语言模型的多步推理框架,用于大规模模拟政治决策 |
large language model |
|
|
| 17 |
Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models |
提出长上下文大语言模型以解决滑动窗口策略的效率问题 |
large language model |
✅ |
|
| 18 |
Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs |
通过观察性分析,揭示了构建本地大型语言模型的必要性与策略。 |
large language model |
|
|
| 19 |
Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems |
针对LLM推荐系统,提出基于数据集特征的Prompt选择方法,提升推荐准确率和效率。 |
large language model |
|
|
| 20 |
Systematic Evaluation of Long-Context LLMs on Financial Concepts |
系统性评估长文本LLM在金融概念理解上的能力,揭示其在长上下文中的脆弱性 |
large language model instruction following |
|
|
| 21 |
Query pipeline optimization for cancer patient question answering systems |
针对癌症患者问答系统,提出RAG查询管道三方面优化方法,提升回答准确率。 |
large language model chain-of-thought |
|
|
| 22 |
Length Controlled Generation for Black-box LLMs |
提出基于Metropolis-Hastings算法的迭代采样框架,实现黑盒LLM的精确长度控制 |
large language model instruction following |
|
|
| 23 |
MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark |
提出MMLU-CF:一个无污染的多任务语言理解基准,用于更可靠地评估LLM。 |
large language model |
✅ |
|
| 24 |
Reasoning Through Execution: Unifying Process and Outcome Rewards for Code Generation |
提出Outcome Refining Process Supervision,统一过程和结果奖励,提升代码生成质量。 |
large language model |
✅ |
|
| 25 |
ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine |
ALKAFI-LLAMA3:微调LLM以实现巴勒斯坦法律的精准理解 |
large language model |
|
|
| 26 |
Language Models as Continuous Self-Evolving Data Engineers |
提出LANCE:一种基于LLM的持续自进化数据工程框架,提升模型性能。 |
large language model |
✅ |
|
| 27 |
Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation |
提出DePaC,通过解耦幻觉缓解RAG中并行上下文扩展的问题 |
large language model |
|
|
| 28 |
How good is GPT at writing political speeches for the White House? |
评估GPT在撰写白宫政治演讲稿方面的能力:对比GPT与美国总统的演讲风格 |
large language model |
|
|
| 29 |
All-in-One Tuning and Structural Pruning for Domain-Specific LLMs |
提出ATP:面向领域LLM的端到端调优与结构化剪枝方法 |
large language model |
|
|
| 30 |
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval |
SKETCH:融合结构化知识的文本理解方法,提升RAG系统检索性能 |
large language model |
|
|
| 31 |
Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation |
提出文学文本隐喻类比自动抽取方法,并构建数据集用于评估大型语言模型。 |
large language model |
|
|
| 32 |
Decade of Natural Language Processing in Chronic Pain: A Systematic Review |
综述:自然语言处理在慢性疼痛研究中的十年进展与未来方向 |
multimodal |
|
|
| 33 |
ConfliBERT: A Language Model for Political Conflict |
ConfliBERT:用于政治冲突事件抽取的专用语言模型,性能超越通用LLM。 |
large language model |
|
|
| 34 |
LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Inconsistencies |
M-ALERT揭示LLM多语言安全漏洞,发现跨语言安全一致性问题 |
large language model |
|
|
| 35 |
Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts |
提出Chain-of-MetaWriting方法,分析小型语言模型在辅助青少年写作中的表现与局限 |
large language model |
|
|
| 36 |
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling |
提出Think&Cite框架,通过自引导树搜索和进度奖励建模提升属性文本生成的事实准确性。 |
large language model |
|
|
| 37 |
ViFactCheck: A New Benchmark Dataset and Methods for Multi-domain News Fact-Checking in Vietnamese |
ViFactCheck:提出越南语多领域新闻事实核查基准数据集与方法 |
large language model |
|
|
| 38 |
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning |
提出RFT方法,通过解耦推理和样板 tokens,提升LLM在agent任务上的微调效果。 |
large language model |
|
|
| 39 |
On Verbalized Confidence Scores for LLMs |
提出一种提示工程方法,使LLM能够输出校准良好的置信度评分,用于不确定性量化。 |
large language model |
✅ |
|
| 40 |
Beyond Guilt: Legal Judgment Prediction with Trichotomous Reasoning |
提出LJPIV基准数据集,增强法律LLM的三分式推理能力,提升无罪判决预测准确性。 |
large language model |
|
|
| 41 |
Agent-SafetyBench: Evaluating the Safety of LLM Agents |
Agent-SafetyBench:构建LLM Agent安全评估基准,揭示现有Agent安全风险 |
large language model |
✅ |
|
| 42 |
To Err Is Human; To Annotate, SILICON? Reducing Measurement Error in LLM Annotation |
提出SILICON方法,系统性降低LLM文本标注中的测量误差,提升管理研究的标注质量和可复现性。 |
large language model |
|
|