| 1 |
DiM³: Bridging Multilingual and Multimodal Models via Direction- and Magnitude-Aware Merging |
Proposes DiM³, a direction- and magnitude-aware merging method that bridges multilingual and multimodal models. |
large language model multimodal |
✅ |
|
| 2 |
Utility-Oriented Visual Evidence Selection for Multimodal Retrieval-Augmented Generation |
Proposes a utility-oriented visual evidence selection method that improves multimodal retrieval-augmented generation. |
multimodal |
|
|
| 3 |
Continual Learning with Multilingual Foundation Model |
Proposes a multi-stage framework that uses a multilingual foundation model to detect reclaimed uses of LGBTQ+-related slurs on social media. |
foundation model |
✅ |
|
| 4 |
Query-Conditioned Test-Time Self-Training for Large Language Models |
Proposes QueST, a query-conditioned self-training method that improves test-time adaptation of large language models on reasoning tasks. |
large language model |
|
|
| 5 |
FIND: Toward Multimodal Financial Reasoning and Question Answering for Indic Languages |
Proposes the FIND framework for multimodal financial reasoning and question answering in Indic languages. |
multimodal |
|
|
| 6 |
RAG-Enhanced Large Language Models for Dynamic Content Expiration Prediction in Web Search |
Proposes a RAG-enhanced large language model for predicting dynamic content expiration in web search. |
large language model |
|
|
| 7 |
Leveraging Multimodal Self-Consistency Reasoning in Coding Motivational Interviewing for Alcohol Use Reduction |
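Proposes an automated method for coding alcohol-use interventions, based on multimodal self-consistency reasoning, that improves coding robustness. |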
multimodal |
|
|
| 8 |
CommonWhy: A Dataset for Evaluating Entity-Based Causal Commonsense Reasoning in Large Language Models |
Proposes the CommonWhy dataset for evaluating entity-based causal commonsense reasoning in large language models. |
large language model |
|
|
| 9 |
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence |
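Proposes the CiteVQA benchmark for evaluating the trustworthiness of evidence attribution in document intelligence, revealing an "attribution hallucination" problem in existing models. |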
large language model multimodal |
✅ |
|
| 10 |
Many-Shot CoT-ICL: Making In-Context Learning Truly Learn |
Proposes Curvilinear Demonstration Selection (CDS), a method that improves CoT-ICL performance on reasoning tasks. |
large language model chain-of-thought |
|
|
| 11 |
GeoBuildBench: A Benchmark for Interactive and Executable Geometry Construction from Natural Language |
Proposes GeoBuildBench, a natural-language benchmark for interactive geometry construction. |
large language model multimodal |
|
|
| 12 |
Children's English Reading Story Generation via Supervised Fine-Tuning of Compact LLMs with Controllable Difficulty and Safety |
Uses supervised fine-tuning of compact LLMs to generate children's English reading stories with controllable difficulty and safety. |
large language model |
|
|
| 13 |
LLMs as Implicit Imputers: Uncertainty Should Scale with Missing Information |
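Treats LLMs as implicit imputers and proposes the evaluation criterion that uncertainty should increase with the amount of missing information. |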
large language model |
|
|
| 14 |
WARDEN: Endangered Indigenous Language Transcription and Translation with 6 Hours of Training Data |
Proposes WARDEN for the transcription and translation of endangered Indigenous languages. |
large language model |
|
|
| 15 |
LLMs as annotators of credibility assessment in Danish asylum decisions: evaluating classification performance and errors beyond aggregated metrics |
Uses LLMs to annotate credibility assessments in Danish asylum decisions, evaluating classification performance and errors. |
large language model |
✅ |
|
| 16 |
Probing Persona-Dependent Preferences in Language Models |
Uses linear probes to reveal persona-dependent preference mechanisms in language models. |
large language model |
|
|
| 17 |
Tracing Persona Vectors Through LLM Pretraining |
Traces persona vectors through LLM pretraining, revealing their early formation and continued refinement. |
large language model |
|
|
| 18 |
Context Training with Active Information Seeking |
Proposes a context training method based on active information seeking that improves LLM performance on domain-specific tasks. |
large language model |
|
|
| 19 |
Persona-Model Collapse in Emergent Misalignment |
Reveals persona-model collapse in emergent misalignment and proposes sensitive diagnostic metrics. |
large language model |
|
|
| 20 |
An LLM-Based System for Argument Reconstruction |
Proposes an end-to-end LLM-based system for reconstructing argument structure from natural-language text. |
large language model |
|
|
| 21 |
Edit-level Majority Voting Mitigates Over-Correction in LLM-based Grammatical Error Correction |
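Proposes an inference method based on edit-level majority voting that mitigates over-correction in LLM-based grammatical error correction. |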
large language model |
|
|
| 22 |
Inducing Artificial Uncertainty in Language Models |
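Proposes a method for artificially inducing uncertainty that improves uncertainty quantification for language models in safety-critical applications. |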
large language model |
|
|
| 23 |
PersonalAI 2.0: Enhancing knowledge graph traversal/retrieval with planning mechanism for Personalized LLM Agents |
PersonalAI 2.0: enhances knowledge graph traversal/retrieval with a planning mechanism for personalized LLM agents. |
large language model |
|
|
| 24 |
Model-Agnostic Lifelong LLM Safety via Externalized Attack-Defense Co-Evolution |
EvoSafety: achieves model-agnostic lifelong LLM safety through externalized attack-defense co-evolution. |
large language model |
|
|
| 25 |
From Rosetta to Match-Up: A Paired Corpus of Linguistic Puzzles with Human and LLM Benchmarks |
Proposes a systematic method for converting Rosetta Stone puzzles into the Match-Up format. |
large language model |
|
|
| 26 |
The Cost of Perfect English: Pragmatic Flattening and the Erasure of Authorial Voice in L2 Writing Supported by GenAI |
Reveals pragmatic flattening and the erasure of authorial voice in L2 writing polished with GenAI. |
large language model |
|
|