| 1 |
Review GIDE -- Restaurant Review Gastrointestinal Illness Detection and Extraction with Large Language Models |
利用大型语言模型进行餐厅评论中胃肠道疾病的检测与信息抽取 |
large language model |
|
|
| 2 |
A Survey on Enhancing Causal Reasoning Ability of Large Language Models |
综述:提升大型语言模型因果推理能力的研究进展 |
large language model |
|
|
| 3 |
What's In Your Field? Mapping Scientific Research with Knowledge Graphs and Large Language Models |
提出基于知识图谱和大语言模型的科学研究映射方法 |
large language model |
✅ |
|
| 4 |
TRACE: Real-Time Multimodal Common Ground Tracking in Situated Collaborative Dialogues |
TRACE:用于情境化协作对话的实时多模态共同基础追踪系统 |
multimodal |
|
|
| 5 |
Rethinking Prompt-based Debiasing in Large Language Models |
揭示大语言模型中基于Prompt的去偏见方法的局限性与虚假繁荣 |
large language model |
|
|
| 6 |
Harmonizing Large Language Models with Collaborative Behavioral Signals for Conversational Recommendation |
提出融合协同行为信号的大语言模型对话推荐框架,提升推荐相关性。 |
large language model |
|
|
| 7 |
Medical Large Language Model Benchmarks Should Prioritize Construct Validity |
医学大语言模型评测应优先考虑建构效度,避免唯榜单论 |
large language model |
|
|
| 8 |
Battling Misinformation: An Empirical Study on Adversarial Factuality in Open-Source Large Language Models |
研究表明,开源大语言模型在对抗性事实性攻击下的鲁棒性差异显著 |
large language model |
|
|
| 9 |
Probabilistic Reasoning with LLMs for k-anonymity Estimation |
提出BRANCH方法,利用LLM进行概率推理,提升k-匿名性估计的准确率。 |
large language model chain-of-thought |
|
|
| 10 |
Attention Reveals More Than Tokens: Training-Free Long-Context Reasoning with Attention-guided Retrieval |
提出Attrieval:利用注意力引导检索,实现无需训练的长文本推理 |
large language model chain-of-thought |
|
|
| 11 |
MOAT: Evaluating LMMs for Capability Integration and Instruction Grounding |
提出MOAT基准,评估LMMs在能力集成和指令理解方面的性能 |
multimodal |
✅ |
|
| 12 |
Reassessing Active Learning Adoption in Contemporary NLP: A Community Survey |
通过社区调查重新评估主动学习在现代NLP中的应用现状与挑战 |
large language model |
|
|
| 13 |
Cost-Optimal Grouped-Query Attention for Long-Context Modeling |
提出面向长文本建模的成本优化分组查询注意力机制,显著降低计算成本。 |
large language model |
|
|
| 14 |
Generative AI for Named Entity Recognition in Low-Resource Language Nepali |
探索生成式AI在低资源尼泊尔语命名实体识别中的应用 |
large language model |
|
|
| 15 |
How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation |
提出EchoMist基准测试,评估并提升LLM对隐性错误信息的防御能力 |
large language model |
|
|
| 16 |
Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts |
揭示LLM安全评估器对输入伪影的脆弱性,提出陪审团评估方法以提升鲁棒性。 |
large language model |
|
|
| 17 |
MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System |
提出MoC框架,优化RAG系统中文本分块策略,提升检索增强生成效果 |
large language model |
|
|
| 18 |
Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks |
提出Plan-and-Act框架,提升LLM智能体在长程任务中的规划能力 |
large language model |
|
|
| 19 |
Token Weighting for Long-Range Language Modeling |
提出基于token权重调整的训练方法,提升LLM在长文本建模任务中的性能。 |
large language model |
|
|
| 20 |
VaxGuard: A Multi-Generator, Multi-Type, and Multi-Role Dataset for Detecting LLM-Generated Vaccine Misinformation |
VaxGuard:多生成器、多类型、多角色的疫苗虚假信息检测数据集 |
large language model |
|
|
| 21 |
HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding |
HaploVL:单Transformer架构的多模态理解基线模型 |
large language model |
|
|
| 22 |
Leveraging Retrieval Augmented Generative LLMs For Automated Metadata Description Generation to Enhance Data Catalogs |
利用检索增强生成式LLM自动生成元数据描述,提升数据目录质量。 |
large language model |
|
|
| 23 |
Learning to Contextualize Web Pages for Enhanced Decision Making by LLM Agents |
提出LCoW框架,通过上下文理解增强LLM Agent在Web任务中的决策能力 |
large language model |
|
|