| 1 |
Are Large Language Models the future crowd workers of Linguistics? |
利用大型语言模型替代语言学领域的人工众包工作,提升数据获取效率。 |
large language model chain-of-thought |
|
|
| 2 |
Leveraging large language models for structured information extraction from pathology reports |
利用大型语言模型从病理报告中提取结构化信息 |
large language model |
|
|
| 3 |
Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers |
利用大语言模型和合成数据自动检测研究论文中的数据集引用 |
large language model |
|
|
| 4 |
Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering |
评估大型语言模型在问答任务中元层次和对象层次的推理能力 |
large language model |
|
|
| 5 |
VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models |
VisCon-100K:利用上下文网络数据微调视觉语言模型,提升多模态理解能力 |
large language model multimodal |
|
|
| 6 |
Large Language Diffusion Models |
提出LLaDA:一种基于扩散模型的大语言模型,挑战自回归模型主导地位。 |
large language model instruction following |
✅ |
|
| 7 |
A Preliminary Exploration with GPT-4o Voice Mode |
GPT-4o语音模式初步探索:音频理解与推理能力评估 |
large language model multimodal |
|
|
| 8 |
Enhancing Multilingual LLM Pretraining with Model-Based Data Selection |
提出基于模型的跨语言LLM预训练数据选择方法,提升模型性能和效率。 |
large language model |
|
|
| 9 |
KGGen: Extracting Knowledge Graphs from Plain Text with Language Models |
KGGen:利用语言模型从纯文本中抽取高质量知识图谱,解决知识图谱数据稀缺问题。 |
foundation model |
|
|
| 10 |
Man Made Language Models? Evaluating LLMs' Perpetuation of Masculine Generics Bias |
评估大型语言模型中男性泛指偏见:揭示并量化LLM对性别刻板印象的强化 |
large language model |
|
|
| 11 |
Hallucinations and Truth: A Comprehensive Accuracy Evaluation of RAG, LoRA and DoRA |
提出DoRA,在RAG基础上优化LLM微调,提升生成式AI在特定领域的准确率和效率。 |
large language model |
|
|
| 12 |
Prediction hubs are context-informed frequent tokens in LLMs |
揭示LLM预测中枢为上下文相关的频繁token,避免不必要的hubness缓解 |
large language model |
|
|
| 13 |
LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs -- No Silver Bullet for LC or RAG Routing |
LaRA:基准测试检索增强生成与长文本LLM,揭示长文本处理或RAG路由并非万能解 |
large language model |
✅ |
|
| 14 |
Named entity recognition for Serbian legal documents: Design, methodology and dataset development |
提出一种基于BERT的塞尔维亚语法律文档命名实体识别方法与数据集 |
large language model |
|
|
| 15 |
Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction |
提出面向方面的摘要方法,用于提升精神病短期再入院预测性能 |
large language model |
|
|
| 16 |
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation |
WebOrganizer:通过构建领域增强预训练数据筛选,提升下游任务性能。 |
large language model |
|
|
| 17 |
Beyond English: Unveiling Multilingual Bias in LLM Copyright Compliance |
揭示LLM版权合规中的多语言偏见,发现不同语言处理差异 |
large language model |
|
|
| 18 |
Can Post-Training Quantization Benefit from an Additional QLoRA Integration? |
提出PTQ-QLoRA集成方法,提升量化大语言模型在资源受限环境下的性能。 |
large language model |
|
|
| 19 |
ORI: O Routing Intelligence |
提出ORI:一种基于LLM路由的智能框架,提升多任务处理性能。 |
large language model |
|
|