| 1 |
CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models |
CLAMBER:构建评估大语言模型识别和澄清歧义信息需求的基准 |
large language model chain-of-thought |
✅ |
|
| 2 |
WisPerMed at BioLaySumm: Adapting Autoregressive Large Language Models for Lay Summarization of Scientific Articles |
WisPerMed团队利用微调LLM解决生物医学领域科研文章的通俗化摘要生成问题 |
large language model |
|
|
| 3 |
A review on the use of large language models as virtual tutors |
综述:大型语言模型作为虚拟 tutor 在教育领域的应用 |
large language model |
|
|
| 4 |
Large Language Models for Medicine: A Survey |
综述性论文:探讨大型语言模型在医疗领域的应用与挑战 |
large language model |
|
|
| 5 |
Token-wise Influential Training Data Retrieval for Large Language Models |
提出RapidIn框架,用于高效检索影响LLM生成的训练数据。 |
large language model |
|
|
| 6 |
CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models |
提出CT-Eval中文文本到表格数据集,用于评估和提升大语言模型在此任务上的性能。 |
large language model |
|
|
| 7 |
DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction |
提出面向诊断的提示方法(DOP),提升大语言模型在数学问题纠错中的能力 |
large language model |
✅ |
|
| 8 |
STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents |
提出Style方法,提升大语言模型驱动的对话Agent在未见领域的澄清提问能力 |
large language model |
|
|
| 9 |
Can AI Relate: Testing Large Language Model Response for Mental Health Support |
评估大型语言模型在心理健康支持中的应用,揭示潜在的公平性问题。 |
large language model |
|
|
| 10 |
xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation |
xFinder:利用大语言模型作为自动评估器,提升LLM评估的可靠性 |
large language model |
|
|
| 11 |
Large language models for newspaper sentiment analysis during COVID-19: The Guardian |
利用大型语言模型分析新冠疫情期间《卫报》新闻情感倾向 |
large language model |
|
|
| 12 |
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning |
MoRA:通过高秩更新实现参数高效的大模型微调 |
large language model |
|
|
| 13 |
Fennec: Fine-grained Language Model Evaluation and Correction Extended through Branching and Bridging |
Fennec:通过分支与桥接扩展的细粒度语言模型评估与修正框架 |
large language model |
✅ |
|
| 14 |
Question-Based Retrieval using Atomic Units for Enterprise RAG |
针对企业RAG,提出基于原子单元的问题检索方法,提升检索准确率 |
large language model |
|
|
| 15 |
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark |
提出MathBench,用于全面评估LLM在理论和应用数学方面的能力 |
large language model |
✅ |
|
| 16 |
Distributional Semantics, Holism, and the Instability of Meaning |
研究词语意义分布模型的不稳定性,并提出差分不稳定性概念以应对意义变化。 |
large language model |
|
|