| 1 |
Analyzing the Role of Semantic Representations in the Era of Large Language Models |
研究大型语言模型时代语义表示的作用,提出AMR驱动的思维链提示方法。 |
large language model chain-of-thought |
✅ |
|
| 2 |
A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law |
综述:大型语言模型在金融、医疗和法律等关键社会领域的应用与挑战 |
large language model |
✅ |
|
| 3 |
Large Language Models are Inconsistent and Biased Evaluators |
揭示大语言模型评估器的不一致性和偏见,并提出缓解方案 |
large language model |
|
|
| 4 |
Automatically Extracting Numerical Results from Randomized Controlled Trials with Large Language Models |
利用大型语言模型自动提取随机对照试验中的数值结果,加速Meta分析。 |
large language model |
|
|
| 5 |
Investigating Wit, Creativity, and Detectability of Large Language Models in Domain-Specific Writing Style Adaptation of Reddit's Showerthoughts |
研究大型语言模型在Reddit Showerthoughts领域特定写作风格的适应性、创造力和可检测性 |
large language model |
|
|
| 6 |
Context-Aware Clustering using Large Language Models |
提出CACTUS,利用开源LLM和上下文感知机制进行高效的监督文本聚类。 |
large language model |
|
|
| 7 |
The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights |
利用问题翻译训练增强多语言推理能力,扩展应用范围并加深理解 |
large language model chain-of-thought |
|
|
| 8 |
Prompt engineering paradigms for medical applications: scoping review and recommendations for better practices |
综述医学领域Prompt工程范式,为提升实践效果提供建议 |
large language model chain-of-thought |
|
|
| 9 |
Context Steering: Controllable Personalization at Inference Time |
提出Context Steering,一种无需训练的推理时上下文可控个性化方法。 |
large language model |
|
|
| 10 |
Question Suggestion for Conversational Shopping Assistants Using Product Metadata |
利用产品元数据,为对话式购物助手生成问题建议,提升用户体验。 |
large language model |
|
|
| 11 |
Controllable Text Generation in the Instruction-Tuning Era |
提出ConGenBench基准测试,评估指令调优语言模型在可控文本生成中的性能,并提出自动生成约束数据集的算法。 |
large language model |
|
|
| 12 |
Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving |
提出Explanation-Refiner框架,结合LLM与定理证明提升NLI解释的有效性。 |
large language model |
|
|
| 13 |
GAIA: A General AI Assistant for Intelligent Accelerator Operations |
GAIA:用于智能加速器操作的通用AI助手 |
large language model |
|
|
| 14 |
The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct Representation |
研究LLM作为标注器的有效性,对比分析直接表征方法 |
large language model |
|
|
| 15 |
Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting Transcripts |
提出基于LLM半自动数据生成方法,用于会议记录上的源接地信息检索对话任务。 |
large language model |
|
|
| 16 |
Automating the Analysis of Public Saliency and Attitudes towards Biodiversity from Digital Media |
提出一种自动化分析公众对生物多样性态度的方法 |
large language model |
|
|
| 17 |
On the Evaluation of Machine-Generated Reports |
提出自动报告生成评估框架,解决长文本报告生成中完整性、准确性和可验证性问题。 |
large language model |
|
|
| 18 |
How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee Responses |
利用GPT改进新手导师反馈:自动重述错误回答以提升培训效果 |
large language model |
|
|