| 1 |
DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data |
DeSTA2: develops an instruction-following speech language model without speech instruction-tuning data |
large language model instruction following chain-of-thought |
|
|
| 2 |
Scheherazade: Evaluating Chain-of-Thought Math Reasoning in LLMs with Chain-of-Problems |
Scheherazade: automatically generates math reasoning benchmarks via Chain-of-Problems to evaluate LLMs' chain-of-thought ability. |
large language model chain-of-thought |
✅ |
|
| 3 |
Instance-adaptive Zero-shot Chain-of-Thought Prompting |
Proposes an instance-adaptive zero-shot chain-of-thought prompting method to improve LLM reasoning |
large language model chain-of-thought |
|
|
| 4 |
Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done! |
Proposes the iCOPERNICUS framework for evaluating large language models' ability at in-context personalized summarization |
large language model |
|
|
| 5 |
Zero-Shot Classification of Crisis Tweets Using Instruction-Finetuned Large Language Models |
Uses instruction-finetuned large language models for zero-shot classification of crisis tweets |
large language model |
|
|
| 6 |
Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models |
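Proposes a dynamic conversational benchmarking system that evaluates LLMs' long-term memory and information integration in interleaved multi-task scenarios. |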
large language model |
|
|
| 7 |
1 Trillion Token (1TT) Platform: A Novel Framework for Efficient Data Sharing and Compensation in Large Language Models |
Proposes the 1TT platform for efficient data sharing and fair compensation in large language models. |
large language model |
|
|
| 8 |
Aggressive Post-Training Compression on Extremely Large Language Models |
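Proposes an aggressive post-training compression method that efficiently compresses extremely large language models while preserving accuracy. |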
large language model |
|
|
| 9 |
Towards Robust Multimodal Sentiment Analysis with Incomplete Data |
Proposes LNLN, a language-dominated noise-resistant learning network, to address missing data in multimodal sentiment analysis. |
multimodal |
|
|
| 10 |
Do Influence Functions Work on Large Language Models? |
Finds that influence functions perform poorly on large language models and analyzes why. |
large language model |
|
|
| 11 |
A Methodology for Explainable Large Language Models with Integrated Gradients and Linguistic Analysis in Text Classification |
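Proposes the SLIME method, which combines integrated gradients (IG) and linguistic analysis to improve LLM explainability in text classification |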
large language model |
|
|
| 12 |
Evaluating the performance of state-of-the-art esg domain-specific pre-trained large language models in text classification against existing models and traditional machine learning techniques |
Domain-specific LLMs fine-tuned with QLoRA outperform traditional methods and existing models in ESG text classification |
large language model |
|
|
| 13 |
Adaptable Moral Stances of Large Language Models on Sexist Content: Implications for Society and Gender Discourse |
Analyzes large language models' moral stances on sexist content and their societal implications |
large language model |
|
|
| 14 |
LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models |
Proposes LexEval to evaluate the application of large language models in the legal domain |
large language model |
✅ |
|
| 15 |
Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models |
Proposes Reference Trustable Decoding, which enhances large language models' downstream task capability without fine-tuning. |
large language model |
✅ |
|
| 16 |
Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information |
Uses large multimodal models to extract knowledge components from multimedia question information for knowledge tracing |
multimodal |
|
|
| 17 |
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation |
LLMEmb: uses large language models to generate item embeddings, improving sequential recommendation performance |
large language model |
✅ |
|
| 18 |
Neurosymbolic AI approach to Attribution in Large Language Models |
Proposes a neurosymbolic AI approach to improve the reliability and interpretability of attribution in large language models |
large language model |
|
|
| 19 |
A Looming Replication Crisis in Evaluating Behavior in Language Models? Evidence and Solutions |
Reveals a potential replication crisis in evaluating large language model behavior and proposes solutions |
large language model chain-of-thought |
|
|
| 20 |
HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding |
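Proposes the HELPD framework, which mitigates multimodal hallucination in LVLMs through hierarchical feedback learning and vision-enhanced penalty decoding |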
multimodal |
|
|
| 21 |
JaPOC: Japanese Post-OCR Correction Benchmark using Vouchers |
JaPOC: builds a post-OCR correction benchmark for Japanese vouchers to improve recognition accuracy |
TAMP |
|
|
| 22 |
DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining |
Proposes DoPAMine to address insufficient pre-training data in low-resource industry domains |
large language model |
|
|
| 23 |
Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution |
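Proposes the Gaussian Concept Subspace (GCS) method, improving the robustness and practical effectiveness of LLM concept representations |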
large language model |
|
|
| 24 |
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows" |
FaithEval: evaluates language models' faithfulness under inconsistent contexts, revealing the shortcomings of existing models in this respect |
large language model |
✅ |
|
| 25 |
Beyond Scores: A Modular RAG-Based System for Automatic Short Answer Scoring with Feedback |
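Proposes a modular RAG-based system for automatic short answer scoring with feedback, improving scoring accuracy and providing interpretable feedback. |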
large language model |
|
|
| 26 |
KV-Compress: Paged KV-Cache Compression with Variable Compression Rates per Attention Head |
KV-Compress: a compression method built on a paged KV cache with variable compression rates per attention head |
large language model |
|
|
| 27 |
Text Clustering as Classification with LLMs |
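Proposes a text clustering framework based on LLM in-context learning that needs no fine-tuning or complex algorithms, simplifying the text clustering pipeline. |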
large language model |
✅ |
|
| 28 |
Analysing Zero-Shot Readability-Controlled Sentence Simplification |
Explores zero-shot readability-controlled sentence simplification and analyzes the influence of contextual information. |
large language model |
|
|
| 29 |
How Entangled is Factuality and Deception in German? |
Studies how factuality and deception are entangled in German, revealing the limitations of existing deception detection models. |
large language model |
|
|
| 30 |
Enhancing High-order Interaction Awareness in LLM-based Recommender Model |
ELMRec: enhances LLMs' awareness of high-order interactions to improve recommendation performance |
large language model |
|
|
| 31 |
Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Object-Oriented Programming |
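Discusses the application of object-oriented programming in machine learning, deep learning, and big data analytics to improve code modularity, maintainability, and scalability. |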
large language model |
|
|