| 1 |
CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration |
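Proposes CoCA, which uses constitutional calibration to restore multimodal large language models' safety awareness toward malicious visual inputs. |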
large language model multimodal |
|
|
| 2 |
NVLM: Open Frontier-Class Multimodal LLMs |
NVLM 1.0: frontier-class multimodal large language models that rival GPT-4o, with improved text performance and an open-source release |
large language model multimodal |
✅ |
|
| 3 |
Chain-of-Thought Prompting for Speech Translation |
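Proposes a chain-of-thought prompting method for speech translation that significantly improves the translation performance of Speech-LLMs |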
large language model chain-of-thought |
|
|
| 4 |
Large Language Models are Good Multi-lingual Learners : When LLMs Meet Cross-lingual Prompts |
Proposes MLPrompt, a multilingual prompting method that improves LLM reasoning and comprehension under complex rules |
large language model chain-of-thought |
|
|
| 5 |
Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant |
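A comprehensive evaluation of quantization methods for large language models: trade-offs among model size, task difficulty, and performance |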
large language model instruction following |
|
|
| 6 |
Enriching Datasets with Demographics through Large Language Models: What's in a Name? |
Uses large language models to infer demographic information and improve dataset quality |
large language model |
|
|
| 7 |
THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models |
THaMES: an end-to-end tool for hallucination mitigation and evaluation in large language models |
large language model |
|
|
| 8 |
Task Arithmetic for Language Expansion in Speech Translation |
Proposes an augmented task arithmetic method for language expansion in speech translation, with no retraining required. |
large language model foundation model multimodal |
|
|
| 9 |
LOLA -- An Open-Source Massively Multilingual Large Language Model |
LOLA: an open-source massively multilingual large language model |
large language model |
|
|
| 10 |
The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives |
Proposes an educational tool for dynamic multimodal storytelling based on multi-agent generative AI |
multimodal |
|
|
| 11 |
Evaluating the Impact of Compression Techniques on Task-Specific Performance of Large Language Models |
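Evaluates the impact of compression techniques on the task performance of large language models, emphasizing the importance of calibration data and evaluation metrics |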
large language model |
|
|
| 12 |
Self-Evolutionary Large Language Models through Uncertainty-Enhanced Preference Optimization |
Proposes Uncertainty-enhanced Preference Optimization (UPO) to improve LLM self-evolution performance |
large language model |
|
|
| 13 |
Strategic Insights in Human and Large Language Model Tactics at Word Guessing Games |
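Analyzes the strategies of humans and large language models in word guessing games, revealing the challenges models face in multilingual settings. |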
large language model |
|
|
| 14 |
KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models |
KVPruner: structural pruning to speed up large language models and reduce their memory footprint |
large language model |
|
|
| 15 |
Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models |
Proposes the Typhoon-Audio model to improve audio language models' low-resource language and instruction-following capabilities |
instruction following |
|
|
| 16 |
Enhancing Code-switched Text-to-Speech Synthesis Capability in Large Language Models with only Monolingual Corpora |
Proposes CS-LLM, which improves large language models' code-switched text-to-speech synthesis using only monolingual corpora |
large language model |
|
|
| 17 |
Investigating Context-Faithfulness in Large Language Models: The Roles of Memory Strength and Evidence Style |
Studies how memory strength and evidence style affect the context-faithfulness of large language models |
large language model |
✅ |
|
| 18 |
A Unified Framework to Classify Business Activities into International Standard Industrial Classification through Large Language Models for Circular Economy |
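Uses large language models to classify business activities into the International Standard Industrial Classification, in support of the circular economy. |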
large language model |
|
|
| 19 |
Adaptive Large Language Models By Layerwise Attention Shortcuts |
Proposes layerwise attention shortcuts for adaptive computation in large language models |
large language model |
|
|
| 20 |
Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement |
Proposes a diversity-centric data selection method based on iterative refinement to improve LLM fine-tuning |
large language model instruction following |
✅ |
|
| 21 |
Surveying the MLLM Landscape: A Meta-Review of Current Surveys |
A meta-review of MLLM surveys: a systematic review of evaluation methods and future directions for multimodal large language models |
large language model multimodal |
|
|
| 22 |
Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs |
Proposes TRIM, which reduces tokens using a CLIP metric to improve the efficiency of multimodal LLMs. |
large language model multimodal |
|
|
| 23 |
CREAM: Comparison-Based Reference-Free ELO-Ranked Automatic Evaluation for Meeting Summarization |
Proposes CREAM, a reference-free automatic evaluation method for meeting summarization based on comparison and ELO ranking |
large language model chain-of-thought |
|
|
| 24 |
Towards No-Code Programming of Cobots: Experiments with Code Synthesis by Large Code Models for Conversational Programming |
Explores conversational programming based on large code models to enable no-code programming of collaborative robots |
large language model |
|
|
| 25 |
Watch Your Steps: Observable and Modular Chains of Thought |
Proposes program trace prompting to make CoT more observable and modular and to address non-local errors. |
chain-of-thought |
|
|
| 26 |
Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs |
Small language models outperform humans in short creative writing: a study comparing SLMs with humans and LLMs |
large language model |
|
|
| 27 |
Egalitarian Language Representation in Language Models: It All Begins with Tokenizers |
Proposes GPE to make language model tokenizers represent complex scripts more equitably |
large language model |
|
|
| 28 |
Multi-Document Grounded Multi-Turn Synthetic Dialog Generation |
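Proposes a multi-document grounded technique for generating multi-turn synthetic dialogs, improving model performance on document-grounded dialog tasks. |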
chain-of-thought |
|
|
| 29 |
Says Who? Effective Zero-Shot Annotation of Focalization |
Uses large language models for zero-shot annotation of narrative focalization, with performance comparable to human annotators. |
large language model |
|
|
| 30 |
Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning |
Proposes RGER: reasoning-graph-enhanced exemplar retrieval to improve in-context learning |
large language model |
✅ |
|
| 31 |
SC-Phi2: A Fine-tuned Small Language Model for StarCraft II Macromanagement Tasks |
SC-Phi2: a fine-tuned small language model for StarCraft II macromanagement tasks |
large language model |
|
|
| 32 |
Diversity-grounded Channel Prototypical Learning for Out-of-Distribution Intent Detection |
Proposes diversity-grounded channel prototypical learning to address out-of-distribution intent detection |
large language model |
|
|
| 33 |
DynamicNER: A Dynamic, Multilingual, and Fine-Grained Dataset for LLM-based Named Entity Recognition |
Proposes the DynamicNER dataset for evaluating LLMs on dynamic, multilingual, and fine-grained named entity recognition. |
large language model |
✅ |
|
| 34 |
Propulsion: Steering LLM with Tiny Fine-Tuning |
Propulsion: scales specific dimensions of an LLM through tiny fine-tuning for efficient task steering. |
large language model |
|
|
| 35 |
Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase Extraction |
Proposes Attention-Seeker to address unsupervised keyphrase extraction |
large language model |
|
|
| 36 |
Efficient and Personalized Mobile Health Event Prediction via Small Language Models |
Uses small language models for efficient and personalized mobile health event prediction |
large language model |
|
|