| 1 |
Context-Informed Machine Translation of Manga using Multimodal Large Language Models |
提出基于多模态大语言模型的漫画机器翻译方法,提升翻译质量并构建新数据集。 |
large language model multimodal |
|
|
| 2 |
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs |
提出MM-Embed,利用多模态LLM实现通用多模态检索,并在M-BEIR和MTEB上取得SOTA。 |
large language model multimodal |
|
|
| 3 |
FactTest: Factuality Testing in Large Language Models with Finite-Sample and Distribution-Free Guarantees |
FactTest:基于有限样本和无分布保证的大语言模型事实性测试框架 |
large language model |
|
|
| 4 |
Improving Scientific Hypothesis Generation with Knowledge Grounded Large Language Models |
提出KG-CoI框架,通过知识图谱增强大语言模型在科学假设生成中的准确性和可信度。 |
large language model |
|
|
| 5 |
Evaluating Creative Short Story Generation in Humans and Large Language Models |
系统分析人类与大型语言模型的短篇故事创作能力 |
large language model |
|
|
| 6 |
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention |
提出低维投影注意力(LPA)以高效训练大规模语言模型,提升性能并加速训练。 |
large language model |
|
|
| 7 |
Explainable cognitive decline detection in free dialogues with a Machine Learning approach based on pre-trained Large Language Models |
提出一种基于预训练大语言模型的认知衰退检测方法,用于分析自由对话。 |
large language model |
|
|
| 8 |
Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge |
Zebra-Llama:一种上下文感知的大语言模型,用于普及罕见病知识 |
large language model |
|
|
| 9 |
AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis |
提出AVSS指标,通过激活方差-稀疏性分析评估大语言模型层重要性,实现模型压缩。 |
large language model |
|
|
| 10 |
A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness |
综述小语言模型(SLM):技术、增强、应用、与LLM协作及可信赖性 |
large language model |
|
|
| 11 |
Advancements and limitations of LLMs in replicating human color-word associations |
评估LLM在复现人类颜色-词语联想方面的能力与局限性 |
large language model |
|
|
| 12 |
Extracting Unlearned Information from LLMs with Activation Steering |
提出匿名激活引导方法,从已进行知识遗忘的大语言模型中精确提取未学习信息。 |
large language model |
|
|
| 13 |
TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network |
提出TeleOracle以解决通信网络智能管理问题 |
large language model |
|
|
| 14 |
A Comparative Analysis of Counterfactual Explanation Methods for Text Classifiers |
对比分析文本分类器反事实解释方法,揭示LLM与梯度方法优劣 |
large language model |
|
|
| 15 |
Positive Experience Reflection for Agents in Interactive Text Environments |
提出Sweet&Sour方法,通过正向经验反思提升LLM Agent在交互式文本环境中的性能 |
large language model |
|
|
| 16 |
Shortcut Learning in In-Context Learning: A Survey |
综述ICL中的捷径学习:分析类型、原因、基准与缓解策略 |
large language model |
|
|
| 17 |
Culinary Class Wars: Evaluating LLMs using ASH in Cuisine Transfer Task |
提出ASH基准以评估LLMs在菜谱转移任务中的表现 |
large language model |
✅ |
|
| 18 |
QCG-Rerank: Chunks Graph Rerank with Query Expansion in Retrieval-Augmented LLMs for Tourism Domain |
QCG-Rerank:面向旅游领域的检索增强LLM的块图重排序与查询扩展 |
large language model |
|
|
| 19 |
MILU: A Multi-task Indic Language Understanding Benchmark |
提出MILU:一个多任务印度语言理解基准,用于评估LLM在印度语言上的能力。 |
large language model |
|
|
| 20 |
MdEval: Massively Multilingual Code Debugging |
提出MdEval大规模多语言代码调试基准,促进代码LLM在多语言环境下的调试能力。 |
large language model |
|
|
| 21 |
The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units |
利用神经科学方法,识别LLM中与任务相关的因果语言单元 |
large language model |
|
|
| 22 |
Towards Pedagogical LLMs with Supervised Fine Tuning for Computing Education |
通过监督式微调提升LLM在计算教育中的教学效果 |
large language model |
|
|