| 1 |
Evaluation of Large Language Models via Coupled Token Generation |
提出基于耦合Token生成的大语言模型评估方法,减少评估样本量并发现传统评估的潜在偏差。 |
large language model |
|
|
| 2 |
Large Language Models Are Human-Like Internally |
大型语言模型内部机制更贴近人类认知过程,优于以往认知建模研究结论 |
large language model |
|
|
| 3 |
What is a Number, That a Large Language Model May Know It? |
揭示大语言模型中数字表示的字符串与数值双重性及影响 |
large language model |
|
|
| 4 |
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Document Understanding |
AlignVLM:通过对齐视觉和语言隐空间,提升多模态文档理解能力。 |
multimodal |
|
|
| 5 |
OphthBench: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Ophthalmology |
OphthBench:构建中文眼科领域LLM综合评测基准,助力临床应用 |
large language model |
|
|
| 6 |
On Bob Dylan: A Computational Perspective |
利用计算方法分析鲍勃·迪伦歌词,揭示其音乐风格的演变与创新 |
large language model |
|
|
| 7 |
Lifelong Knowledge Editing requires Better Regularization |
提出MPES与范数约束正则化方法,解决终身知识编辑中的模型退化问题 |
large language model |
|
|
| 8 |
LLM-TA: An LLM-Enhanced Thematic Analysis Pipeline for Transcripts from Parents of Children with Congenital Heart Disease |
提出LLM-TA:一种LLM增强的主题分析流程,用于分析先天性心脏病患儿父母的访谈记录。 |
large language model |
✅ |
|
| 9 |
FutureVision: A methodology for the investigation of future cognition |
提出FutureVision方法,结合多模态语义分析与眼动追踪,研究未来认知。 |
multimodal |
|
|
| 10 |
Massive Values in Self-Attention Modules are the Key to Contextual Knowledge Understanding |
发现自注意力模块中显著值是上下文知识理解的关键 |
large language model |
✅ |
|
| 11 |
Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily Assistant |
研究LLM智能体作为日常助手时,用户信任与团队表现的影响,采用Plan-Then-Execute模式。 |
large language model |
|
|
| 12 |
COVE: COntext and VEracity prediction for out-of-context images |
COVE:通过上下文预测和真实性验证解决脱离语境的图像误导问题 |
multimodal |
|
|
| 13 |
Jailbreaking with Universal Multi-Prompts |
提出JUMP:一种利用通用多提示词破解大型语言模型的方法 |
large language model |
|
|