| 1 |
DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning |
DART: leveraging multi-agent disagreement for tool recruitment in multimodal reasoning |
large language model multimodal |
|
|
| 2 |
Bridging Code Graphs and Large Language Models for Better Code Understanding |
CGBridge: improves code understanding by bridging code graphs and large language models |
large language model instruction following |
|
|
| 3 |
Complementary Learning Approach for Text Classification using Large Language Models |
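Proposes a complementary learning approach that uses large language models for text classification, balancing cost-effectiveness with research rigor. |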
large language model chain-of-thought |
|
|
| 4 |
Do Large Language Models Truly Understand Cross-cultural Differences? |
Proposes the SAGE benchmark for evaluating large language models' cross-cultural understanding and reasoning abilities. |
large language model |
|
|
| 5 |
When Large Language Models Do Not Work: Online Incivility Prediction through Graph Neural Networks |
Proposes a graph neural network-based method for predicting online incivility that outperforms large language models. |
large language model |
|
|
| 6 |
Investigating Training and Generalization in Faithful Self-Explanations of Large Language Models |
Improves the faithfulness and generalization of large language models' self-explanations through continual learning. |
large language model |
|
|
| 7 |
NeSTR: A Neuro-Symbolic Abductive Framework for Temporal Reasoning in Large Language Models |
NeSTR: a neuro-symbolic abductive framework for strengthening temporal reasoning in large language models |
large language model |
|
|
| 8 |
HalluShift++: Bridging Language and Vision through Internal Representation Shifts for Hierarchical Hallucinations in MLLMs |
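HalluShift++: bridges language and vision through internal representation shifts to detect hierarchical hallucinations in multimodal large language models |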
large language model multimodal |
✅ |
|
| 9 |
A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification |
Proposes a simple method that enhances pre-trained language models with speech tokens for classification tasks. |
large language model multimodal |
✅ |
|
| 10 |
Leveraging KV Similarity for Online Structured Pruning in LLMs |
Proposes Token Filtering, which uses KV similarity for online structured pruning of LLMs. |
large language model |
|
|
| 11 |
Segment, Embed, and Align: A Universal Recipe for Aligning Subtitles to Signing |
Proposes the SEA framework for universal and efficient alignment of subtitles to sign language video. |
TAMP |
|
|
| 12 |
Short-Context Dominance: How Much Local Context Natural Language Actually Needs? |
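Shows that short context is usually sufficient for large language model prediction tasks, and proposes the DaMCL metric to detect long-context dependence and improve model outputs. |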
large language model |
|
|
| 13 |
Do Generalisation Results Generalise? |
Shows that generalization evaluation results for large language models are not consistent across different OOD datasets. |
large language model |
|
|
| 14 |
Mary, the Cheeseburger-Eating Vegetarian: Do LLMs Recognize Incoherence in Narratives? |
Shows that large language models have limitations in recognizing narrative incoherence, especially violations of character traits. |
large language model |
|
|
| 15 |
PCMind-2.1-Kaiyuan-2B Technical Report |
PCMind-2.1-Kaiyuan-2B: an open-source 2-billion-parameter model that improves training efficiency and effectiveness in resource-constrained settings. |
large language model |
✅ |
|
| 16 |
MoCoRP: Modeling Consistent Relations between Persona and Response for Persona-based Dialogue |
MoCoRP: proposes a framework that models consistent relations between persona and response to improve persona-based dialogue quality. |
large language model |
✅ |
|
| 17 |
SwissGov-RSD: A Human-annotated, Cross-lingual Benchmark for Token-level Recognition of Semantic Differences Between Related Documents |
Proposes SwissGov-RSD, a cross-lingual benchmark dataset for recognizing semantic differences between related documents. |
large language model |
|
|
| 18 |
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs |
Extends rotary position embeddings (RoPE) with their imaginary part to improve long-context modeling in LLMs. |
large language model |
✅ |
|