| 1 |
Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing |
提出基于预训练LLM的离散多模态Transformer,用于混合监督语音处理 |
large language model multimodal |
|
|
| 2 |
Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding |
提出TCELongBench基准,利用大语言模型分析时序复杂事件,解决长文本理解难题。 |
large language model TAMP |
|
|
| 3 |
Break the Chain: Large Language Models Can be Shortcut Reasoners |
提出“打破链条”策略,提升大语言模型在复杂推理任务中的效率与泛化性 |
large language model chain-of-thought |
|
|
| 4 |
Chain of Agents: Large Language Models Collaborating on Long-Context Tasks |
提出Chain-of-Agents框架,通过多智能体协作解决长文本处理中的信息聚合与推理难题。 |
large language model |
|
|
| 5 |
Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities |
解耦逻辑推理:探究上下文对大语言模型推理能力的影响 |
large language model |
✅ |
|
| 6 |
Mitigate Position Bias in Large Language Models via Scaling a Single Dimension |
通过缩放单维度隐藏状态,缓解大语言模型中的位置偏差问题 |
large language model |
|
|
| 7 |
Large Language Models as Carriers of Hidden Messages |
提出UTF攻击与UTFC防御,揭示并缓解大语言模型隐藏信息泄露风险 |
large language model |
|
|
| 8 |
Large Language Models Make Sample-Efficient Recommender Systems |
提出Laser框架,验证大语言模型提升推荐系统在小样本学习场景下的性能 |
large language model |
|
|
| 9 |
Prompting Large Language Models with Human Error Markings for Self-Correcting Machine Translation |
利用人工标注错误提示的大语言模型进行机器翻译自校正 |
large language model |
|
|
| 10 |
Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language Models |
提出协同事件理解方法,利用大语言模型与小语言模型解决跨文档事件共指消解问题。 |
large language model |
|
|
| 11 |
Reinforcement Tuning for Detecting Stances and Debunking Rumors Jointly with Large Language Models |
提出基于强化学习微调的大语言模型框架JSDRV,联合检测立场并证伪谣言 |
large language model |
|
|
| 12 |
The current status of large language models in summarizing radiology report impressions |
评估大型语言模型在放射报告印象总结中的能力与局限性 |
large language model |
|
|
| 13 |
Diver: Large Language Model Decoding with Span-Level Mutual Information Verification |
Diver:提出基于跨度互信息验证的大语言模型解码方法,提升输出与输入的符合度。 |
large language model |
|
|
| 14 |
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners |
提出TopViewRS数据集,评估视觉-语言模型在鸟瞰视角下的空间推理能力 |
multimodal chain-of-thought |
|
|
| 15 |
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models |
提出mCoT,通过多语言指令微调提升语言模型在多语言推理任务中的一致性 |
large language model chain-of-thought |
|
|
| 16 |
RATT: A Thought Structure for Coherent and Correct LLM Reasoning |
RATT:一种用于连贯且正确的大语言模型推理的思维结构 |
large language model |
|
|
| 17 |
Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller |
SelfControl:通过梯度压缩实现大语言模型行为的无监督自控 |
large language model |
|
|
| 18 |
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices |
SpecExec:面向消费级设备的LLM大规模并行推测解码 |
large language model |
|
|
| 19 |
Scalable MatMul-free Language Modeling |
提出无矩阵乘法的语言模型,在保持性能的同时显著降低计算和内存需求 |
large language model |
|
|
| 20 |
CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks |
CheckEmbed:有效验证LLM在开放任务中的解决方案,提升准确性和可扩展性 |
large language model |
|
|
| 21 |
Order-Independence Without Fine Tuning |
提出Set-Based Prompting,解决LLM对输入顺序的依赖问题,无需微调。 |
large language model |
|
|
| 22 |
Technical Language Processing for Telecommunications Specifications |
针对电信规范,提出技术语言处理方法以提升领域LLM性能。 |
large language model |
|
|
| 23 |
FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models |
FedMKT:联邦互知识迁移框架,用于协同增强大小语言模型 |
large language model |
|
|