| 1 |
Federated Low-Rank Tensor Estimation for Multimodal Image Reconstruction |
提出基于联邦学习的低秩张量估计方法,用于多模态图像重建。 |
multimodal |
|
|
| 2 |
Shuttle Between the Instructions and the Parameters of Large Language Models |
提出SHIP框架,学习大语言模型指令与参数间的双向映射关系 |
large language model |
|
|
| 3 |
Vision-Language Model Dialog Games for Self-Improvement |
提出VLM对话游戏自提升框架,解决视觉-语言模型训练数据瓶颈问题 |
multimodal |
|
|
| 4 |
Peri-LN: Revisiting Normalization Layer in the Transformer Architecture |
提出Peri-LN,一种新型Transformer归一化策略,提升大规模模型训练稳定性和收敛速度。 |
large language model |
|
|
| 5 |
Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning |
Twilight:利用分层Top-$p$剪枝实现自适应注意力稀疏化,加速长文本LLM推理。 |
large language model |
|
|
| 6 |
A Unified Understanding and Evaluation of Steering Methods |
提出统一框架,分析并评估大语言模型中的隐空间操控方法 |
large language model |
|
|
| 7 |
Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies |
提出MASS框架,通过优化提示词和拓扑结构,自动设计高效的多智能体系统。 |
large language model |
|
|
| 8 |
Layer by Layer: Uncovering Hidden Representations in Language Models |
揭示语言模型中间层表征能力,超越传统末层输出范式 |
large language model |
|
|