| 1 |
FedNano: Toward Lightweight Federated Tuning for Pretrained Multimodal Large Language Models |
Proposes FedNano for lightweight federated tuning of multimodal large language models |
large language model multimodal |
|
|
| 2 |
Graph-MLLM: Harnessing Multimodal Large Language Models for Multimodal Graph Learning |
Proposes Graph-MLLM to address evaluation and integration in multimodal graph learning |
large language model multimodal |
|
|
| 3 |
Predictable Scale: Part II, Farseer: A Refined Scaling Law in Large Language Models |
Proposes Farseer to address prediction accuracy in large language model training |
large language model |
✅ |
|
| 4 |
GUARD: Guided Unlearning and Retention via Data Attribution for Large Language Models |
Proposes the GUARD framework to address unintended forgetting in large language models |
large language model |
|
|
| 5 |
Foundation Models for Causal Inference via Prior-Data Fitted Networks |
Proposes CausalFM to address model training for causal inference |
foundation model |
|
|
| 6 |
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices |
Proposes MNN-LLM to address slow large language model inference on mobile devices |
large language model |
|
|
| 7 |
Time-IMM: A Dataset and Benchmark for Irregular Multimodal Multivariate Time Series |
Proposes the Time-IMM dataset to address irregular multimodal multivariate time series |
multimodal |
✅ |
|
| 8 |
EAGLE: Efficient Alignment of Generalized Latent Embeddings for Multimodal Survival Prediction with Interpretable Attribution Analysis |
Proposes EAGLE to address fusion and interpretability in multimodal cancer survival prediction |
multimodal |
|
|
| 9 |
Build the web for agents, not agents for the web |
Proposes an agentic web interface to address the limited adaptability of existing web agents |
large language model multimodal |
|
|
| 10 |
Robustly Improving LLM Fairness in Realistic Settings via Interpretability |
Improves LLM fairness in hiring through interpretability methods |
large language model chain-of-thought |
|
|
| 11 |
Data Shifts Hurt CoT: A Theoretical Study |
Studies how data shifts affect chain-of-thought reasoning and the mechanisms behind the effect |
large language model chain-of-thought |
|
|
| 12 |
NoLoCo: No-all-reduce Low Communication Training Method for Large Models |
Proposes NoLoCo to address communication bottlenecks in large-model training |
large language model |
|
|
| 13 |
Detecting High-Stakes Interactions with Activation Probes |
Proposes activation probes to detect high-stakes interactions |
large language model |
|
|
| 14 |
ConTextTab: A Semantics-Aware Tabular In-Context Learner |
Proposes ConTextTab to address insufficient semantic understanding of tabular data |
large language model |
✅ |
|
| 15 |
BugGen: A Self-Correcting Multi-Agent LLM Pipeline for Realistic RTL Bug Synthesis |
Proposes BugGen to address inefficient RTL debugging |
large language model |
|
|
| 16 |
Time To Impeach LLM-as-a-Judge: Programs are the Future of Evaluation |
Proposes PAJAMA to address high cost and bias in LLM-based evaluation |
large language model |
|
|
| 17 |
TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Similarity Tree |
Proposes TreeLoRA to address efficient continual learning |
large language model |
|
|
| 18 |
Provably Learning from Language Feedback |
Proposes the HELiX algorithm to address learning from language feedback |
large language model |
|
|