| 1 |
LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data |
LUMA:一个用于学习不确定和多模态数据的基准数据集 |
large language model multimodal |
✅ |
|
| 2 |
Semantic Membership Inference Attack against Large Language Models |
提出语义成员推理攻击SMIA,提升大型语言模型成员推断攻击效果 |
large language model |
|
|
| 3 |
Deep Bayesian Active Learning for Preference Modeling in Large Language Models |
提出BAL-PM,通过主动学习优化LLM偏好建模,显著降低标注成本。 |
large language model |
|
|
| 4 |
QQQ: Quality Quattuor-Bit Quantization for Large Language Models |
QQQ:面向大语言模型的高质量四比特量化,加速推理。 |
large language model |
|
|
| 5 |
Recent Advances in Federated Learning Driven Large Language Models: A Survey on Architecture, Performance, and Security |
综述联邦学习驱动的大语言模型:架构、性能与安全性 |
large language model |
|
|
| 6 |
Large language model validity via enhanced conformal prediction methods |
提出增强型共形预测方法,提升大语言模型输出结果的有效性保证 |
large language model |
|
|
| 7 |
MiNT: Multi-Network Training for Transfer Learning on Temporal Graphs |
提出MiNT:一种用于时序图迁移学习的多网络训练方法 |
foundation model |
|
|
| 8 |
BEACON: Benchmark for Comprehensive RNA Tasks and Language Models |
BEACON:RNA任务与语言模型综合基准,提升RNA序列理解能力 |
foundation model |
✅ |
|
| 9 |
Quantifying Variance in Evaluation Benchmarks |
量化评估基准中的方差,为LLM评估提供更可靠的性能比较依据 |
large language model |
|
|
| 10 |
Evaluating LLM-driven User-Intent Formalization for Verification-Aware Languages |
提出一种基于符号测试的规约质量评估方法,用于验证感知语言的用户意图形式化。 |
large language model |
|
|
| 11 |
Unraveling the Mechanics of Learning-Based Demonstration Selection for In-Context Learning |
揭示ICL中基于学习的示例选择机制,提升泛化性和效率。 |
large language model |
|
|
| 12 |
TGB 2.0: A Benchmark for Learning on Temporal Knowledge Graphs and Heterogeneous Graphs |
TGB 2.0:用于时序知识图谱和异构图学习的基准测试框架 |
TAMP |
|
|