| 1 |
Higher Order Transformers: Enhancing Stock Movement Prediction On Multimodal Time-Series Data |
Proposes Higher Order Transformers to enhance stock movement prediction on multimodal time-series data |
multimodal |
|
|
| 2 |
Aspen Open Jets: Unlocking LHC Data for Foundation Models in Particle Physics |
AspenOpenJets: pretraining foundation models for particle physics on LHC open data |
foundation model |
|
|
| 3 |
Benchmarking large language models for materials synthesis: the case of atomic layer deposition |
ALDbench: evaluating large language models on materials synthesis by atomic layer deposition |
large language model |
|
|
| 4 |
FDM-Bench: A Comprehensive Benchmark for Evaluating Large Language Models in Additive Manufacturing Tasks |
FDM-Bench: a comprehensive benchmark for evaluating large language models on additive manufacturing tasks |
large language model |
|
|
| 5 |
Activation Sparsity Opportunities for Compressing General Large Language Models |
Explores activation sparsity to compress general large language models for efficient deployment on edge devices |
large language model |
|
|
| 6 |
KVDirect: Distributed Disaggregated LLM Inference |
KVDirect: distributed disaggregated LLM inference that improves resource utilization and serving capacity |
large language model |
|
|
| 7 |
METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation |
METIS: fast, high-quality RAG systems through configuration adaptation |
large language model |
|
|
| 8 |
AdvPrefix: An Objective for Nuanced LLM Jailbreaks |
AdvPrefix: an objective function for fine-grained LLM jailbreaks |
large language model |
|
|
| 9 |
Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Ambiguous Prompts and Unanswerable Questions |
Detects LLM hallucination through layer-wise information deficiency to handle ambiguous prompts and unanswerable questions |
large language model |
|
|
| 10 |
Text2Cypher: Bridging Natural Language and Graph Databases |
Text2Cypher: bridging natural language and graph database queries to improve the experience of non-technical users |
large language model |
|
|
| 11 |
Llama 3 Meets MoE: Efficient Upcycling |
Efficiently training MoE models from Llama 3: performance gains at low cost |
large language model |
|
|
| 12 |
HashEvict: A Pre-Attention KV Cache Eviction Strategy using Locality-Sensitive Hashing |
HashEvict: a pre-attention KV cache eviction strategy using locality-sensitive hashing to reduce GPU memory consumption in LLM inference |
large language model |
|
|