| 1 |
Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance |
Spec-VLA:通过放宽接受条件加速视觉-语言-动作模型的推测解码 |
vision-language-action VLA large language model |
|
|
| 2 |
Doctor Sun: A Bilingual Multimodal Large Language Model for Biomedical AI |
Doctor Sun:一种用于生物医学AI的双语多模态大型语言模型 |
large language model multimodal |
|
|
| 3 |
A Foundation Model for Material Fracture Prediction |
提出基于Transformer的材料断裂预测基础模型,提升泛化性和效率。 |
large language model foundation model multimodal |
|
|
| 4 |
Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods |
研究多模态隐空间的可逆性:优化方法的局限性分析 |
multimodal |
|
|
| 5 |
Quantifying surprise in clinical care: Detecting highly informative events in electronic health records with foundation models |
利用电子病历中的Foundation Model量化临床诊疗中的“意外”事件,从而检测高信息量事件。 |
foundation model |
|
|
| 6 |
H2Tune: Federated Foundation Model Fine-Tuning with Hybrid Heterogeneity |
H2Tune:针对模型架构和任务双重异构的联邦基础模型微调框架 |
foundation model |
|
|
| 7 |
Hybrid Hypergraph Networks for Multimodal Sequence Data Classification |
提出混合超图网络HHN,用于建模多模态时序数据分类,提升长程依赖和跨模态交互。 |
multimodal |
|
|
| 8 |
Multimodal Late Fusion Model for Problem-Solving Strategy Classification in a Machine Learning Game |
提出多模态晚期融合模型,用于机器学习游戏中问题解决策略分类 |
multimodal |
|
|
| 9 |
On the Sustainability of AI Inferences in the Edge |
边缘AI推理可持续性研究:针对不同边缘设备和模型的性能与能耗权衡分析 |
large language model |
|
|
| 10 |
KLLM: Fast LLM Inference with K-Means Quantization |
KLLM:基于K-Means量化的快速LLM推理加速器 |
large language model |
|
|
| 11 |
Stop Evaluating AI with Human Tests, Develop Principled, AI-specific Tests instead |
呼吁停止使用人类测试评估AI,转而开发AI专属的、基于原则的测试方法 |
large language model |
|
|
| 12 |
Agentic Privacy-Preserving Machine Learning |
提出Agentic-PPML框架,提升隐私保护大语言模型推理的实用性 |
large language model |
|
|
| 13 |
Breaking Obfuscation: Cluster-Aware Graph with LLM-Aided Recovery for Malicious JavaScript Detection |
提出DeCoda框架,结合LLM去混淆和聚类感知图学习,提升恶意JavaScript代码检测效果。 |
large language model |
|
|