| 1 |
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models |
ThinkDiff: Gives diffusion models multimodal in-context reasoning ability by aligning them with vision-language models |
large language model multimodal |
✅ |
|
| 2 |
Continuous Cardiac Arrest Prediction in ICU using PPG Foundation Model |
Proposes FEAN, a network built on a pretrained PPG model, for continuous cardiac arrest prediction in the ICU. |
foundation model |
|
|
| 3 |
Mathematical Reasoning in Large Language Models: Assessing Logical and Arithmetic Errors across Wide Numerical Ranges |
Proposes the GSM-Ranges dataset and a new evaluation method to assess LLM mathematical reasoning across different numerical ranges |
large language model |
|
|
| 4 |
Quality over Quantity: Boosting Data Efficiency Through Ensembled Multimodal Data Curation |
EcoDatum: Improves data efficiency by ensembling multimodal data curation operators, addressing quality problems in web-crawled datasets. |
multimodal |
|
|
| 5 |
LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search |
LLM4GNAS: A large language model based toolkit for graph neural architecture search |
large language model |
|
|
| 6 |
Spectral Journey: How Transformers Predict the Shortest Path |
How Transformers learn shortest paths: reveals the link between spectral decomposition and path planning |
large language model |
|
|
| 7 |
Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks |
Reveals simple yet dangerous attack vulnerabilities in commercial LLM agents; the attacks require no machine learning knowledge |
large language model |
|
|
| 8 |
The Paradox of Stochasticity: Limited Creativity and Computational Decoupling in Temperature-Varied LLM Outputs of Structured Fictional Data |
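Study finds that temperature has limited effect on LLM generation of structured fictional data; model architecture is the key factor in performance |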
large language model |
|
|
| 9 |
Trustworthy GNNs with LLMs: A Systematic Review and Taxonomy |
Survey: Using large language models to improve the trustworthiness of graph neural networks, with a systematic taxonomy |
large language model |
|
|
| 10 |
Self-Evaluation for Job-Shop Scheduling |
Proposes a self-evaluation-based method for job-shop scheduling that surpasses the existing state of the art |
large language model |
|
|
| 11 |
Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers |
AxoNN: An open-source, scalable LLM training framework for efficient training on GPU supercomputers. |
large language model |
|
|
| 12 |
One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs |
Proposes the CounterMATH benchmark to improve counterexample-based conceptual reasoning in mathematical LLMs |
large language model |
|
|
| 13 |
LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits |
LowRA: Accurate and efficient LoRA fine-tuning of LLMs below 2 bits |
large language model |
|
|