| 1 |
Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models |
提出LS-Mixture SFT,解决SFT微调中LLM的过度推理问题,提升推理效率。 |
large language model chain-of-thought |
|
|
| 2 |
Advancing Conversational Diagnostic AI with Multimodal Reasoning |
AMIE:基于多模态推理提升对话式诊断AI的性能 |
large language model multimodal |
|
|
| 3 |
SLOT: Structuring the Output of Large Language Models |
SLOT:通过后处理转换LLM输出为结构化格式,提升下游任务可靠性 |
large language model |
|
|
| 4 |
Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions |
综述:大型语言模型驱动的科学假设生成与验证方法 |
large language model multimodal symbolic grounding |
|
|
| 5 |
MedArabiQ: Benchmarking Large Language Models on Arabic Medical Tasks |
MedArabiQ:构建阿拉伯语医疗任务基准,评估并提升LLM在医疗领域的应用。 |
large language model |
|
|
| 6 |
TeleEval-OS: Performance evaluations of large language models for operations scheduling |
TeleEval-OS:首个面向电信运营调度的LLM性能评估基准 |
large language model |
|
|
| 7 |
Integration of Large Language Models and Traditional Deep Learning for Social Determinants of Health Prediction |
结合大语言模型与传统深度学习,用于预测健康的社会决定因素 |
large language model |
|
|
| 8 |
BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models |
提出BadLingual,一种针对大型语言模型的任务无关的语言后门攻击。 |
large language model |
|
|
| 9 |
Uncertainty-Aware Large Language Models for Explainable Disease Diagnosis |
ConfiDx:面向可解释疾病诊断的、能感知不确定性的大语言模型 |
large language model |
|
|
| 10 |
Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation |
提出基于QLoRA微调LLaMA 3.2-3B和RAG的轻量级临床决策支持系统 |
large language model foundation model |
|
|
| 11 |
Faster MoE LLM Inference for Extremely Large Models |
针对超大MoE LLM,提出更快速的推理方法,提升效率并优化性能。 |
large language model |
|
|
| 12 |
A Comparative Analysis of Ethical and Safety Gaps in LLMs using Relative Danger Coefficient |
提出相对危险系数RDC,用于比较评估不同LLM的伦理和安全差距 |
large language model |
|
|
| 13 |
Say It Another Way: Auditing LLMs with a User-Grounded Automated Paraphrasing Framework |
AUGMENT:一种用户行为驱动的LLM自动复述框架,用于可靠的审计。 |
large language model |
|
|
| 14 |
Ψ-Arena: Interactive Assessment and Optimization of LLM-based Psychological Counselors with Tripartite Feedback |
提出Ψ-Arena,通过三方反馈交互式评估和优化基于LLM的心理咨询师。 |
large language model |
|
|