| 1 |
Ultrasound Report Generation with Multimodal Large Language Models for Standardized Texts |
提出基于多模态大语言模型的超声报告生成框架,实现标准化文本输出。 |
large language model multimodal |
|
|
| 2 |
Decoding Neighborhood Environments with Large Language Models |
利用大型语言模型解码社区环境:无需训练,实现高精度环境要素识别。 |
large language model |
|
|
| 3 |
Optimized Couplings for Watermarking Large Language Models |
针对大语言模型,提出优化耦合的水印方案,提升检测能力并降低文本质量损失。 |
large language model |
✅ |
|
| 4 |
DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models |
提出DeepMath-Creative以评估大语言模型的数学创造力 |
large language model |
|
|
| 5 |
CellTypeAgent: Trustworthy cell type annotation with Large Language Models |
CellTypeAgent:利用大语言模型实现可信的细胞类型注释 |
large language model |
|
|
| 6 |
The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News |
提出TruEDebate,利用多智能体辩论系统与大语言模型提升假新闻检测的解释性和有效性 |
large language model |
|
|
| 7 |
Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People |
针对听障人士语音清晰度预测,揭示语音基础模型应用的最佳实践 |
foundation model |
|
|
| 8 |
Federated Large Language Models: Feasibility, Robustness, Security and Future Directions |
综述联邦大语言模型(FLLM)在可行性、鲁棒性、安全性的挑战与未来方向 |
large language model |
|
|
| 9 |
TrialMatchAI: An End-to-End AI-powered Clinical Trial Recommendation System to Streamline Patient-to-Trial Matching |
TrialMatchAI:端到端AI临床试验推荐系统,加速患者与试验匹配 |
large language model chain-of-thought |
|
|
| 10 |
Lost in Transmission: When and Why LLMs Fail to Reason Globally |
提出BAPO模型,揭示LLM全局推理失败源于内部通信带宽限制 |
large language model chain-of-thought |
|
|
| 11 |
Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification |
结合CoT、RAG、自洽性和自验证,提升大型语言模型的可靠性 |
large language model chain-of-thought |
|
|
| 12 |
Tests as Prompt: A Test-Driven-Development Benchmark for LLM Code Generation |
WebApp1K:提出测试驱动开发基准,评估LLM从测试用例生成代码的能力 |
large language model instruction following |
|
|
| 13 |
AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques |
综述AI与生成式AI在灾害管理中的应用,聚焦于灾害评估与响应技术。 |
multimodal |
|
|
| 14 |
AI-Mediated Code Comment Improvement |
提出基于AI的代码注释改进方法,利用大语言模型重写注释以提升质量 |
large language model |
|
|
| 15 |
Securing RAG: A Risk Assessment and Mitigation Framework |
提出RAG安全框架,评估并缓解检索增强生成中的安全风险 |
large language model |
|
|
| 16 |
VizCV: AI-assisted visualization of researchers' publications tracks |
VizCV:提出AI辅助的可视化框架,用于分析科研人员的论文发表轨迹。 |
large language model |
|
|
| 17 |
Resource-Efficient Language Models: Quantization for Fast and Accessible Inference |
针对大语言模型,提出后训练量化方法以加速推理并降低资源需求 |
large language model |
|
|
| 18 |
Evaluating LLM Metrics Through Real-World Capabilities |
评估LLM在真实世界能力:弥合基准测试与实际应用差距,Gemini表现突出 |
large language model |
|
|
| 19 |
Communication Styles and Reader Preferences of LLM and Human Experts in Explaining Health Information |
研究LLM与人类专家在健康信息解释中的沟通风格差异及读者偏好 |
large language model |
|
|
| 20 |
Leveraging AI for Productive and Trustworthy HPC Software: Challenges and Research Directions |
利用AI革新HPC软件开发:挑战与研究方向探讨 |
large language model |
|
|