| 1 | VLM-KG: Multimodal Radiology Knowledge Graph Generation | Proposes VLM-KG to address radiology knowledge graph generation | multimodal instruction following | | |
| 2 | Large Language Models Meet Stance Detection: A Survey of Tasks, Methods, Applications, Challenges and Future Directions | Presents a systematic analysis to advance LLM-based stance detection research | large language model multimodal | | |
| 3 | Accelerating Chain-of-Thought Reasoning: When Goal-Gradient Importance Meets Dynamic Skipping | Proposes Adaptive GoGI-Skip to address the inefficiency of long-text reasoning | large language model chain-of-thought | | |
| 4 | ALOHA: Empowering Multilingual Agent for University Orientation with Hierarchical Retrieval | Proposes ALOHA to address multilingual information retrieval on university campuses | large language model Aloha | | |
| 5 | HealthBench: Evaluating Large Language Models Towards Improved Human Health | Proposes HealthBench to evaluate the performance of large language models in healthcare | large language model instruction following | | |
| 6 | Enhancing Thyroid Cytology Diagnosis with RAG-Optimized LLMs and Pathology Foundation Models | Proposes RAG-optimized LLMs combined with pathology foundation models to improve thyroid cytology diagnosis | large language model foundation model | | |
| 7 | Aya Vision: Advancing the Frontier of Multilingual Multimodality | Proposes Aya Vision to address challenges in building multilingual multimodal models | multimodal | | |
| 8 | LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models | Proposes the LCES method to address bias in zero-shot automated essay scoring | large language model | | |
| 9 | Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement | Proposes psychometric methods for large language models to address evaluation and validation challenges | large language model | ✅ | |
| 10 | HCR-Reasoner: Synergizing Large Language Models and Theory for Human-like Causal Reasoning | Proposes HCR-Reasoner to address systematic problems in human-like causal reasoning | large language model | | |
| 11 | Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies | Proposes a consistency-based probability learning method to address bias in LLMs | large language model | | |
| 12 | NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context | Proposes the NurValues benchmark to evaluate nursing value alignment in clinical settings | large language model | ✅ | |
| 13 | Small but Significant: On the Promise of Small Language Models for Accessible AIED | Proposes small language models to address accessibility in education | large language model | | |
| 14 | Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration | Proposes adaptive contextual compression to address the scalability of CAG | large language model | | |
| 15 | A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs | Proposes pre-trained uncertainty quantification modules to address hallucination detection in LLMs | large language model | | |
| 16 | A suite of LMs comprehend puzzle statements as well as humans | Re-evaluates how well large language models comprehend English statements | large language model | | |
| 17 | Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation | Proposes an adaptive schema-aware event extraction method to address the limitations of existing methods | large language model | | |
| 18 | Automatic Task Detection and Heterogeneous LLM Speculative Decoding | Proposes automatic task detection and heterogeneous LLM speculative decoding to optimize downstream tasks | large language model | | |
| 19 | LibVulnWatch: A Deep Assessment Agent System and Leaderboard for Uncovering Hidden Vulnerabilities in Open-Source AI Libraries | Proposes LibVulnWatch to address hidden security risks in open-source AI libraries | large language model | | |
| 20 | IterKey: Iterative Keyword Generation with LLMs for Enhanced Retrieval Augmented Generation | Proposes IterKey to address accuracy and interpretability in RAG | large language model | | |
| 21 | A document processing pipeline for the construction of a dataset for topic modeling based on the judgments of the Italian Supreme Court | Proposes a document processing pipeline to build a topic modeling dataset from Italian Supreme Court judgments | large language model | | |
| 22 | TUMS: Enhancing Tool-use Abilities of LLMs with Multi-structure Handlers | Proposes the TUMS framework to improve the tool-use abilities of LLMs | large language model | | |
| 23 | Towards Contamination Resistant Benchmarks | Proposes contamination-resistant benchmarks to address the reliability of LLM evaluation | large language model | | |
| 24 | Evaluating the Effectiveness of Black-Box Prompt Optimization as the Scale of LLMs Continues to Grow | Evaluates the effectiveness of black-box prompt optimization for large-scale LLMs | large language model | | |