| 1 |
Testing and Evaluation of Large Language Models: Correctness, Non-Toxicity, and Fairness |
A study on testing and evaluating large language models for correctness, non-toxicity, and fairness |
large language model |
|
|
| 2 |
Large Language Models-Enabled Digital Twins for Precision Medicine in Rare Gynecological Tumors |
Uses large language models to build digital twins that support precision medicine for rare gynecological tumors |
large language model |
|
|
| 3 |
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models |
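LongRecipe: an efficient training method for long-context generalization that significantly extends the context window of large language models |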
large language model |
✅ |
|
| 4 |
An Empirical Study on Information Extraction using Large Language Models |
Evaluates GPT-4's capabilities on information extraction tasks and proposes prompting methods to improve its performance |
large language model |
|
|
| 5 |
Does Alignment Tuning Really Break LLMs' Internal Confidence? |
Studies how alignment tuning affects the calibration of LLMs' internal confidence, revealing calibration degradation |
large language model, instruction following |
|
|
| 6 |
Learning to Ask: When LLM Agents Meet Unclear Instruction |
Proposes the Ask-when-Needed framework to improve LLM tool use when instructions are unclear |
large language model |
|
|
| 7 |
From Prediction to Application: Language Model-based Code Knowledge Tracing with Domain Adaptive Pre-Training and Automatic Feedback System with Pedagogical Prompting for Comprehensive Programming Education |
Proposes CodeLKT: language model-based code knowledge tracing with an adaptive feedback system |
large language model |
|
|