| 1 |
A Survey on Large Language Models in Multimodal Recommender Systems |
综述:大型语言模型赋能多模态推荐系统,探索新型集成模式与技术挑战。 |
large language model multimodal |
|
|
| 2 |
Do Large Language Models Know Conflict? Investigating Parametric vs. Non-Parametric Knowledge of LLMs for Conflict Forecasting |
研究大型语言模型在冲突预测中的参数化与非参数化知识能力 |
large language model |
|
|
| 3 |
WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models |
提出WorldView-Bench基准,评估大型语言模型中的全球文化视角包容性 |
large language model |
|
|
| 4 |
LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis |
LLM4CD:利用大语言模型增强认知诊断的开放世界知识 |
large language model |
|
|
| 5 |
Large Language Models Are More Persuasive Than Incentivized Human Persuaders |
比较大型语言模型与人类说服者的说服能力 |
large language model |
|
|
| 6 |
Source framing triggers systematic evaluation bias in Large Language Models |
源框架影响大语言模型评估,揭示系统性评估偏差 |
large language model |
|
|
| 7 |
A Data Synthesis Method Driven by Large Language Models for Proactive Mining of Implicit User Intentions in Tourism |
SynPT:一种基于大语言模型的旅游领域隐式用户意图主动挖掘数据合成方法 |
large language model |
|
|
| 8 |
A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias |
大规模语言模型输出分析:相似性、多样性与偏见研究 |
large language model |
|
|
| 9 |
System Prompt Optimization with Meta-Learning |
提出基于元学习的系统提示优化方法,提升LLM在多任务和多领域上的泛化能力。 |
large language model |
|
|
| 10 |
KRISTEVA: Close Reading as a Novel Task for Benchmarking Interpretive Reasoning |
提出KRISTEVA基准,用于评估LLM在文学作品解读推理中的能力。 |
large language model |
|
|
| 11 |
VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts |
VeriFact:通过精细化事实抽取和参考事实增强长文本事实性评估 |
large language model |
|
|
| 12 |
Towards Automated Situation Awareness: A RAG-Based Framework for Peacebuilding Reports |
提出基于RAG的自动化情境感知框架,用于生成维和报告,加速决策过程。 |
large language model |
|
|
| 13 |
Tales of the 2025 Los Angeles Fire: Hotwash for Public Health Concerns in Reddit via LLM-Enhanced Topic Modeling |
利用LLM增强的主题建模分析Reddit中2025年洛杉矶火灾的公共健康问题 |
large language model |
|
|
| 14 |
Qwen3 Technical Report |
Qwen3:融合思考与非思考模式的大语言模型,提升性能、效率和多语言能力 |
large language model |
|
|
| 15 |
A Scalable Unsupervised Framework for multi-aspect labeling of Multilingual and Multi-Domain Review Data |
提出一种可扩展的无监督多语言多领域评论数据多方面标注框架。 |
large language model |
|
|
| 16 |
Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications |
Ornithologist:一种基于弱监督和决策树的中央银行沟通内容分析系统 |
large language model |
|
|
| 17 |
S-DAT: A Multilingual, GenAI-Driven Framework for Automated Divergent Thinking Assessment |
S-DAT:一种基于GenAI的多语言发散思维自动评估框架 |
large language model |
|
|