| 1 |
Visual Grounding Methods for Efficient Interaction with Desktop Graphical User Interfaces |
针对GUI交互,提出两种Instruction Visual Grounding方法以提升自动化效率 |
large language model multimodal visual grounding |
|
|
| 2 |
High Order Reasoning for Time Critical Recommendation in Evidence-based Medicine |
提出基于高阶推理的LLM模型,用于循证医学中的时间敏感型推荐 |
large language model |
|
|
| 3 |
On the performativity of SDG classifications in large bibliometric databases |
利用大语言模型评估SDG分类对文献计量数据库的数据偏差影响 |
large language model |
|
|
| 4 |
Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models |
提出 Mozart's Touch 框架,利用预训练大模型实现多模态音乐生成。 |
large language model |
✅ |
|