| 1 |
A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future |
Surveys multimodal explainable AI (MXAI) methods that address the AI black-box problem and improve transparency and trust. |
large language model, foundation model, multimodal |
✅ |
|
| 2 |
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning |
Proposes Visual-Predictive Instruction Tuning to improve multimodal understanding and generation |
multimodal, instruction following |
|
|
| 3 |
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models |
InstructSeg: a unified framework for instructed visual segmentation with multimodal large language models |
large language model |
✅ |
|
| 4 |
Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models |
Explores zero-shot prompting and few-shot fine-tuning of large language models for document image classification |
large language model |
|
|
| 5 |
MedCoT: Medical Chain of Thought via Hierarchical Expert |
Proposes MedCoT, a medical visual question answering method based on reasoning chains verified by hierarchical experts |
chain-of-thought |
|
|
| 6 |
G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o |
Proposes G-VEval, a GPT-4o-based metric for evaluating image and video caption quality, and builds the MSVD-Eval dataset. |
large language model, multimodal, chain-of-thought |
✅ |
|
| 7 |
CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers |
Proposes CAD-Assistant, a tool-augmented VLLM that serves as a generic CAD task solver |
large language model, multimodal |
|
|
| 8 |
LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer |
LLaVA-UHD v2: a multimodal large language model that integrates a high-resolution semantic pyramid via a hierarchical window Transformer |
large language model, multimodal |
|
|
| 9 |
AnySat: One Earth Observation Model for Many Resolutions, Scales, and Modalities |
AnySat: proposes a unified Earth observation model that handles multi-resolution, multi-scale, and multimodal data. |
multimodal |
✅ |
|
| 10 |
Real Classification by Description: Extending CLIP's Limits of Part Attributes Recognition |
Proposes the task of real classification by description, extending CLIP's ability to recognize part attributes |
large language model |
✅ |
|
| 11 |
Prompt Categories Cluster for Weakly Supervised Semantic Segmentation |
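Proposes the Prompt Categories Cluster (PCC) framework, which uses LLMs for weakly supervised semantic segmentation and improves learning of inter-category relationships. |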
large language model |
|
|
| 12 |
Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection |
Nullu: mitigates object hallucinations in large vision-language models via HalluSpace projection |
large language model |
✅ |
|