| 10 | VideoRewardBench: Comprehensive Evaluation of Multimodal Reward Models for Video Understanding | Proposes VideoRewardBench to address the insufficient evaluation of multimodal reward models in video understanding | reinforcement learning multimodal | | |
| 11 | SemaMIL: Semantic-Aware Multiple Instance Learning with Retrieval-Guided State Space Modeling for Whole Slide Images | Proposes SemaMIL to address multiple instance learning on whole slide images | SSM state space model | | |
| 12 | MorphGen: Morphology-Guided Representation Learning for Robust Single-Domain Generalization in Histopathological Cancer Classification | Proposes MorphGen to address domain generalization in histopathological cancer classification | representation learning contrastive learning | ✅ | |
| 13 | Make me an Expert: Distilling from Generalist Black-Box Models into Specialized Models for Semantic Segmentation | Proposes a black-box distillation method to address the problem of training local models | distillation open-vocabulary open vocabulary | ✅ | |
| 14 | Context-Aware Knowledge Distillation with Adaptive Weighting for Image Classification | Proposes an adaptive knowledge distillation framework to improve image classification performance | distillation | | |
| 15 | LUT-Fuse: Towards Extremely Fast Infrared and Visible Image Fusion via Distillation to Learnable Look-Up Tables | Proposes LUT-Fuse to address real-time infrared and visible image fusion | distillation | ✅ | |
| 16 | Multi-Focused Video Group Activities Hashing | Proposes multi-focused video group activity hashing to address video retrieval | representation learning spatiotemporal | | |