| 1 |
Semi-supervised classification of dental conditions in panoramic radiographs using large language model and instance segmentation: A real-world dataset evaluation |
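Proposes a semi-supervised learning framework based on a large language model and instance segmentation for classifying dental conditions in panoramic radiographs. |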
masked autoencoder large language model |
|
|
| 2 |
MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation |
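Proposes MAGIC, a Meta-Ability Guided Interactive Chain-of-Distillation method for effective and efficient vision-and-language navigation |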
teacher-student distillation VLN |
✅ |
|
| 3 |
Pamba: Enhancing Global Interaction in Point Clouds via State Space Model |
Proposes Pamba, which uses a state space model to enhance global interaction in point clouds for efficient semantic segmentation. |
Mamba SSM state space model |
|
|
| 4 |
Highly Constrained Coded Aperture Imaging Systems Design Via a Knowledge Distillation Approach |
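Proposes a knowledge-distillation-based design method for coded aperture imaging systems that addresses performance optimization under physical constraints |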
teacher-student distillation |
|
|
| 5 |
Pseudo Labelling for Enhanced Masked Autoencoders |
Proposes masked autoencoders enhanced with pseudo labelling to improve image representation learning |
masked autoencoder MAE |
|
|
| 6 |
Towards Optimal Trade-offs in Knowledge Distillation for CNNs and Vision Transformers at the Edge |
Studies optimal trade-off strategies for knowledge distillation of CNNs and ViTs on edge devices |
distillation |
|
|
| 7 |
Three-Stream Temporal-Shift Attention Network Based on Self-Knowledge Distillation for Micro-Expression Recognition |
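Proposes a three-stream temporal attention network based on self-knowledge distillation to improve micro-expression recognition performance |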
distillation |
✅ |
|
| 8 |
Video Occupancy Models |
Proposes video occupancy models to support prediction for control tasks |
world model predictive model |
✅ |
|