cs.CV (2025-12-17)

📊 3 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

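#	Title	One-sentence summary	Tags	🔗	⭐
1	Seeing Beyond Words: Self-Supervised Visual Learning for Multimodal Large Language Models	Proposes the JARVIS framework, which strengthens the visual understanding of multimodal large language models (MLLMs) through self-supervised visual learning.	large language model foundation model multimodal	✅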

#	Title	One-sentence summary	Tags	🔗	⭐
2	Photorealistic Phantom Roads in Real Scenes: Disentangling 3D Hallucinations from Physical Geometry	Proposes the Grounded Self-Distillation framework to address 3D hallucinations in monocular depth estimation.	distillation monocular depth foundation model

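#	Title	One-sentence summary	Tags	🔗	⭐
3	Gaussian Pixel Codec Avatars: A Hybrid Representation for Efficient Rendering	Proposes Gaussian Pixel Codec Avatars (GPiCA), a hybrid avatar representation for efficient rendering on mobile devices.	3D gaussian splatting gaussian splatting splatting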