Tail-Aware Post-Training Quantization for 3D Geometry Models
作者: Sicheng Pan, Chen Tang, Shuzhao Xie, Ke Yang, Weixiang Zhang, Jiawei Li, Bin Chen, Shu-Tao Xia, Zhi Wang
分类: cs.CV
发布日期: 2026-02-02
💡 一句话要点
提出TAPTQ以解决3D几何模型量化问题
🎯 匹配领域: 支柱三:空间感知与语义 (Perception & Semantics)
关键词: 后训练量化 3D几何模型 量化优化 模块级补偿 尾部感知
📋 核心要点
- 现有的后训练量化方法在处理3D几何模型时面临数据规模瓶颈和校准开销过大的挑战。
- 本文提出的TAPTQ通过渐进式校准策略、优化问题重构和模块级补偿等方法有效解决了量化过程中的问题。
- 在VGGT和Pi3基准测试中,TAPTQ在准确性上超越了最先进的PTQ方法,并显著缩短了校准时间。
📝 摘要(中文)
随着3D几何模型的复杂性和规模不断增加,如何在资源受限的平台上高效部署成为一大挑战。尽管后训练量化(PTQ)能够在不重新训练的情况下实现高效推理,但现有方法主要针对2D视觉变换器,难以有效迁移到3D模型。为此,本文提出了TAPTQ,一个专门为3D几何学习设计的尾部感知后训练量化管道。我们的贡献包括:开发渐进式粗到细的校准构建策略、将量化区间搜索重构为优化问题并引入三分搜索求解器,以及提出TRE引导的模块级补偿以减轻量化误差累积。大量实验表明,TAPTQ在准确性上始终优于现有的PTQ方法,同时显著减少了校准时间。
🔬 方法详解
问题定义:本文旨在解决3D几何模型在资源受限平台上的量化问题。现有的PTQ方法主要针对2D视觉变换器,难以有效适应3D模型,导致数据规模瓶颈和校准开销过大。
核心思路:TAPTQ通过引入尾部感知的校准策略,优化量化区间搜索,并采用模块级补偿来减轻量化误差,从而提高3D模型的推理效率和准确性。
技术框架:TAPTQ的整体架构包括三个主要模块:渐进式校准构建、量化区间优化和TRE引导的模块级补偿。首先,通过构建紧凑的子集进行校准;其次,将量化区间搜索转化为优化问题;最后,利用TRE指标进行模块级的误差补偿。
关键创新:TAPTQ的核心创新在于其渐进式校准策略和TRE引导的模块级补偿,这些设计使得量化过程更具针对性和有效性,显著提升了3D模型的性能。
关键设计:在量化区间搜索中,采用三分搜索求解器将计算复杂度降低至$ ext{O}( ext{log } N)$,并通过TRE指标识别和修正长尾激活异常的模块,确保量化后的模型在推理时保持高准确性。
🖼️ 关键图片
📊 实验亮点
在VGGT和Pi3基准测试中,TAPTQ在准确性上超越了现有的最先进PTQ方法,校准时间显著减少,具体提升幅度未知。这表明TAPTQ在3D几何模型量化方面的有效性和优越性。
🎯 应用场景
TAPTQ的研究成果可广泛应用于需要在资源受限设备上运行的3D几何模型,如移动设备、嵌入式系统和实时图形处理等领域。其高效的量化方法将推动3D视觉技术的普及与应用,具有重要的实际价值和未来影响。
📄 摘要(原文)
The burgeoning complexity and scale of 3D geometry models pose significant challenges for deployment on resource-constrained platforms. While Post-Training Quantization (PTQ) enables efficient inference without retraining, conventional methods, primarily optimized for 2D Vision Transformers, fail to transfer effectively to 3D models due to intricate feature distributions and prohibitive calibration overhead. To address these challenges, we propose TAPTQ, a Tail-Aware Post-Training Quantization pipeline specifically engineered for 3D geometric learning. Our contribution is threefold: (1) To overcome the data-scale bottleneck in 3D datasets, we develop a progressive coarse-to-fine calibration construction strategy that constructs a highly compact subset to achieve both statistical purity and geometric representativeness. (2) We reformulate the quantization interval search as an optimization problem and introduce a ternary-search-based solver, reducing the computational complexity from $\mathcal{O}(N)$ to $\mathcal{O}(\log N)$ for accelerated deployment. (3) To mitigate quantization error accumulation, we propose TRE-Guided Module-wise Compensation, which utilizes a Tail Relative Error (TRE) metric to adaptively identify and rectify distortions in modules sensitive to long-tailed activation outliers. Extensive experiments on the VGGT and Pi3 benchmarks demonstrate that TAPTQ consistently outperforms state-of-the-art PTQ methods in accuracy while significantly reducing calibration time. The code will be released soon.