QAL: A Loss for Recall Precision Balance in 3D Reconstruction
Pranay Meshram, Yash Turkar, Kartikeya Singh, Praveen Raj Masilamani, Charuvahan Adhivarahan, Karthik Dantu
TL;DR
QAL addresses a critical gap in 3D vision losses by explicitly balancing recall and precision through a coverage-weighted matching term and an uncovered-GT attraction term, formulated as $L_{\mathrm{QAL}} = L_{\mathrm{cov}} + \lambda_{\mathrm{attr}} L_{\mathrm{attr}}$. It acts as a drop-in replacement for Chamfer Distance and Earth Mover’s Distance, improving surface coverage and reducing holes while managing spurious predictions. Across point-cloud completion, single-view reconstruction, and image-to-mesh pipelines, QAL yields consistent coverage gains (averaging +4.3 points) and enhances downstream grasping performance, with robustness to resolution and architecture. The method aligns training with thresholded metrics like Cov@$\tau$ and SP@$\tau$, offering a practical, interpretable objective for robust 3D vision and safety-critical robotics, and is accompanied by efficient GPU implementations for wide adoption.
Abstract
Volumetric learning underpins many 3D vision tasks such as completion, reconstruction, and mesh generation, yet training objectives still rely on Chamfer Distance (CD) or Earth Mover's Distance (EMD), which fail to balance recall and precision. We propose Quality-Aware Loss (QAL), a drop-in replacement for CD/EMD that combines a coverage-weighted nearest-neighbor term with an uncovered-ground-truth attraction term, explicitly decoupling recall and precision into tunable components. Across diverse pipelines, QAL achieves consistent coverage gains, improving by an average of +4.3 pts over CD and +2.8 pts over the best alternatives. Though modest in percentage, these improvements reliably recover thin structures and under-represented regions that CD/EMD overlook. Extensive ablations confirm stable performance across hyperparameters and across output resolutions, while full retraining on PCN and ShapeNet demonstrates generalization across datasets and backbones. Moreover, QAL-trained completions yield higher grasp scores under GraspNet evaluation, showing that improved coverage translates directly into more reliable robotic manipulation. QAL thus offers a principled, interpretable, and practical objective for robust 3D vision and safety-critical robotics pipelines
