Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes
Duy-Tho Le, Trung Pham, Jianfei Cai, Hamid Rezatofighi
TL;DR
MGIoU introduces a unified, differentiable loss for optimizing convex parametric shapes by projecting shapes onto a set of normals and computing a one-dimensional GIoU per direction, then averaging to form MGIoU. It extends to MGIoU$^+$ for unstructured shapes with a convexity regularizer and MGIoU$^-$ for minimizing overlaps in trajectory prediction, creating a general framework across 2D/3D, rotated geometries, and temporal sequences. Empirically, MGIoU and its variants outperform strong baselines across 2D oriented detection, monocular 3D 6-DoF recognition, quadrangle detection, and collision-avoidant trajectory prediction, while offering substantial latency reductions (10–40x) and satisfying core metric properties, including scale invariance. The approach provides a practical, unified tool for shape optimization with broad applicability and improved robustness in real-world tasks.
Abstract
Optimizing the similarity between parametric shapes is crucial for numerous computer vision tasks, where Intersection over Union (IoU) stands as the canonical measure. However, existing optimization methods exhibit significant shortcomings: regression-based losses like L1/L2 lack correlation with IoU, IoU-based losses are unstable and limited to simple shapes, and task-specific methods are computationally intensive and do not generalize across domains. As a result, the current landscape of parametric shape objective functions has become scattered, with each domain proposing distinct IoU approximations. To address this, we unify the parametric shape optimization objective functions by introducing Marginalized Generalized IoU (MGIoU), a novel loss function that overcomes these challenges by projecting structured convex shapes onto their unique shape normals to compute one-dimensional normalized GIoU. MGIoU offers a simple, efficient, fully differentiable approximation strongly correlated with IoU. We then extend MGIoU to MGIoU+, which supports optimizing unstructured convex shapes. Together, MGIoU and MGIoU+ unify parametric shape optimization across diverse applications. Experiments on standard benchmarks demonstrate that MGIoU and MGIoU+ consistently outperform existing losses while reducing loss computation latency by 10-40x. Additionally, MGIoU and MGIoU+ satisfy metric properties and scale-invariance, ensuring robustness as an objective function. We further propose MGIoU- for minimizing overlaps in tasks like collision-free trajectory prediction. Code is available at https://ldtho.github.io/MGIoU
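To make the projection idea concrete, the following is a minimal NumPy sketch of the core computation for 2D convex polygons; it is not the released implementation. It rests on several assumptions not stated in the abstract: the projection directions are the edge normals of both shapes (duplicates are kept rather than reduced to unique normals), the one-dimensional GIoU follows the standard GIoU definition applied to intervals, and the per-direction scores are averaged with equal weight. The function names `edge_normals`, `giou_1d`, and `mgiou_sketch` are hypothetical.

```python
import numpy as np


def edge_normals(poly):
    """Unit edge normals of a polygon given as an (N, 2) array of ordered vertices."""
    edges = np.roll(poly, -1, axis=0) - poly          # edge i goes from vertex i to i+1
    normals = np.stack([-edges[:, 1], edges[:, 0]], axis=1)
    return normals / np.linalg.norm(normals, axis=1, keepdims=True)


def giou_1d(a_min, a_max, b_min, b_max):
    """Standard GIoU applied to 1D intervals [a_min, a_max] and [b_min, b_max]."""
    inter = np.maximum(0.0, np.minimum(a_max, b_max) - np.maximum(a_min, b_min))
    union = (a_max - a_min) + (b_max - b_min) - inter
    hull = np.maximum(a_max, b_max) - np.minimum(a_min, b_min)
    return inter / union - (hull - union) / hull


def mgiou_sketch(pred, target):
    """Average 1D GIoU over the edge normals of both shapes (assumed direction set)."""
    normals = np.concatenate([edge_normals(pred), edge_normals(target)])
    p = pred @ normals.T      # (N_pred, K) projections of predicted vertices
    t = target @ normals.T    # (N_target, K) projections of target vertices
    scores = giou_1d(p.min(axis=0), p.max(axis=0), t.min(axis=0), t.max(axis=0))
    return scores.mean()


square = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]])
same = mgiou_sketch(square, square)                            # identical shapes
shifted = mgiou_sketch(square, square + np.array([0.5, 0.0]))  # partial overlap
apart = mgiou_sketch(square, square + np.array([3.0, 0.0]))    # no overlap
```

Under these assumptions the score is 1 for identical shapes, decreases as the shapes separate, and stays informative (negative along the separating direction) when they no longer overlap. Because each operation is a projection, a min/max, or elementwise arithmetic, the same steps carry over to an autodiff framework. The sketch assumes non-degenerate polygons, since a zero-length edge or a zero-extent projection would cause a division by zero.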
