Geometric Learning of Canonical Parameterizations of $2D$-curves
Ioana Ciuclea, Giorgio Longari, Alice Barbara Tumpach
TL;DR
This work develops a geometry-inspired framework to normalize 2D contours with respect to shape-preserving symmetries by employing sections of the principal fiber bundle of parameterized curves. It introduces a two-parameter family of canonical parameterizations, the curvature-weighted clock parameterizations, and defines a simple L2-based distance restricted to a chosen section, enabling metric learning to maximize class separation. Through extensive experiments on leaf contour datasets, including Swedish leaves and Flavia, the authors demonstrate that optimally chosen sections substantially improve clustering and classification with minimal parameterization. The approach also provides automatic contour registration and deformations, and the authors show that the method can serve as an efficient pre-processing step for more complex classifiers, with potential extensions to higher-dimensional shape spaces. Code and tutorials are provided to facilitate adoption and replication.
Abstract
Most datasets encountered in computer vision and medical applications present symmetries that should be taken into account in classification tasks. A typical example is the symmetry by rotation and/or scaling in object detection. A common way to build neural networks that learn the symmetries is to use data augmentation. In order to avoid data augmentation and build more sustainable algorithms, we present an alternative method to mod out symmetries based on the notion of section of a principal fiber bundle. This framework allows the use of simple metrics on the space of objects in order to measure dissimilarities between orbits of objects under the symmetry group. Moreover, the section used can be optimized to maximize separation of classes. We illustrate this methodology on a dataset of contours of objects for the groups of translations, rotations, scalings and reparameterizations. In particular, we present a $2$-parameter family of canonical parameterizations of curves, containing the constant-speed parameterization as a special case, which we believe is interesting in its own right. We hope that this simple application will serve to convey the geometric concepts underlying this method, which have a wide range of possible applications. The code is available at the following link: $\href{https://github.com/GiLonga/Geometric-Learning}{https://github.com/GiLonga/Geometric-Learning}$. A tutorial notebook showcasing an application of the code to a specific dataset is available at the following link: $\href{https://github.com/ioanaciuclea/geometric-learning-notebook}{https://github.com/ioanaciuclea/geometric-learning-notebook}$
