Two-Sample Tests for Optimal Lifts, Manifold Stability and Reverse Labeling Reflection Shap
Do Tran Van, Susovan Pal, Benjamin Eltzner, Stephan F. Huckemann
TL;DR
The paper addresses inference on shape data living on quotient manifolds $Q = M/G$ by introducing optimal lifts of quotient data back to the ambient manifold $M$ to mitigate curvature-induced bias in Fréchet-mean analysis. It establishes measurability, almost-everywhere uniqueness, and smoothness of optimal lifts, proves a manifold-stability theorem ensuring Fréchet means lie on the manifold part $Q^*$ with positive probability, and proves a strong law for optimal lifts that underpins new two-sample tests. Building four testing procedures based on lifting each sample optimally (with pooled and individual variants, intrinsic versions, and bootstrap calibrations), the work demonstrates superior performance over classical Procrustes-type tests on simulated data and in a reverse-labeling planar shape space modeling filament-like biology. It further introduces reverse labeling reflection shape spaces to model end-to-end label reversals in filament data and applies the methods to microtubule–intermediate filament interactions, offering a practical approach for detecting group differences in curved shape spaces. Overall, the framework provides theoretically grounded, computationally feasible tools for hypothesis testing on non-Euclidean shape data with tangible applications in cell biology.
Abstract
We consider a quotient of a complete Riemannian manifold modulo an isometrically and properly acting Lie group and lifts of the quotient to the manifolds in optimal position to a reference point on the manifold. With respect to the pushed forward Riemannian volume onto the quotient we derive continuity and uniqueness a.e. and smoothness to large extents also with respect to the reference point. In consequence we derive a general manifold stability theorem: the Fréchet mean lies in the highest dimensional stratum assumed with positive probability, and a strong law for optimal lifts. This allows to define new two-sample tests utilizing individual optimal lifts which outperform existing two-sample tests on simulated data. They also outperform existing tests on a newly derived reverse labeling reflection shape space, that is used to model filament data of microtubules within cells in a biological application.
