Learning Conditionally Independent Transformations using Normal Subgroups in Group Theory
Kayato Nishitsunoi, Yoshiyuki Ohmura, Takayuki Komatsu, Yasuo Kuniyoshi
TL;DR
This work addresses unsupervised disentanglement of conditionally independent, noncommutative transformations by grounding representation learning in Galois-inspired normal-subgroup decompositions. It introduces a homomorphism-based formulation where a normal subgroup $N$ corresponds to one transformation component and the quotient captures the residual, enabling conditional independence beyond commutativity. A NeuralODE-based coordinate transformation model learns two transformations $g$ and $v$, enforces a homomorphism constraint, and uses reconstruction, self-supervision, and isometry losses to discover rotation and translation in image sequences without labels. Empirical results on geometric transformations demonstrate successful separation and consistent identification of rotation and translation, suggesting a principled extension of representation learning to structured transformation decompositions with potential applicability to broader group-theoretic learning paradigms.
Abstract
Humans develop certain cognitive abilities to recognize objects and their transformations without explicit supervision, highlighting the importance of unsupervised representation learning. A fundamental challenge in unsupervised representation learning is to separate different transformations in learned feature representations. Although algebraic approaches have been explored, a comprehensive theoretical framework remains underdeveloped. Existing methods decompose transformations based on algebraic independence, but these methods primarily focus on commutative transformations and do not extend to cases where transformations are conditionally independent but noncommutative. To extend current representation learning frameworks, we draw inspiration from Galois theory, where the decomposition of groups through normal subgroups provides an approach for the analysis of structured transformations. Normal subgroups naturally extend commutativity under certain conditions and offer a foundation for the categorization of transformations, even when they do not commute. In this paper, we propose a novel approach that leverages normal subgroups to enable the separation of conditionally independent transformations, even in the absence of commutativity. Through experiments on geometric transformations in images, we show that our method successfully categorizes conditionally independent transformations, such as rotation and translation, in an unsupervised manner, suggesting a close link between group decomposition via normal subgroups and transformation categorization in representation learning.
