Feature Space Topology Control via Hopkins Loss
Einari Vaaras, Manu Airaksinen
TL;DR
This work introduces Hopkins loss, a differentiable loss based on the Hopkins statistic $H$ that can steer feature space topology toward a user-defined target $H_T$, enabling regularly-spaced, randomly-spaced, or clustered arrangements of features. By integrating $L_H$ with standard classification losses, the authors demonstrate that model performance is largely preserved across speech, text, and image tasks while the feature topology can be guided toward specific structures. In autoencoder-based dimensionality reduction, topology control via $L_H$ produces larger shifts in $H$ with modest or acceptable drops in downstream classification accuracy, highlighting its potential for visualization and data compression. The results suggest practical utility for topology-aware representations in applications such as generative modeling, transfer learning, and robustness, with future work focusing on broader architectures, additional distance metrics, and more diverse domains.
Abstract
Feature space topology refers to the organization of samples within the feature space. Modifying this topology can be beneficial in machine learning applications, including dimensionality reduction, generative modeling, transfer learning, and robustness to adversarial attacks. This paper introduces a novel loss function, Hopkins loss, which leverages the Hopkins statistic to enforce a desired feature space topology, which is in contrast to existing topology-related methods that aim to preserve input feature topology. We evaluate the effectiveness of Hopkins loss on speech, text, and image data in two scenarios: classification and dimensionality reduction using nonlinear bottleneck autoencoders. Our experiments show that integrating Hopkins loss into classification or dimensionality reduction has only a small impact on classification performance while providing the benefit of modifying feature topology.
