Classification of Firn Data via Topological Features
Sarah Day, Jesse Dimino, Matt Jester, Kaitlin Keegan, Thomas Weighill
TL;DR
The paper addresses depth prediction from firn microCT images using topological features. It compares two persistent-homology featurizations, sublevel-set and distance-transform, each summarized as Betti and Gaussian persistence curves and fed into random forests. The results reveal trade-offs: sublevel-set features achieve high accuracy on unblurred data but are sensitive to noise and to out-of-sample depths, while distance-transform features are more robust to blur and extrapolate better to unseen depths. The findings highlight the need to balance scale sensitivity against preprocessing choices in TDA-based texture analysis, with implications for firn densification studies and cross-site depth inference.
Abstract
In this paper we evaluate the performance of topological features for generalizable and robust classification of firn image data, with the broader goal of understanding the advantages, pitfalls, and trade-offs in topological featurization. Firn refers to layers of granular snow within glaciers that have not yet been compressed into ice. This compactification process imposes distinct topological and geometric structure on firn that varies with depth within the firn column, making topological data analysis (TDA) a natural choice for understanding the connection between depth and structure. We use two classes of topological features, sublevel set features and distance transform features, together with persistence curves, to predict sample depth from microCT images. A range of challenging training-test scenarios reveals that no one choice of method dominates in all categories, and uncovers a web of trade-offs between accuracy, interpretability, and generalizability.
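To make the two feature classes concrete, the following is a minimal sketch, not the authors' pipeline. It computes only the degree-0 Betti curve (the number of connected components of each sublevel set) for a toy grayscale image, first on the raw intensities and then on a distance transform of a binarized copy. The function names, the 4-connectivity, the binarization threshold, and the use of an unsigned taxicab distance are all assumptions made for illustration. A real analysis would use a persistent homology library and include higher-degree features.

```python
from collections import deque


def neighbors(r, c, rows, cols):
    """Yield the 4-connected neighbours of pixel (r, c) (assumed connectivity)."""
    for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        nr, nc = r + dr, c + dc
        if 0 <= nr < rows and 0 <= nc < cols:
            yield nr, nc


def betti0_curve(image, thresholds):
    """For each threshold t, count connected components of {pixel <= t}."""
    rows, cols = len(image), len(image[0])
    curve = []
    for t in thresholds:
        seen = set()
        components = 0
        for r in range(rows):
            for c in range(cols):
                if image[r][c] > t or (r, c) in seen:
                    continue
                # New component: flood-fill everything reachable at level t.
                components += 1
                seen.add((r, c))
                queue = deque([(r, c)])
                while queue:
                    cr, cc = queue.popleft()
                    for nr, nc in neighbors(cr, cc, rows, cols):
                        if image[nr][nc] <= t and (nr, nc) not in seen:
                            seen.add((nr, nc))
                            queue.append((nr, nc))
        curve.append(components)
    return curve


def taxicab_distance_transform(binary):
    """Taxicab distance from each pixel to the nearest foreground (1) pixel."""
    rows, cols = len(binary), len(binary[0])
    dist = [[None] * cols for _ in range(rows)]
    queue = deque()
    for r in range(rows):
        for c in range(cols):
            if binary[r][c]:
                dist[r][c] = 0
                queue.append((r, c))
    if not queue:
        raise ValueError("binary image has no foreground pixels")
    # Multi-source breadth-first search outward from all foreground pixels.
    while queue:
        cr, cc = queue.popleft()
        for nr, nc in neighbors(cr, cc, rows, cols):
            if dist[nr][nc] is None:
                dist[nr][nc] = dist[cr][cc] + 1
                queue.append((nr, nc))
    return dist


# Toy 3x3 "image": four dark corner pixels separated by bright pixels.
image = [
    [0, 9, 1],
    [9, 9, 9],
    [2, 9, 0],
]

# Sublevel-set feature: Betti-0 curve of the raw intensities.
sublevel_curve = betti0_curve(image, thresholds=[0, 1, 2, 9])

# Distance-transform feature: binarize (threshold 2 is an arbitrary choice),
# take the distance transform, then compute the Betti-0 curve of that.
binary = [[1 if v <= 2 else 0 for v in row] for row in image]
dist = taxicab_distance_transform(binary)
distance_curve = betti0_curve(dist, thresholds=[0, 1, 2])

print(sublevel_curve)
print(distance_curve)
```

In this sketch the sublevel-set curve depends on the exact intensity values, while the distance-transform curve depends only on the geometry of the binarized image. That is one way to read the abstract's trade-off between the two feature classes. Either curve could then be used as a fixed-length feature vector for a classifier.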
