Nested Fusion: A Method for Learning High Resolution Latent Structure of Multi-Scale Measurement Data on Mars
Austin P. Wright, Scott Davidoff, Duen Horng Chau
TL;DR
Nested Fusion tackles the challenge of learning high-resolution latent structure from nested, multi-scale measurements by formalizing nested datasets and employing a variational autoencoder that encodes hierarchical data into a max-resolution latent space. The method tokenizes heterogeneous data scales into a sequence processed by a dedicated encoder, with per-scale decoders that preserve cross-scale dependencies via a nested beta correspondence. Empirically, Nested Fusion outperforms joint and concatenative baselines in both qualitative latent structure and quantitative reconstruction fidelity on Mars PIXL data, and it has been deployed in NASA's PIXL scientific workflow with open-source tooling. The approach enables rapid, interpretable exploration of cross-modal patterns, significantly accelerating mineral identification workflows and informing future data-analysis designs for multi-scale planetary science datasets.
Abstract
The Mars Perseverance Rover represents a generational change in the scale of measurements that can be taken on Mars, however this increased resolution introduces new challenges for techniques in exploratory data analysis. The multiple different instruments on the rover each measures specific properties of interest to scientists, so analyzing how underlying phenomena affect multiple different instruments together is important to understand the full picture. However each instrument has a unique resolution, making the mapping between overlapping layers of data non-trivial. In this work, we introduce Nested Fusion, a method to combine arbitrarily layered datasets of different resolutions and produce a latent distribution at the highest possible resolution, encoding complex interrelationships between different measurements and scales. Our method is efficient for large datasets, can perform inference even on unseen data, and outperforms existing methods of dimensionality reduction and latent analysis on real-world Mars rover data. We have deployed our method Nested Fusion within a Mars science team at NASA Jet Propulsion Laboratory (JPL) and through multiple rounds of participatory design enabled greatly enhanced exploratory analysis workflows for real scientists. To ensure the reproducibility of our work we have open sourced our code on GitHub at https://github.com/pixlise/NestedFusion.
