Modular Jets for Supervised Pipelines: Diagnosing Mirage vs Identifiability
Suman Sanyal
TL;DR
<3-5 sentence high-level summary> Modular Jets addresses the gap between risk-based evaluation and understanding a model's internal modular decomposition in supervised pipelines. By treating the input space as a task manifold and instrumenting module taps, the authors estimate empirical jets that capture local linear responses to structured perturbations. They define mirage vs identifiability regimes and prove a jet-identifiability result for linear two-module pipelines, complemented by a practical MoJet algorithm for empirical jet estimation and diagnostics. Across synthetic and real data, jets reveal internal differences that risk metrics miss, providing a scalable, post-hoc auditing tool with potential to guide evaluation design and interpretability in larger systems.
Abstract
Classical supervised learning evaluates models primarily via predictive risk on hold-out data. Such evaluations quantify how well a function behaves on a distribution, but they do not address whether the internal decomposition of a model is uniquely determined by the data and evaluation design. In this paper, we introduce \emph{Modular Jets} for regression and classification pipelines. Given a task manifold (input space), a modular decomposition, and access to module-level representations, we estimate empirical jets, which are local linear response maps that describe how each module reacts to small structured perturbations of the input. We propose an empirical notion of \emph{mirage} regimes, where multiple distinct modular decompositions induce indistinguishable jets and thus remain observationally equivalent, and contrast this with an \emph{identifiable} regime, where the observed jets single out a decomposition up to natural symmetries. In the setting of two-module linear regression pipelines we prove a jet-identifiability theorem. Under mild rank assumptions and access to module-level jets, the internal factorisation is uniquely determined, whereas risk-only evaluation admits a large family of mirage decompositions that implement the same input-to-output map. We then present an algorithm (MoJet) for empirical jet estimation and mirage diagnostics, and illustrate the framework using linear and deep regression as well as pipeline classification.
