Table of Contents
Fetching ...

Riemannian Integrated Gradients: A Geometric View of Explainable AI

Federico Costanza, Lachlan Simpson

TL;DR

This work addresses explainability for neural networks on non-Euclidean data by extending gradient-based attributions to Riemannian manifolds. It introduces Riemannian Integrated Gradients (RIG) as a manifold-consistent generalisation of Integrated Gradients (IG), using a bilinear path-attribution form and geodesic connections between points. A key contribution is showing that, with a natural choice of basis, RIG attributions correspond to the eigenvalues of a symmetric endomorphism $Q_{F,\gamma}(p)$ associated with the path attribution form, and that RIG reduces to IG in Euclidean space. The framework preserves isometry-invariance like properties and provides a geometrically meaningful, basis-dependent attribution mechanism that respects manifold structure, with planned empirical validation on geometries beyond Euclidean spaces.

Abstract

We introduce Riemannian Integrated Gradients (RIG); an extension of Integrated Gradients (IG) to Riemannian manifolds. We demonstrate that RIG restricts to IG when the Riemannian manifold is Euclidean space. We show that feature attribution can be phrased as an eigenvalue problem where attributions correspond to eigenvalues of a symmetric endomorphism.

Riemannian Integrated Gradients: A Geometric View of Explainable AI

TL;DR

This work addresses explainability for neural networks on non-Euclidean data by extending gradient-based attributions to Riemannian manifolds. It introduces Riemannian Integrated Gradients (RIG) as a manifold-consistent generalisation of Integrated Gradients (IG), using a bilinear path-attribution form and geodesic connections between points. A key contribution is showing that, with a natural choice of basis, RIG attributions correspond to the eigenvalues of a symmetric endomorphism associated with the path attribution form, and that RIG reduces to IG in Euclidean space. The framework preserves isometry-invariance like properties and provides a geometrically meaningful, basis-dependent attribution mechanism that respects manifold structure, with planned empirical validation on geometries beyond Euclidean spaces.

Abstract

We introduce Riemannian Integrated Gradients (RIG); an extension of Integrated Gradients (IG) to Riemannian manifolds. We demonstrate that RIG restricts to IG when the Riemannian manifold is Euclidean space. We show that feature attribution can be phrased as an eigenvalue problem where attributions correspond to eigenvalues of a symmetric endomorphism.

Paper Structure

This paper contains 6 sections, 5 theorems, 22 equations.

Key Result

theorem thmcountertheorem

$\mathrm{RIG}$ coincides with $\mathrm{IG}$ in Euclidean space.

Theorems & Definitions (14)

  • definition thmcounterdefinition
  • remark thmcounterremark
  • definition thmcounterdefinition
  • remark thmcounterremark
  • definition thmcounterdefinition
  • theorem thmcountertheorem
  • proof
  • proposition thmcounterproposition
  • proof
  • proposition thmcounterproposition
  • ...and 4 more