Table of Contents
Fetching ...

ModHiFi: Identifying High Fidelity predictive components for Model Modification

Dhruva Kashyap, Chaitanya Murti, Pranav K Nayak, Tanay Narshana, Chiranjib Bhattacharyya

TL;DR

ModHiFi introduces Subset Fidelity to identify high-fidelity, small component subsets that preserve predictive performance when parts of a model are modified without access to training data or the loss. It proves a local-to-global bound for Lipschitz networks, enabling a principled, architecture-agnostic approach to pruning and unlearning based on local reconstruction, with singleton components often sufficing. The ModHiFi framework (ModHiFi-P for structured pruning and ModHiFi-U for classwise unlearning) uses distributional access via synthetic data to select HiFi components and apply targeted edits without retraining, achieving strong ImageNet pruning speedups and complete CIFAR-10 unlearning, while remaining competitive on LLM pruning. Empirical results across CNNs, Transformers, and LLMs support the existence and effectiveness of HiFi components and demonstrate practical, data-free model modification with broad applicability and minimal fine-tuning.

Abstract

Open weight models, which are ubiquitous, rarely provide access to their training data or loss function. This makes modifying such models for tasks such as pruning or unlearning constrained by this unavailability an active area of research. Existing techniques typically require gradients or ground-truth labels, rendering them infeasible in settings with limited computational resources. In this work, we investigate the fundamental question of identifying components that are critical to the model's predictive performance, without access to either gradients or the loss function, and with only distributional access such as synthetic data. We theoretically demonstrate that the global reconstruction error is linearly bounded by local reconstruction errors for Lipschitz-continuous networks such as CNNs and well-trained Transformers (which, contrary to existing literature, we find exhibit Lipschitz continuity). This motivates using the locally reconstructive behavior of component subsets to quantify their global importance, via a metric that we term Subset Fidelity. In the uncorrelated features setting, selecting individual components via their Subset Fidelity scores is optimal, which we use to propose ModHiFi, an algorithm for model modification that requires no training data or loss function access. ModHiFi-P, for structured pruning, achieves an 11% speedup over the current state of the art on ImageNet models and competitive performance on language models. ModHiFi-U, for classwise unlearning, achieves complete unlearning on CIFAR-10 without fine-tuning and demonstrates competitive performance on Swin Transformers.

ModHiFi: Identifying High Fidelity predictive components for Model Modification

TL;DR

ModHiFi introduces Subset Fidelity to identify high-fidelity, small component subsets that preserve predictive performance when parts of a model are modified without access to training data or the loss. It proves a local-to-global bound for Lipschitz networks, enabling a principled, architecture-agnostic approach to pruning and unlearning based on local reconstruction, with singleton components often sufficing. The ModHiFi framework (ModHiFi-P for structured pruning and ModHiFi-U for classwise unlearning) uses distributional access via synthetic data to select HiFi components and apply targeted edits without retraining, achieving strong ImageNet pruning speedups and complete CIFAR-10 unlearning, while remaining competitive on LLM pruning. Empirical results across CNNs, Transformers, and LLMs support the existence and effectiveness of HiFi components and demonstrate practical, data-free model modification with broad applicability and minimal fine-tuning.

Abstract

Open weight models, which are ubiquitous, rarely provide access to their training data or loss function. This makes modifying such models for tasks such as pruning or unlearning constrained by this unavailability an active area of research. Existing techniques typically require gradients or ground-truth labels, rendering them infeasible in settings with limited computational resources. In this work, we investigate the fundamental question of identifying components that are critical to the model's predictive performance, without access to either gradients or the loss function, and with only distributional access such as synthetic data. We theoretically demonstrate that the global reconstruction error is linearly bounded by local reconstruction errors for Lipschitz-continuous networks such as CNNs and well-trained Transformers (which, contrary to existing literature, we find exhibit Lipschitz continuity). This motivates using the locally reconstructive behavior of component subsets to quantify their global importance, via a metric that we term Subset Fidelity. In the uncorrelated features setting, selecting individual components via their Subset Fidelity scores is optimal, which we use to propose ModHiFi, an algorithm for model modification that requires no training data or loss function access. ModHiFi-P, for structured pruning, achieves an 11% speedup over the current state of the art on ImageNet models and competitive performance on language models. ModHiFi-U, for classwise unlearning, achieves complete unlearning on CIFAR-10 without fine-tuning and demonstrates competitive performance on Swin Transformers.

Paper Structure

This paper contains 85 sections, 10 theorems, 41 equations, 16 figures, 14 tables, 2 algorithms.

Key Result

Lemma 3.0

For any subset $C \subseteq [c^l_{in}]$ in layer $l$,

Figures (16)

  • Figure 1: Monte Carlo estimation of \ref{['eq:MFS']} across selected layers of various models. The x-axis indicates subset size $k$, and the y-axis the maximum fidelity found across random samples.
  • Figure 2: Fidelity score of selected layers of a ResNet-50 model on CIFAR10 and the effect of noise on the fidelity score.
  • Figure 3: Boxplots for the distribution of norms of inputs to normalization layer. Minimum value indicated in Red, showing that $\frac{1}{r}$ is at most 5. Y-axis is log scale.
  • Figure 4: Estimates of Optimal subset fidelity for ResNet-50 on CIFAR10.
  • Figure 5: Estimates of Optimal subset fidelity for ResNet-50 on CIFAR100.
  • ...and 11 more figures

Theorems & Definitions (27)

  • Definition 3.0: Subset Fidelity
  • Lemma 3.0: Properties of Subset Fidelity
  • Remark 3.1
  • Definition 3.1: ((k,η))-HiFi Set
  • Definition 3.2: $k$-Maximum Fidelity Subset
  • Theorem 3.3: Local to Global
  • proof : Sketch
  • Remark 3.4
  • Proposition 3.4: Compensation and Singleton Fidelity
  • Theorem 3.5
  • ...and 17 more