Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective

Can Jin; Tianjin Huang; Yihua Zhang; Mykola Pechenizkiy; Sijia Liu; Shiwei Liu; Tianlong Chen

Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective

Can Jin, Tianjin Huang, Yihua Zhang, Mykola Pechenizkiy, Sijia Liu, Shiwei Liu, Tianlong Chen

TL;DR

This research introduces a novel data-model co-design perspective: to promote superior weight sparsity by learning important model topology and adequate input data in a synergetic manner and finds that subnetworks discovered by VPNs from pre-trained models enjoy better transferability across diverse downstream scenarios.

Abstract

The rapid development of large-scale deep learning models questions the affordability of hardware platforms, which necessitates the pruning to reduce their computational and memory footprints. Sparse neural networks as the product, have demonstrated numerous favorable benefits like low complexity, undamaged generalization, etc. Most of the prominent pruning strategies are invented from a model-centric perspective, focusing on searching and preserving crucial weights by analyzing network topologies. However, the role of data and its interplay with model-centric pruning has remained relatively unexplored. In this research, we introduce a novel data-model co-design perspective: to promote superior weight sparsity by learning important model topology and adequate input data in a synergetic manner. Specifically, customized Visual Prompts are mounted to upgrade neural Network sparsification in our proposed VPNs framework. As a pioneering effort, this paper conducts systematic investigations about the impact of different visual prompts on model pruning and suggests an effective joint optimization approach. Extensive experiments with 3 network architectures and 8 datasets evidence the substantial performance improvements from VPNs over existing start-of-the-art pruning algorithms. Furthermore, we find that subnetworks discovered by VPNs from pre-trained models enjoy better transferability across diverse downstream scenarios. These insights shed light on new promising possibilities of data-model co-designs for vision model sparsification.

Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective

TL;DR

Abstract

Paper Structure (36 sections, 6 equations, 11 figures, 3 tables, 1 algorithm)

This paper contains 36 sections, 6 equations, 11 figures, 3 tables, 1 algorithm.

Introduction
Related Works and A Pilot Study
Neural Network Pruning.
Prompting.
A Pilot Study
Motivation.
Open Question.
Methodology
Designing Appropriate Visual Prompts
Upgrading Network Sparsification with Visual Prompts
Overall Procedure of VPNs
Experiments
Implementation Details
Network and Datasets.
Pruning Baselines.
...and 21 more sections

Figures (11)

Figure 1: Post-pruning Prompt Results. Performance of $5$ pruning methods and their post-pruning prompt counterparts on ResNet-$18$ and CIFAR$10$, which are marked as $\bullet$ and $\bigstar$, respectively. The dashed line indicates the dense network's performance. (a) Post-pruning with zero-shot. (b) Post-pruning with fine-tuning. Post-pruning prompt is only valid without fine-tuning.
Figure 2: Overview of VPNs. In stage $1$, it locates superior sparse topologies from a data-model perspective. A tailored VP is added to input samples and weight masks are jointly optimized together with the VP. In stage $2$, the identified subnetwork is further fine-tuned with its VP.
Figure 3: Our VP.
Figure 4: Downstream Fine-tuning Results. The performance overview of $9$ unstructured pruning algorithms. All the models are pre-trained on ImageNet-1K; and then pruned and fine-tuned both on the specific downstream dataset. The performance of the dense model and VPNs' best are marked using dashed lines. All the results are averaged over 3 runs. VPNs consistently outperforms other baselines on all eight tasks.
Figure 5: Downstream Fine-tuning Results. The performance overview of VPNs, HYDRA, and OMP. All the results are obtained with ImageNet-$1$K pre-trained ResNet-$50$ and VGG-$16$, fine-tuned on Tiny-ImageNet. VPNs consistently has superior performance.
...and 6 more figures

Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective

TL;DR

Abstract

Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective

Authors

TL;DR

Abstract

Table of Contents

Figures (11)