Maintaining Performance with Less Data

Dominic Sanderson; Tatiana Kalgonova

Maintaining Performance with Less Data

Dominic Sanderson, Tatiana Kalgonova

TL;DR

The paper tackles the high cost and environmental impact of training image classifiers by introducing three dynamic data-use methods to reduce input data during training. It evaluates these methods—Data Step, Data Increment, and Data Cut—across MNIST, CIFAR-10, and smallNORB using a CNN-Capsule architecture and reports up to $50\%$ runtime reductions with dataset-dependent effects on accuracy. The findings show that targeted data reductions can maintain or even improve performance in some cases (notably MNIST and certain smallNORB runs) while random or excessive reductions can degrade accuracy, especially on CIFAR-10. The work highlights the potential for substantial efficiency gains in AI training and points to future research on principled data selection to optimize performance-cost tradeoffs.

Abstract

We propose a novel method for training a neural network for image classification to reduce input data dynamically, in order to reduce the costs of training a neural network model. As Deep Learning tasks become more popular, their computational complexity increases, leading to more intricate algorithms and models which have longer runtimes and require more input data. The result is a greater cost on time, hardware, and environmental resources. By using data reduction techniques, we reduce the amount of work performed, and therefore the environmental impact of AI techniques, and with dynamic data reduction we show that accuracy may be maintained while reducing runtime by up to 50%, and reducing carbon emission proportionally.

Maintaining Performance with Less Data

TL;DR

runtime reductions with dataset-dependent effects on accuracy. The findings show that targeted data reductions can maintain or even improve performance in some cases (notably MNIST and certain smallNORB runs) while random or excessive reductions can degrade accuracy, especially on CIFAR-10. The work highlights the potential for substantial efficiency gains in AI training and points to future research on principled data selection to optimize performance-cost tradeoffs.

Abstract

Paper Structure (20 sections, 1 equation, 8 figures, 11 tables)

This paper contains 20 sections, 1 equation, 8 figures, 11 tables.

Introduction
Previous Work
Increasing data use
Reducing data use
Variable data use
Methods
Datasets
Hardware
Performance Metrics
Network Architecture
Experimentation
Benchmark
Data Step
Starting with 25% of the dataset
Starting with 50% of the dataset
...and 5 more sections

Figures (8)

Figure 1: Common model structure with Online Augmentation. $\lambda=$ full dataset, $\lambda_a=$ augmented dataset. $f_a()=$ augmentation function.
Figure 2: Proposed model structure with Dynamic Data Selection. $\lambda=$ full dataset, $\lambda_r=$ reduced dataset, $\lambda_a=$ augmented dataset. $f_r()=$ reduction function, $f_a()=$ augmentation function
Figure 3: Data Step Experiments starting with 25% data. (a) Step at 25% through training, (b) Step at 50% through training, (c) Step at 75% through training
Figure 4: Data Step Experiments starting with 50% data. (a) Step at 25% through training, (b) Step at 50% through training, (c) Step at 75% through training
Figure 5: Data Step Experiments starting with 75% data. (a) Step at 25% through training, (b) Step at 50% through training, (c) Step at 75% through training
...and 3 more figures

Maintaining Performance with Less Data

TL;DR

Abstract

Maintaining Performance with Less Data

Authors

TL;DR

Abstract

Table of Contents

Figures (8)