Impacts of Darwinian Evolution on Pre-trained Deep Neural Networks
Guodong Du, Runhua Jiang, Senqiao Yang, Haoyang Li, Wei Chen, Keren Li, Sim Kuan Goh, Ho-Kin Tang
TL;DR
The paper addresses the inefficiencies and overfitting tendencies of gradient-based training by introducing a Darwinian framework that treats pretrained DNNs as evolving populations. It uses two stages: BP-based pretraining to seed a population of weights, followed by differential evolution to iteratively mutate, recombine, and select fitter networks using the cross-entropy loss as the fitness proxy. Empirical results across MNIST, Fashion-MNIST, CIFAR-10/100, and ImageNet show that this neuro-evolution approach improves accuracy, reduces overfitting without explicit regularization in the DE phase, and enhances robustness to corruptions such as MNIST-C and CIFAR-10-C, with lower time complexity than standard backpropagation. The framework generalizes across architectures and data scales, suggesting practical applicability and potential for adaptive DE refinements in deeper, larger models.
Abstract
Darwinian evolution of the biological brain is documented through multiple lines of evidence, although the modes of evolutionary changes remain unclear. Drawing inspiration from the evolved neural systems (e.g., visual cortex), deep learning models have demonstrated superior performance in visual tasks, among others. While the success of training deep neural networks has been relying on back-propagation (BP) and its variants to learn representations from data, BP does not incorporate the evolutionary processes that govern biological neural systems. This work proposes a neural network optimization framework based on evolutionary theory. Specifically, BP-trained deep neural networks for visual recognition tasks obtained from the ending epochs are considered the primordial ancestors (initial population). Subsequently, the population evolved with differential evolution. Extensive experiments are carried out to examine the relationships between Darwinian evolution and neural network optimization, including the correspondence between datasets, environment, models, and living species. The empirical results show that the proposed framework has positive impacts on the network, with reduced over-fitting and an order of magnitude lower time complexity compared to BP. Moreover, the experiments show that the proposed framework performs well on deep neural networks and big datasets.
