ResNet: Enabling Deep Convolutional Neural Networks through Residual Learning

Xingyu Liu; Kun Ming Goh

ResNet: Enabling Deep Convolutional Neural Networks through Residual Learning

Xingyu Liu, Kun Ming Goh

TL;DR

The paper tackles the difficulty of training very deep CNNs due to vanishing gradients and degradation. It introduces residual learning with skip connections, formalizing a residual function $F(x)=H(x)-x$ so that $H(x)=F(x)+x$, to enable direct gradient flow and easier optimization. Empirically, ResNet-18 configured for CIFAR-10 achieves 89.9% Top-1 accuracy, outperforming a comparable Baseline CNN by 5.8 percentage points and converging more quickly and stably, demonstrating the effectiveness of deep residual architectures. The work also discusses extensions like ResNeXt, DenseNet, and Wide ResNet, and provides ablation analyses showing that skip connections are essential for gradient propagation and performance gains, thereby establishing residual learning as a scalable paradigm for deep computer vision models.

Abstract

Convolutional Neural Networks (CNNs) has revolutionized computer vision, but training very deep networks has been challenging due to the vanishing gradient problem. This paper explores Residual Networks (ResNet), introduced by He et al. (2015), which overcomes this limitation by using skip connections. ResNet enables the training of networks with hundreds of layers by allowing gradients to flow directly through shortcut connections that bypass intermediate layers. In our implementation on the CIFAR-10 dataset, ResNet-18 achieves 89.9% accuracy compared to 84.1% for a traditional deep CNN of similar depth, while also converging faster and training more stably.

ResNet: Enabling Deep Convolutional Neural Networks through Residual Learning

TL;DR

The paper tackles the difficulty of training very deep CNNs due to vanishing gradients and degradation. It introduces residual learning with skip connections, formalizing a residual function

so that

, to enable direct gradient flow and easier optimization. Empirically, ResNet-18 configured for CIFAR-10 achieves 89.9% Top-1 accuracy, outperforming a comparable Baseline CNN by 5.8 percentage points and converging more quickly and stably, demonstrating the effectiveness of deep residual architectures. The work also discusses extensions like ResNeXt, DenseNet, and Wide ResNet, and provides ablation analyses showing that skip connections are essential for gradient propagation and performance gains, thereby establishing residual learning as a scalable paradigm for deep computer vision models.

ResNet: Enabling Deep Convolutional Neural Networks through Residual Learning

TL;DR

Abstract

ResNet: Enabling Deep Convolutional Neural Networks through Residual Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)