Resnet in Resnet: Generalizing Residual Architectures
Sasha Targ, Diogo Almeida, Kevin Lyman
TL;DR
The paper targets limitations of ResNets arising from fixed identity shortcuts by introducing a generalized residual block with dual parallel streams (residual and transient). This dual-stream, cross-constrained design is instantiated as RiR using a ResNet Init initialization that adds negligible overhead. Empirical results on CIFAR-10/100 demonstrate consistent gains over ResNet and set a new state-of-the-art on CIFAR-100, with ablations confirming contributions from both streams. The approach broadens residual-network design space by enabling deeper, more flexible processing while preserving optimization benefits.
Abstract
Residual networks (ResNets) have recently achieved state-of-the-art on challenging computer vision tasks. We introduce Resnet in Resnet (RiR): a deep dual-stream architecture that generalizes ResNets and standard CNNs and is easily implemented with no computational overhead. RiR consistently improves performance over ResNets, outperforms architectures with similar amounts of augmentation on CIFAR-10, and establishes a new state-of-the-art on CIFAR-100.
