One weird trick for parallelizing convolutional neural networks
Alex Krizhevsky
Abstract
I present a new method for parallelizing the training of convolutional neural networks across multiple GPUs. The method scales significantly better than all alternatives when applied to modern convolutional neural networks.