Demystifying Neural Style Transfer
Yanghao Li, Naiyan Wang, Jiaying Liu, Xiaodi Hou
TL;DR
This work reframes neural style transfer as a distribution-alignment problem by showing that Gram-matrix matching is equivalent to minimizing an MMD statistic with a second-order polynomial kernel. It situates style as a distribution over CNN feature activations and generalizes the approach by exploring MMD with different kernels and BN-statistics matching. The results demonstrate that multiple distribution-alignment methods yield competitive stylizations, highlighting the flexibility and potential of a domain-adaptation viewpoint for style transfer. The insights provide a foundation for designing new, kernel-aware style transfer methods with varied visual characteristics.
Abstract
Neural Style Transfer has recently demonstrated very exciting results which catches eyes in both academia and industry. Despite the amazing results, the principle of neural style transfer, especially why the Gram matrices could represent style remains unclear. In this paper, we propose a novel interpretation of neural style transfer by treating it as a domain adaptation problem. Specifically, we theoretically show that matching the Gram matrices of feature maps is equivalent to minimize the Maximum Mean Discrepancy (MMD) with the second order polynomial kernel. Thus, we argue that the essence of neural style transfer is to match the feature distributions between the style images and the generated images. To further support our standpoint, we experiment with several other distribution alignment methods, and achieve appealing results. We believe this novel interpretation connects these two important research fields, and could enlighten future researches.
