Upright adjustment with graph convolutional networks
Raehyuk Jung, Sungmin Cho, Junseok Kwon
TL;DR
This work tackles upright adjustment of 360° images by processing data on its natural spherical domain. It introduces a CNN+GCN framework that maps CNN-derived feature maps to a spherical graph and uses a GCN to predict a discrete North-pole distribution, from which the upright rotation is obtained. A novel training objective combines distribution labels generated via von Mises-Fisher statistics with Jensen-Shannon divergence, yielding improved rotation invariance, faster convergence, and superior accuracy on SUN360 compared to projection-based CNNs and horizon-based methods. By operating directly on the sphere, the approach preserves the data's geometric structure and enhances VR viewing stability without relying on 2D projections.
Abstract
We present a novel method for the upright adjustment of 360 images. Our network consists of two modules, which are a convolutional neural network (CNN) and a graph convolutional network (GCN). The input 360 images is processed with the CNN for visual feature extraction, and the extracted feature map is converted into a graph that finds a spherical representation of the input. We also introduce a novel loss function to address the issue of discrete probability distributions defined on the surface of a sphere. Experimental results demonstrate that our method outperforms fully connected based methods.
