3D View Optimization for Improving Image Aesthetics
Taichi Uchida, Yoshihiro Kanamori, Yuki Endo
TL;DR
This work tackles the challenge of improving image aesthetics by moving beyond traditional 2D cropping to a 3D view optimization approach. It extrapolates the input image, reconstructs a 3D scene, and then optimizes camera parameters and output aspect ratios to maximize an aesthetics score provided by a pre-trained evaluator, using CMA-ES for robust global search. Key contributions include introducing a practical extrapolation-and-reconstruction pipeline, a mask-based regularization to suppress out-of-view artifacts, and a comprehensive set of experiments showing superiority over existing 2D cropping methods both quantitatively and via user study. The approach enables larger, more informative viewpoints and demonstrates a significant step toward aesthetic retrospective rephotography with potential impact on automated photography tools and imaging software.
Abstract
Achieving aesthetically pleasing photography necessitates attention to multiple factors, including composition and capture conditions, which pose challenges to novices. Prior research has explored the enhancement of photo aesthetics post-capture through 2D manipulation techniques; however, these approaches offer limited search space for aesthetics. We introduce a pioneering method that employs 3D operations to simulate the conditions at the moment of capture retrospectively. Our approach extrapolates the input image and then reconstructs the 3D scene from the extrapolated image, followed by an optimization to identify camera parameters and image aspect ratios that yield the best 3D view with enhanced aesthetics. Comparative qualitative and quantitative assessments reveal that our method surpasses traditional 2D editing techniques with superior aesthetics.
