Retargeting Visual Data with Deformation Fields
Tim Elsner, Julia Berger, Tong Wu, Victor Czech, Lin Gao, Leif Kobbelt
TL;DR
The paper reframes content-aware retargeting as a global, continuous deformation problem learned by neural fields, extending seam carving beyond images to 3D data such as Neural Radiance Fields (NeRFs) and polygon meshes. A scalar deformation field D(p) along a fixed axis v induces a coordinate shift p' = p + D(p)·v, with I(p) = I'(p') providing the retargeted content. It combines energy-based content awareness with sanity constraints via losses L_e, L_s, L_b, and L_m, plus domain-specific energy and cumulative-energy networks E and Σ, enabling the deformation to concentrate in low-information regions while preserving high-information detail; an inverse deformation U facilitates NeRF surface manipulation. The framework yields better content-aware retargeting than prior seam-carving methods, demonstrated through quantitative metrics (e.g., FID) and user studies across images and 3D scenes, and supports editing operations such as object removal and movement. Overall, the approach offers a domain-agnostic backbone for seam-carving-like retargeting that generalizes to 3D representations with controllable distortion and plausible outputs, albeit with computational trade-offs relative to traditional seam carving.
Abstract
Seam carving is an image editing method that enable content-aware resizing, including operations like removing objects. However, the seam-finding strategy based on dynamic programming or graph-cut limits its applications to broader visual data formats and degrees of freedom for editing. Our observation is that describing the editing and retargeting of images more generally by a displacement field yields a generalisation of content-aware deformations. We propose to learn a deformation with a neural network that keeps the output plausible while trying to deform it only in places with low information content. This technique applies to different kinds of visual data, including images, 3D scenes given as neural radiance fields, or even polygon meshes. Experiments conducted on different visual data show that our method achieves better content-aware retargeting compared to previous methods.
