Deep Manifold Traversal: Changing Labels with Convolutional Features
Jacob R. Gardner, Paul Upchurch, Matt J. Kusner, Yixuan Li, Kilian Q. Weinberger, Kavita Bala, John E. Hopcroft
TL;DR
This paper tackles the problem of general label-changing in images by introducing Deep Manifold Traversal (DMT), a data-driven method that traverses the natural image manifold in a deep CNN feature space guided by Maximum Mean Discrepancy. The approach maps images to high-level features, performs a budgeted linear traversal toward a target class via an MMD-based objective, and reconstructs the resulting image from modified features. It demonstrates versatile semantic edits—aging, hair color changes, and outdoor scene transformations—at high resolutions and compares favorably to baselines and single-target morphing methods. The work highlights a scalable, task-agnostic framework with potential as a powerful data augmentation and pre-processing tool for vision systems.
Abstract
Many tasks in computer vision can be cast as a "label changing" problem, where the goal is to make a semantic change to the appearance of an image or some subject in an image in order to alter the class membership. Although successful task-specific methods have been developed for some label changing applications, to date no general purpose method exists. Motivated by this we propose deep manifold traversal, a method that addresses the problem in its most general form: it first approximates the manifold of natural images then morphs a test image along a traversal path away from a source class and towards a target class while staying near the manifold throughout. The resulting algorithm is surprisingly effective and versatile. It is completely data driven, requiring only an example set of images from the desired source and target domains. We demonstrate deep manifold traversal on highly diverse label changing tasks: changing an individual's appearance (age and hair color), changing the season of an outdoor image, and transforming a city skyline towards nighttime.
