Lightweight Optimal-Transport Harmonization on Edge Devices
Maria Larchenko, Dmitry Guskov, Alexander Lobashev, Georgy Derevyanko
TL;DR
The paper tackles real-time color harmonization for augmented reality by casting the problem as an optimal transport task and predicting MKL transport map parameters with a lightweight encoder suitable for edge devices. It grounds the approach in classical OT theory, derives an explicit error bound for the MKL approximation, and demonstrates practical viability with an EfficientNet-B0-based encoder that outputs a 12-parameter MKL filter. The authors also contribute an AR-specific dataset (ARCore) and show competitive quantitative metrics as well as strong perceptual performance, including on-device inference at mobile frame rates. This work enables fast, on-device color harmonization suitable for immersive AR pipelines and provides a theoretical and empirical basis for relying on linear OT-based filters in this domain.
Abstract
Color harmonization adjusts the colors of an inserted object so that it perceptually matches the surrounding image, resulting in a seamless composite. The harmonization problem naturally arises in augmented reality (AR), yet harmonization algorithms are not currently integrated into AR pipelines because real-time solutions are scarce. In this work, we address color harmonization for AR by proposing a lightweight approach that supports on-device inference. For this, we leverage classical optimal transport theory by training a compact encoder to predict the Monge-Kantorovich transport map. We benchmark our MKL-Harmonizer algorithm against state-of-the-art methods and demonstrate that for real composite AR images our method achieves the best aggregated score. We release our dedicated AR dataset of composite images with pixel-accurate masks and data-gathering toolkit to support further data acquisition by researchers.
