Clair Obscur: an Illumination-Aware Method for Real-World Image Vectorization
Xingyue Lin, Shuai Peng, Xiangyu Xie, Jianhua Zhu, Yuxuan Zhou, Liangcai Gao
TL;DR
The paper addresses real-world image vectorization by reducing fragmentation and enhancing editability through a Clair-Obscur-inspired, intrinsic decomposition in the vector domain. It introduces COVec, which represents an image as a unified vector composition $\mathcal{V}_{final} = \mathcal{V}_{A} * \mathcal{V}_{I}$, where $\mathcal{V}_{A}$ captures intrinsic albedo and $\mathcal{V}_{I}$ encodes shading and illumination, subsequently decomposed into $\mathcal{V}_{S}$ (shade) and $\mathcal{V}_{L}$ (light). The method initializes layers via region-wise semantic binarization and SAM-based albedo masks, then uses a two-stage differentiable rendering optimization with a structure loss and a reconstruction loss, followed by illumination refinement and layer separation. Key contributions include the first integration of intrinsic image decomposition into vector graphics, a region-wise initialization strategy for illumination modeling, and demonstrated improvements in visual fidelity and editability over state-of-the-art baselines. This work enables high-fidelity, editable vector representations for real-world imagery, with practical impact on scalable, resolution-independent editing and rendering of complex scenes.
Abstract
Image vectorization aims to convert raster images into editable, scalable vector representations while preserving visual fidelity. Existing vectorization methods struggle to represent complex real-world images, often producing fragmented shapes at the cost of semantic conciseness. In this paper, we propose COVec, an illumination-aware vectorization framework inspired by the Clair-Obscur principle of light-shade contrast. COVec is the first to introduce intrinsic image decomposition in the vector domain, separating an image into albedo, shade, and light layers in a unified vector representation. A semantic-guided initialization and two-stage optimization refine these layers with differentiable rendering. Experiments on various datasets demonstrate that COVec achieves higher visual fidelity and significantly improved editability compared to existing methods.
