Delving into Cascaded Instability: A Lipschitz Continuity View on Image Restoration and Object Detection Synergy
Qing Zhao, Weijian Deng, Pengxu Wei, ZiYi Dong, Hannan Lu, Xiangyang Ji, Liang Lin
TL;DR
The paper tackles instability in Image Restoration → Object Detection pipelines under adverse conditions by adopting a Lipschitz-continuity framework. It identifies a fundamental mismatch: restoration networks exhibit smoother mappings while detectors have sharp, discontinuous decision boundaries, leading to amplified perturbations and unstable training in cascades. To address this, it introduces Lipschitz-regularized object detection (LROD), combining low-Lipschitz restoration with parameter-space smoothing, implemented as LR-YOLO and integrated into YOLO detectors. Across haze and low-light benchmarks, the method achieves more stable optimization and higher detection accuracy, while remaining plug-and-play with existing detectors and generalizable to other backbones. This Lipschitz-informed approach offers a practical path to robust perception in challenging environments.
Abstract
To improve detection robustness in adverse conditions (e.g., haze and low light), image restoration is commonly applied as a pre-processing step to enhance image quality for the detector. However, the functional mismatch between restoration and detection networks can introduce instability and hinder effective integration -- an issue that remains underexplored. We revisit this limitation through the lens of Lipschitz continuity, analyzing the functional differences between restoration and detection networks in both the input space and the parameter space. Our analysis shows that restoration networks perform smooth, continuous transformations, while object detectors operate with discontinuous decision boundaries, making them highly sensitive to minor perturbations. This mismatch introduces instability in traditional cascade frameworks, where even imperceptible noise from restoration is amplified during detection, disrupting gradient flow and hindering optimization. To address this, we propose Lipschitz-regularized object detection (LROD), a simple yet effective framework that integrates image restoration directly into the detector's feature learning, harmonizing the Lipschitz continuity of both tasks during training. We implement this framework as Lipschitz-regularized YOLO (LR-YOLO), extending seamlessly to existing YOLO detectors. Extensive experiments on haze and low-light benchmarks demonstrate that LR-YOLO consistently improves detection stability, optimization smoothness, and overall accuracy.
