Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Patrick Müller, Alexander Braun, Margret Keuper
TL;DR
This work investigates how realistic optical aberrations challenge image classification and object detection, introducing OpticsBench and LensCorruptions as physically grounded benchmarks. By deriving blur kernels from Zernike polynomials and real lens prescriptions, the authors quantify performance degradation across diverse models on ImageNet and MSCOCO, revealing that single-kernel baselines inadequately proxy optical blur. They propose OpticsAugment, a GPU-accelerated data-augmentation technique that uses optical kernels during training, achieving substantial robustness gains (e.g., up to +29.6 percentage points on ImageNet-100 OpticsBench) and transferring improvements to 2D common corruptions. The LensCorruptions framework further demonstrates that robustness scales with lens quality and that real-world lens variability can be a stronger stress test than synthetic baselines, underscoring the need to account for blur type in robustness evaluations and deployment-ready training pipelines.
Abstract
Deep neural networks (DNNs) have proven to be successful in various computer vision applications such that models even infer in safety-critical situations. Therefore, vision models have to behave in a robust way to disturbances such as noise or blur. While seminal benchmarks exist to evaluate model robustness to diverse corruptions, blur is often approximated in an overly simplistic way to model defocus, while ignoring the different blur kernel shapes that result from optical systems. To study model robustness against realistic optical blur effects, this paper proposes two datasets of blur corruptions, which we denote OpticsBench and LensCorruptions. OpticsBench examines primary aberrations such as coma, defocus, and astigmatism, i.e. aberrations that can be represented by varying a single parameter of Zernike polynomials. To go beyond the principled but synthetic setting of primary aberrations, LensCorruptions samples linear combinations in the vector space spanned by Zernike polynomials, corresponding to 100 real lenses. Evaluations for image classification and object detection on ImageNet and MSCOCO show that for a variety of different pre-trained models, the performance on OpticsBench and LensCorruptions varies significantly, indicating the need to consider realistic image corruptions to evaluate a model's robustness against blur.
