Blind Augmentation: Calibration-free Camera Distortion Model Estimation for Real-time Mixed-reality Consistency

Siddhant Prakash; David R. Walton; Rafael K. dos Anjos; Anthony Steed; Tobias Ritschel

Blind Augmentation: Calibration-free Camera Distortion Model Estimation for Real-time Mixed-reality Consistency

Siddhant Prakash, David R. Walton, Rafael K. dos Anjos, Anthony Steed, Tobias Ritschel

TL;DR

The paper tackles the challenge of achieving visual consistency between real camera footage and virtual content in real-time mixed-reality without requiring camera calibration or markers. It introduces blind augmentation, which jointly learns a distortion model for noise, motion blur, and depth-of-field from arbitrary videos, then uses this model to synthesize corresponding distortions for virtual objects in real time. The approach combines depth and motion estimation with lightweight, end-to-end optimization to recover parameters such as $\lambda$, $\delta$, and $\sigma$, and then applies MB, DoF, and noise using off-the-shelf renderers, enabling fast startup and robust AR compositing. Extensive qualitative, quantitative, user-study, and real-time demonstrations (including a Unity demo on a Meta Quest 3) show that the method matches or exceeds marker-based baselines without requiring prior calibration, while maintaining practical runtime. This work offers a practical path to calibration-free, high-fidelity AR alignment in consumer MR devices.

Abstract

Real camera footage is subject to noise, motion blur (MB) and depth of field (DoF). In some applications these might be considered distortions to be removed, but in others it is important to model them because it would be ineffective, or interfere with an aesthetic choice, to simply remove them. In augmented reality applications where virtual content is composed into a live video feed, we can model noise, MB and DoF to make the virtual content visually consistent with the video. Existing methods for this typically suffer two main limitations. First, they require a camera calibration step to relate a known calibration target to the specific cameras response. Second, existing work require methods that can be (differentiably) tuned to the calibration, such as slow and specialized neural networks. We propose a method which estimates parameters for noise, MB and DoF instantly, which allows using off-the-shelf real-time simulation methods from e.g., a game engine in compositing augmented content. Our main idea is to unlock both features by showing how to use modern computer vision methods that can remove noise, MB and DoF from the video stream, essentially providing self-calibration. This allows to auto-tune any black-box real-time noise+MB+DoF method to deliver fast and high-fidelity augmentation consistency.

Blind Augmentation: Calibration-free Camera Distortion Model Estimation for Real-time Mixed-reality Consistency

TL;DR

Abstract

Blind Augmentation: Calibration-free Camera Distortion Model Estimation for Real-time Mixed-reality Consistency

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (16)