MVS-TTA: Test-Time Adaptation for Multi-View Stereo via Meta-Auxiliary Learning

Hannuo Zhang; Zhixiang Chi; Yang Wang; Xinxin Zuo

MVS-TTA: Test-Time Adaptation for Multi-View Stereo via Meta-Auxiliary Learning

Hannuo Zhang, Zhixiang Chi, Yang Wang, Xinxin Zuo

TL;DR

MVS-TTA addresses the limited generalization of learning-based multi-view stereo by introducing test-time adaptation guided by a self-supervised cross-view photometric consistency objective. A meta-auxiliary learning strategy trains models to benefit from lightweight adaptation at inference, enabling rapid scene-specific refinement without extra labels. The approach is model-agnostic and demonstrates consistent gains across DTU, BlendedMVS, and cross-dataset scenarios, with improved depth accuracy and robustness to domain shifts. This framework offers a practical pathway to bring optimization-style adaptability to data-driven MVS pipelines in real-world deployments, preserving efficiency while enhancing reconstruction fidelity.

Abstract

Recent learning-based multi-view stereo (MVS) methods are data-driven and have achieved remarkable progress due to large-scale training data and advanced architectures. However, their generalization remains sub-optimal due to fixed model parameters trained on limited training data distributions. In contrast, optimization-based methods enable scene-specific adaptation but lack scalability and require costly per-scene optimization. In this paper, we propose MVS-TTA, an efficient test-time adaptation (TTA) framework that enhances the adaptability of learning-based MVS methods by bridging these two paradigms. Specifically, MVS-TTA employs a self-supervised, cross-view consistency loss as an auxiliary task to guide inference-time adaptation. We introduce a meta-auxiliary learning strategy to train the model to benefit from auxiliary-task-based updates explicitly. Our framework is model-agnostic and can be applied to a wide range of MVS methods with minimal architectural changes. Extensive experiments on standard datasets (DTU, BlendedMVS) and a challenging cross-dataset generalization setting demonstrate that MVS-TTA consistently improves performance, even when applied to state-of-the-art MVS models. To our knowledge, this is the first attempt to integrate optimization-based test-time adaptation into learning-based MVS using meta-learning. The code will be available at https://github.com/mart87987-svg/MVS-TTA.

MVS-TTA: Test-Time Adaptation for Multi-View Stereo via Meta-Auxiliary Learning

TL;DR

Abstract

MVS-TTA: Test-Time Adaptation for Multi-View Stereo via Meta-Auxiliary Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)