CFDNet: A Generalizable Foggy Stereo Matching Network with Contrastive Feature Distillation

Zihua Liu; Yizhou Li; Masatoshi Okutomi

CFDNet: A Generalizable Foggy Stereo Matching Network with Contrastive Feature Distillation

Zihua Liu, Yizhou Li, Masatoshi Okutomi

TL;DR

This work introduces a framework based on contrastive feature distillation (CFD) that combines feature distillation from merged clean-fog features with contrastive learning, ensuring balanced dependence on fog depth hints and clean matching features.

Abstract

Stereo matching under foggy scenes remains a challenging task since the scattering effect degrades the visibility and results in less distinctive features for dense correspondence matching. While some previous learning-based methods integrated a physical scattering function for simultaneous stereo-matching and dehazing, simply removing fog might not aid depth estimation because the fog itself can provide crucial depth cues. In this work, we introduce a framework based on contrastive feature distillation (CFD). This strategy combines feature distillation from merged clean-fog features with contrastive learning, ensuring balanced dependence on fog depth hints and clean matching features. This framework helps to enhance model generalization across both clean and foggy environments. Comprehensive experiments on synthetic and real-world datasets affirm the superior strength and adaptability of our method.

CFDNet: A Generalizable Foggy Stereo Matching Network with Contrastive Feature Distillation

TL;DR

Abstract

Paper Structure (18 sections, 11 equations, 6 figures, 5 tables)

This paper contains 18 sections, 11 equations, 6 figures, 5 tables.

Introduction
Related Work
Stereo Matching
Stereo Matching in Foggy Conditions.
Feature Distillation and Contrastive Learning.
Proposed methods
Network Architecture Overview
Attentive Feature Converter
Teacher : Feature Fusion Stereo Matching Network
Student : Parallel Stereo Matching Network with Contrast-Balanced Distillation.
Experiments
Datasets
Foggy Images Synthesis
Implementation Details
Evaluation Metrics.
...and 3 more sections

Figures (6)

Figure 1: Clean and foggy left images from stereo image pairs (top), and estimated disparity maps from each pair with state-of-the-art RaftStereoRaftStereo (bottom).
Figure 2: Overall architecture of our proposed CFDNet with contrastive feature distillation.
Figure 3: Attentive Feature Converter. A feature aggregation module with pixel-wise and channel-wise attention layers.
Figure 4: Visualizations of ablation study on SceneFlow dataset. The first column is the predicted disparity for each group, while the second is the corresponding error map. We enlarged the selected part (white bounding box) for easier viewing.
Figure 5: Qualitative comparison on SceneFlow dataset foggy images with other superior works.
...and 1 more figures

CFDNet: A Generalizable Foggy Stereo Matching Network with Contrastive Feature Distillation

TL;DR

Abstract

CFDNet: A Generalizable Foggy Stereo Matching Network with Contrastive Feature Distillation

Authors

TL;DR

Abstract

Table of Contents

Figures (6)