Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching
Rui Gong, Weide Liu, Zaiwang Gu, Xulei Yang, Jun Cheng
TL;DR
This work tackles stereo matching by injecting both intra-view and cross-view geometric knowledge into learning-based disparity estimation. It introduces ICGNet, which employs an intra-view decoder guided by a pre-trained interest-point detector and a cross-view decoder guided by a pre-trained interest-point matcher and ground-truth correspondences, optimized with L_intra, L_cross-soft, and L_cross-hard losses in addition to the standard disparity loss. Empirically, ICGNet achieves state-of-the-art performance on SceneFlow and strong cross-domain generalization to KITTI and Middlebury, while incurring zero inference overhead due to the decoders being discarded at test time. The approach demonstrates the value of leveraging geometric priors from local feature matching to improve disparity estimation, with broad implications for robust stereo in textureless or occluded regions.
Abstract
Geometric knowledge has been shown to be beneficial for the stereo matching task. However, prior attempts to integrate geometric insights into stereo matching algorithms have largely focused on geometric knowledge from single images while crucial cross-view factors such as occlusion and matching uniqueness have been overlooked. To address this gap, we propose a novel Intra-view and Cross-view Geometric knowledge learning Network (ICGNet), specifically crafted to assimilate both intra-view and cross-view geometric knowledge. ICGNet harnesses the power of interest points to serve as a channel for intra-view geometric understanding. Simultaneously, it employs the correspondences among these points to capture cross-view geometric relationships. This dual incorporation empowers the proposed ICGNet to leverage both intra-view and cross-view geometric knowledge in its learning process, substantially improving its ability to estimate disparities. Our extensive experiments demonstrate the superiority of the ICGNet over contemporary leading models.
