CSCPR: Cross-Source-Context Indoor RGB-D Place Recognition
Jing Liang, Zhuo Deng, Zheming Zhou, Min Sun, Omid Ghasemalizadeh, Cheng-Hao Kuo, Arnie Sen, Dinesh Manocha
TL;DR
This work tackles RGB-D indoor place recognition under cross-source conditions by introducing CSCPR, an end-to-end system that integrates global retrieval with a fast, learned reranking stage.The reranker leverages two novel modules, Self Context Clusters (SCC) and Cross Source Context Clusters (CSCC), to process multi-scale, multi-source RGB-D features within the Context-of-Clusters framework, producing a robust reranking score.The authors also contribute two large-scale, overlap-based datasets, ScanNetIPR and ARKitIPR, and demonstrate that CSCPR achieves substantial Recall@1 improvements over state-of-the-art methods across these datasets, along with improved efficiency.Together, these advances advance RGB-D indoor place recognition by enabling integrated retrieval-reranking with cross-source adaptability, while providing resources for ongoing research and development.
Abstract
We extend our previous work, PoCo, and present a new algorithm, Cross-Source-Context Place Recognition (CSCPR), for RGB-D indoor place recognition that integrates global retrieval and reranking into an end-to-end model and keeps the consistency of using Context-of-Clusters (CoCs) for feature processing. Unlike prior approaches that primarily focus on the RGB domain for place recognition reranking, CSCPR is designed to handle the RGB-D data. We apply the CoCs to handle cross-sourced and cross-scaled RGB-D point clouds and introduce two novel modules for reranking: the Self-Context Cluster (SCC) and the Cross Source Context Cluster (CSCC), which enhance feature representation and match query-database pairs based on local features, respectively. We also release two new datasets, ScanNetIPR and ARKitIPR. Our experiments demonstrate that CSCPR significantly outperforms state-of-the-art models on these datasets by at least 29.27% in Recall@1 on the ScanNet-PR dataset and 43.24% in the new datasets. Code and datasets will be released.
