LangGas: Introducing Language in Selective Zero-Shot Background Subtraction for Semi-Transparent Gas Leak Detection with a New Dataset
Wenqi Guo, Yiyang Du, Shan Du
TL;DR
This work tackles the scarce availability of annotated gas-leak datasets by introducing SimGas, a synthetic, video-based dataset with accurate segmentation ground truth for semi-transparent leaks. It presents LangGas, a selective zero-shot pipeline that fuses background subtraction, vision-language model filtering with careful prompts, temporal consistency checks, and SAM-based segmentation to localize leaks without labeled training data. The method achieves an IoU of 0.69 on its dataset, and demonstrates promising qualitative transfer to GasVid, indicating potential for practical deployment in industrial monitoring. The approach is lightweight enough for near-real-time operation and highlights future extensions such as moving-camera handling via optical flow and broader applicability to other detection tasks.
Abstract
Gas leakage poses a significant hazard that requires prevention. Traditionally, human inspection has been used for detection, a slow and labour-intensive process. Recent research has applied machine learning techniques to this problem, yet there remains a shortage of high-quality, publicly available datasets. This paper introduces a synthetic dataset, SimGas, featuring diverse backgrounds, interfering foreground objects, diverse leak locations, and precise segmentation ground truth. We propose a zero-shot method that combines background subtraction, zero-shot object detection, filtering, and segmentation to leverage this dataset. Experimental results indicate that our approach significantly outperforms baseline methods based solely on background subtraction and zero-shot object detection with segmentation, reaching an IoU of 69%. We also present an analysis of various prompt configurations and threshold settings to provide deeper insights into the performance of our method. Finally, we qualitatively (because of the lack of ground truth) tested our performance on GasVid and reached decent results on the real-world dataset. The dataset, code, and full qualitative results are available at https://github.com/weathon/Lang-Gas.
