Provably Outlier-resistant Semi-parametric Regression for Transferable Calibration of Low-cost Air-quality Sensors
Divyansh Chaurasia, Manoj Daram, Roshan Kumar, Nihal Thukarama Rao, Vipul Sangode, Pranjal Srivastava, Avnish Tripathi, Shoubhik Chakraborty, Akanksha, Ambasht Kumar, Davender Sethi, Sachchida Nand Tripathi, Purushottam Kar
TL;DR
This work tackles calibration of low-cost CO sensors across diverse deployment conditions by introducing RESPIRE, an outlier-resistant semi-parametric regression framework. RESPIRE models temperature-dependent sensor sensitivities non-parametrically within an RKHS, enabling robust transfer across sites, seasons, and sensors, while providing an interpretable set of weights and a bias term that reveal potential calibration issues. A robust training procedure (SPR) based on an APIS-inspired alternating scheme detects and downweights outliers, with a hard-thresholding step yielding compressed, sparse models without sacrificing performance. Empirical results on a large, multi-site mobile deployment show RESPIRE achieving strong transfer performance, effective sensor-to-sensor adaptation, and the ability to detect anomalies such as swapped sensor readings, underscoring its practical utility for scalable LCAQ networks. The work positions RESPIRE as a practical, provably robust calibration tool with public code for broad adoption in environmental sensing and monitoring networks.
Abstract
We present a case study for the calibration of Low-cost air-quality (LCAQ) CO sensors from one of the largest multi-site-multi-season-multi-sensor-multi-pollutant mobile air-quality monitoring network deployments in India. LCAQ sensors have been shown to play a critical role in the establishment of dense, expansive air-quality monitoring networks and combating elevated pollution levels. The calibration of LCAQ sensors against regulatory-grade monitors is an expensive, laborious and time-consuming process, especially when a large number of sensors are to be deployed in a geographically diverse layout. In this work, we present the RESPIRE technique to calibrate LCAQ sensors to detect ambient CO (Carbon Monoxide) levels. RESPIRE offers specific advantages over baseline calibration methods popular in literature, such as improved prediction in cross-site, cross-season, and cross-sensor settings. RESPIRE offers a training algorithm that is provably resistant to outliers and an explainable model with the ability to flag instances of model overfitting. Empirical results are presented based on data collected during an extensive deployment spanning four sites, two seasons and six sensor packages. RESPIRE code is available at https://github.com/purushottamkar/respire.
