Data Optimisation of Machine Learning Models for Smart Irrigation in Urban Parks
Nasser Ghadiri, Bahman Javadi, Oliver Obst, Sebastian Pfautsch
TL;DR
This paper addresses the reliability and cost of data in smart irrigation for urban parks under climate stress by introducing two techniques: clustering-based estimation of missing sensor readings (using DTW-based K-means and K-shape clustering) and sequential robotic data collection to reduce fixed sensors. The DTW-based clustering consistently yields more accurate sensor replacements with lower variability, while the robotic sensing strategy reduces maintenance needs and maintains accurate soil-moisture predictions, achieving MAE improvements up to 17.2% on circular paths and 2.1% on linear paths. Overall, the methods enhance data quality, lower sensor-network costs, and improve resilience of the SIMPaCT irrigation system against sensor failures, supporting scalable deployment in similar urban parks. The work demonstrates practical gains in data reliability and operational efficiency, with future plans to generalize to other parks and weather conditions.
Abstract
Urban environments face significant challenges due to climate change, including extreme heat, drought, and water scarcity, which impact public health, community well-being, and local economies. Effective management of these issues is crucial, particularly in areas like Sydney Olympic Park, which relies on one of Australia's largest irrigation systems. The Smart Irrigation Management for Parks and Cool Towns (SIMPaCT) project, initiated in 2021, leverages advanced technologies and machine learning models to optimize irrigation and induce physical cooling. This paper introduces two novel methods to enhance the efficiency of the SIMPaCT system's extensive sensor network and applied machine learning models. The first method employs clustering of sensor time series data using K-shape and K-means algorithms to estimate readings from missing sensors, ensuring continuous and reliable data. This approach can detect anomalies, correct data sources, and identify and remove redundant sensors to reduce maintenance costs. The second method involves sequential data collection from different sensor locations using robotic systems, significantly reducing the need for high numbers of stationary sensors. Together, these methods aim to maintain accurate soil moisture predictions while optimizing sensor deployment and reducing maintenance costs, thereby enhancing the efficiency and effectiveness of the smart irrigation system. Our evaluations demonstrate significant improvements in the efficiency and cost-effectiveness of soil moisture monitoring networks. The cluster-based replacement of missing sensors provides up to 5.4% decrease in average error. The sequential sensor data collection as a robotic emulation shows 17.2% and 2.1% decrease in average error for circular and linear paths respectively.
