Environmental Feature Engineering and Statistical Validation for ML-Based Path Loss Prediction
Jonathan Ethier, Mathieu Chateauvert, Ryan G. Dempsey, Alexis Bose
TL;DR
This work addresses accurate path loss prediction by leveraging GIS-derived environment information, specifically DSM-based obstructions, to enrich a compact neural network with eight scalar features defined along the direct Tx–Rx path. The approach uses two hidden layers with 64 neurons each, dropout, and $L_2$-style regularization, and validates generalization through six geographically distinct UK holdouts and an intercontinental blind test on Canadian data, achieving RMSE as low as $6.74$ dB with $R^2=0.88$. The study demonstrates robustness to initialization and train/validation splits, and shows consistent improvements over prior baselines (e.g., P.1812) while avoiding overfitting despite feature richness. The GIS-based feature engineering enables scalable, environment-aware path loss modeling across $0.5$–$6$ GHz without terrain-type labels, with future work targeting diffraction effects and higher-frequency regimes via additional obstruction properties.
Abstract
Wireless communications rely on path loss modeling, which is most effective when it includes the physical details of the propagation environment. Acquiring this data has historically been challenging, but geographic information systems data is becoming increasingly available with higher resolution and accuracy. Access to such details enables propagation models to more accurately predict coverage and account for interference in wireless deployments. Machine learning-based modeling can significantly support this effort, with feature based approaches allowing for accurate, efficient, and scalable propagation modeling. Building on previous work, we introduce an extended set of features that improves prediction accuracy while, most importantly, proving model generalization through rigorous statistical assessment and the use of test set holdouts.
