Explainable Federated Learning for U.S. State-Level Financial Distress Modeling
Lorenzo Carta, Fernando Spadea, Oshani Seneviratne
TL;DR
This study applies cross-silo federated learning to predict debt-collection contact as a signal of consumer financial distress using NFCS survey data, treating each U.S. state as a data silo to preserve privacy. It introduces an 8-layer Highway Network with class weighting to handle severe data imbalance and integrates SHAP and Owen values to provide interpretable, global and state-specific explanations of predictors. Results show that the federated model achieves performance comparable to a centralized model while substantially reducing communication costs, and reveals meaningful state-level differences (e.g., Washington vs Hawaii) that can inform region-tailored interventions. The framework offers a scalable, regulation-compliant blueprint for socially responsible AI in consumer credit risk and financial inclusion, with broad applicability to privacy-sensitive decentralized data settings.
Abstract
We present the first application of federated learning (FL) to the U.S. National Financial Capability Study, introducing an interpretable framework for predicting consumer financial distress across all 50 states and the District of Columbia without centralizing sensitive data. Our cross-silo FL setup treats each state as a distinct data silo, simulating real-world governance in nationwide financial systems. Unlike prior work, our approach integrates two complementary explainable AI techniques to identify both global (nationwide) and local (state-specific) predictors of financial hardship, such as contact from debt collection agencies. We develop a machine learning model specifically suited for highly categorical, imbalanced survey data. This work delivers a scalable, regulation-compliant blueprint for early warning systems in finance, demonstrating how FL can power socially responsible AI applications in consumer credit risk and financial inclusion.
