Fin-Fed-OD: Federated Outlier Detection on Financial Tabular Data

Dayananda Herurkar; Sebastian Palacio; Ahmed Anwar; Joern Hees; Andreas Dengel

TL;DR

Fin-Fed-OD tackles open-world anomaly detection in privacy-sensitive financial tabular data by combining representation learning with federated learning (FL). Each client trains its own autoencoder to learn latent representations, which outlier detection (OD) models then use to refine the inlier boundary; only model parameters are shared, through FL aggregation (FedAvg/FedProx). On tabular and image datasets, and in comparisons that include DAGMM and MemAE baselines, the FL-based approaches detect unknown outliers better without degrading known-outlier performance, as shown by quantitative average precision (AP) improvements and qualitative latent-space clustering. This privacy-preserving collaboration enables robust OD at each client, with practical implications for financial fraud detection and cross-organization anomaly resilience.

Abstract

Anomaly detection in real-world scenarios poses challenges due to dynamic and often unknown anomaly distributions, requiring robust methods that operate under an open-world assumption. This challenge is exacerbated in practical settings, where models are employed by private organizations, precluding data sharing due to privacy and competitive concerns. Despite potential benefits, the sharing of anomaly information across organizations is restricted. This paper addresses the question of enhancing outlier detection within individual organizations without compromising data confidentiality. We propose a novel method leveraging representation learning and federated learning techniques to improve the detection of unknown anomalies. Specifically, our approach utilizes latent representations obtained from client-owned autoencoders to refine the decision boundary of inliers. Notably, only model parameters are shared between organizations, preserving data privacy. The efficacy of our proposed method is evaluated on two standard financial tabular datasets and an image dataset for anomaly detection in a distributed setting. The results demonstrate a strong improvement in the classification of unknown outliers during the inference phase for each organization's model.
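The parameter-sharing step described above can be illustrated with a minimal FedAvg-style sketch. This is not the authors' implementation: the function name `fed_avg`, the parameter keys `enc_w` and `enc_b`, the toy values, and the choice to weight each client by its local sample count are all assumptions made for illustration. The point is only that clients exchange parameter arrays, never raw records.

```python
import numpy as np

def fed_avg(client_params, client_sizes):
    """Weighted average of per-client parameter dicts (FedAvg-style).

    client_params: list of dicts mapping a parameter name to a NumPy array.
    client_sizes:  local sample counts, used as weights (an assumption here).
    """
    total = float(sum(client_sizes))
    keys = client_params[0].keys()
    return {
        k: sum(p[k] * (n / total) for p, n in zip(client_params, client_sizes))
        for k in keys
    }

# Three hypothetical clients holding toy autoencoder parameters.
clients = [
    {"enc_w": np.full((2, 2), 1.0), "enc_b": np.zeros(2)},
    {"enc_w": np.full((2, 2), 2.0), "enc_b": np.ones(2)},
    {"enc_w": np.full((2, 2), 4.0), "enc_b": np.full(2, 2.0)},
]
sizes = [100, 100, 200]

# The server aggregates parameters only; no client data leaves its owner.
global_params = fed_avg(clients, sizes)
print(global_params["enc_w"])
print(global_params["enc_b"])
```

In a full round, each client would load `global_params` back into its local autoencoder, continue training on its own data, and send the updated parameters for the next aggregation. FedProx, also named in the TL;DR, differs on the client side by adding a proximal term to the local objective and is not shown here.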

Paper Structure (21 sections, 3 equations, 8 figures, 4 tables, 1 algorithm)

Introduction
Related Work
Approach
Standalone (Baseline) Model For Outlier Detection
Collaborative Federated Learning Environment (CoFLE)
Experimental Setup
Tabular Dataset + Synthetic Outliers
Image Dataset + Natural Outliers
Model Parameters
Evaluation Metric
Results
Quantitative Evaluation
Qualitative Evaluation
Conclusion and Future Work
Quantitative Evaluation Result
...and 6 more sections

Figures (8)

Figure 1: Standalone models trained using local data (Inliers: Blue) favor detecting only known outliers (Bank1:Red, Bank2:Green, Bank3:Grey) but are not robust against unknown outliers from other organizations. The aggregated model can adapt to distinct representations and adjust its boundaries to detect unknown outliers.
Figure 2: FL-OD Setup
Figure 3: Results of OD models clientwise. FL-OD models (Fed_Avg+RF, Fed_Prox+RF) outperform the baseline (Baseline_RF) in detecting unknown outliers. Every score reflects the mean and standard deviation from five experiments.
Figure 4: Latent Space Visualization of Baseline, Fed_Avg, and Fed_Prox models
Figure 5: Results of OD models clientwise. FL-OD models (Fed_Avg+RF, Fed_Prox+RF) outperform the baseline (Baseline_RF) in detecting unknown outliers. Every score reflects the mean and standard deviation from five experiments.
...and 3 more figures
