RobustFSM: Submodular Maximization in Federated Setting with Malicious Clients
Duc A. Tran, Dung Truong, Duy Le
TL;DR
RobustFSM tackles robustness in Federated Submodular Maximization where malicious clients can mislead the aggregation. It introduces a Byzantine-robust aggregation via two coreset-based global candidates: one built from maximally similar local gradients and another from maximally diverse gradients, with clients selecting the candidate best aligned to their local objective before proceeding with local updates. Empirical results on CIFAR10 and PATHMNIST show RobustFSM substantially outperforms the standard FedCG under targeted attacks (up to 200% improvement in some scenarios) while remaining competitive in no-attack settings. This work pioneers robustness in FSM, providing a practical defense and a benchmark for future studies in federated submodular optimization.$S^* = \arg \max_S \{ f(S) = \frac{1}{|C|} \sum_{i\in C} f_i(S) : S\in\mathcal{I} \}$, with a continuous-greedy based approach and a novel two-coreset aggregation to resist Byzantine clients.
Abstract
Submodular maximization is an optimization problem benefiting many machine learning applications, where we seek a small subset best representing an extremely large dataset. We focus on the federated setting where the data are locally owned by decentralized clients who have their own definitions for the quality of representability. This setting requires repetitive aggregation of local information computed by the clients. While the main motivation is to respect the privacy and autonomy of the clients, the federated setting is vulnerable to client misbehaviors: malicious clients might share fake information. An analogy is backdoor attack in conventional federated learning, but our challenge differs freshly due to the unique characteristics of submodular maximization. We propose RobustFSM, a federated submodular maximization solution that is robust to various practical client attacks. Its performance is substantiated with an empirical evaluation study using real-world datasets. Numerical results show that the solution quality of RobustFSM substantially exceeds that of the conventional federated algorithm when attacks are severe. The degree of this improvement depends on the dataset and attack scenarios, which can be as high as 200%
