A Generative Framework for Low-Cost Result Validation of Machine Learning-as-a-Service Inference
Abhinav Kumar, Miguel A. Guirao Aguilera, Reza Tourani, Satyajayant Misra
TL;DR
This paper addresses the challenge of validating the integrity of outsourced ML inference in MLaaS, especially for real-time edge/AR/VR applications. It proposes Fides, a framework that combines Greedy Distillation Transfer Learning (GDTL) to produce a compact verification model running in a Trusted Execution Environment with GAN-based client-side attack detection and re-classification. The approach achieves high attack detection (up to 98%) and re-classification accuracy (up to 94%), while delivering significant system-speed improvements over prior verifiable ML approaches. The work demonstrates practical feasibility through extensive experiments on CIFAR-10/100 and ImageNet across multiple architectures, and shows favorable scalability and overheads for edge deployments, highlighting its potential for secure, real-time MLaaS in edge ecosystems.
Abstract
The growing popularity of Machine Learning (ML) has led to its deployment in various sensitive domains, which has resulted in significant research focused on ML security and privacy. However, in some applications, such as Augmented/Virtual Reality, integrity verification of the outsourced ML tasks is more critical--a facet that has not received much attention. Existing solutions, such as multi-party computation and proof-based systems, impose significant computation overhead, which makes them unfit for real-time applications. We propose Fides, a novel framework for real-time integrity validation of ML-as-a-Service (MLaaS) inference. Fides features a novel and efficient distillation technique--Greedy Distillation Transfer Learning--that dynamically distills and fine-tunes a space and compute-efficient verification model for verifying the corresponding service model while running inside a trusted execution environment. Fides features a client-side attack detection model that uses statistical analysis and divergence measurements to identify, with a high likelihood, if the service model is under attack. Fides also offers a re-classification functionality that predicts the original class whenever an attack is identified. We devised a generative adversarial network framework for training the attack detection and re-classification models. The evaluation shows that Fides achieves an accuracy of up to 98% for attack detection and 94% for re-classification.
