SODA: Protecting Proprietary Information in On-Device Machine Learning Models
Akanksha Atrey, Ritwik Sinha, Saayan Mitra, Prashant Shenoy
TL;DR
This paper addresses the risk that ML models deployed on user devices leak proprietary information about the service provider. It proposes SODA, an end-to-end defense that uses an autoencoder-based detector and a leakage-rate metric to identify adversarial, query-based exploitation of on-device models. SODA detects adversarial usage with 89% accuracy in fewer than 50 queries, with minimal impact on service performance, latency, and storage. The authors provide a taxonomy of on-device models, formalize a threat model, and demonstrate two classes of leakage attacks (output-diversity and decision-boundary) on HAR and MNIST data. They validate SODA’s effectiveness and practicality on a Raspberry Pi prototype, including ONNX-based deployment and encryption. The result is a generalizable, on-device defense that protects proprietary service criteria while preserving the latency, privacy, and resilience benefits that motivate on-device deployment.
Abstract
The growth of low-end hardware has led to a proliferation of machine learning-based services in edge applications. These applications gather contextual information about users and provide services, such as personalized offers, through a machine learning (ML) model. A growing practice has been to deploy such ML models on the user's device to reduce latency, maintain user privacy, and minimize continuous reliance on a centralized source. However, deploying ML models on the user's edge device can leak proprietary information about the service provider. In this work, we investigate on-device ML models that are used to provide mobile services and demonstrate how simple attacks can leak proprietary information of the service provider. We show that different adversaries can easily exploit such models to maximize their profit and accomplish content theft. Motivated by the need to thwart such attacks, we present an end-to-end framework, SODA, for deploying and serving ML models on edge devices while defending against adversarial usage. Our results demonstrate that SODA can detect adversarial usage with 89% accuracy in fewer than 50 queries with minimal impact on service performance, latency, and storage.
