GuaranTEE: Towards Attestable and Private ML with CCA

Sandra Siby; Sina Abdollahi; Mohammad Maheri; Marios Kogias; Hamed Haddadi

TL;DR

The paper addresses the challenge of private and auditable ML deployment on edge devices by introducing GuaranTEE, a framework that runs provider models inside Arm's Confidential Computing Architecture (CCA) realms. It details a threat model and a multi-step pipeline for provisioning, attesting, and running models within a realm, and implements a prototype on Arm's Fixed Virtual Platforms to assess feasibility. Preliminary results show a roughly 1.7x instruction overhead for realm-based inference and substantial setup costs, driven by realm creation and memory provisioning, with attestation limitations tied to current hardware simulators. The work highlights practical constraints and outlines architectural and ecosystem improvements needed to realize a fully private, attestable edge ML deployment, including enhanced attestation, secure I/O for inputs/outputs, per-realm policy enforcement, and better availability guarantees.

Abstract

Machine-learning (ML) models are increasingly being deployed on edge devices to provide a variety of services. However, their deployment is accompanied by challenges in model privacy and auditability. Model providers want to (i) ensure that their proprietary models are not exposed to third parties; and (ii) obtain attestations that their genuine models are operating on edge devices in accordance with the service agreement with the user. Existing measures to address these challenges have been hindered by issues such as high overheads and limited capability (processing/secure memory) on edge devices. In this work, we propose GuaranTEE, a framework to provide attestable private machine learning on the edge. GuaranTEE uses Confidential Computing Architecture (CCA), Arm's latest architectural extension that allows for the creation and deployment of dynamic Trusted Execution Environments (TEEs) within which models can be executed. We evaluate CCA's feasibility to deploy ML models by developing, evaluating, and openly releasing a prototype. We also suggest improvements to CCA to facilitate its use in protecting the entire ML deployment pipeline on edge devices.

Paper Structure (12 sections, 2 figures, 2 tables)

This paper contains 12 sections, 2 figures, 2 tables.

Introduction
Model protection on the edge
Towards CCA
GuaranTEE architecture
System and Threat Model
GuaranTEE Pipeline
Implementation
Preliminary evaluation
Inference overhead
Realm setup
Considerations for ML deployment using CCA
Conclusion

Figures (2)

Figure 1: Arm CCA software architecture. CCA introduces two new execution environments: realm and root. CCA's architecture allows for the creation of dynamic, hardware-protected enclaves called realms. Unlike in TrustZone, the secure monitor runs in the root physical address (PA) space, which is separate from the secure world PA space.
Figure 2: Overview of GuaranTEE outlining the steps required for running an ML model on the client edge device. We show a simplified view of the normal and realm worlds within the client. The client's steps are: (1) obtaining the realm image from the verifier; (2) creating and activating a realm VM; (3) establishing a connection with the provider; (4) realm attestation; (5) obtaining the model from the provider; (6) announcing model readiness to the normal world; (7) running inference; and (8) performing model updates.
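
The eight client steps in the Figure 2 caption can be read as an ordered pipeline. The following is a minimal sketch of that ordering only, not the authors' implementation: the names `Step` and `ClientPipeline` are hypothetical, and the rule that inference and model updates may repeat once the model has been announced as ready is an assumption made for illustration.

```python
from enum import IntEnum


class Step(IntEnum):
    """Client-side steps, numbered as in the Figure 2 caption."""
    OBTAIN_REALM_IMAGE = 1      # realm image comes from the verifier
    CREATE_ACTIVATE_REALM = 2   # create and activate the realm VM
    CONNECT_TO_PROVIDER = 3
    ATTEST_REALM = 4
    OBTAIN_MODEL = 5            # model comes from the provider
    ANNOUNCE_READY = 6          # realm tells the normal world the model is ready
    RUN_INFERENCE = 7
    UPDATE_MODEL = 8


class ClientPipeline:
    """Hypothetical tracker that enforces step ordering and nothing else."""

    def __init__(self):
        self.completed = []

    def advance(self, step):
        if Step.ANNOUNCE_READY in self.completed:
            # Assumption: after setup, inference and updates may repeat freely.
            if step not in (Step.RUN_INFERENCE, Step.UPDATE_MODEL):
                raise ValueError(f"{step.name} is a setup step; setup is finished")
        else:
            # Setup steps 1 to 6 must happen strictly in order.
            expected = Step(len(self.completed) + 1)
            if step != expected:
                raise ValueError(f"expected {expected.name}, got {step.name}")
        self.completed.append(step)
```

In this sketch, calling `advance(Step.OBTAIN_MODEL)` on a fresh `ClientPipeline` raises `ValueError`, which mirrors the caption's ordering: the model is only fetched after the realm has been created, connected, and attested.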
