ConstMig: Enabling Secure Live Migration of Large Intel SGX-based applications
Sandeep Kumar, Abhisek Panda, Smruti R. Sarangi
TL;DR
This work addresses the challenge of securely migrating large SGX-based applications with minimal downtime, where traditional stop-and-copy and hardware-assisted options fall short due to SGX’s restrictions. The authors introduce ConstMig, a two-component solution combining a CRIU-powered Migration Manager and an SGX-resident ConstMigLib that saves and restores only the enclave’s memory—the heap—while maintaining data integrity and freshness through encryption, integrity checks, and fine-grained access tracking. They implement two complementary mechanisms, Fault Tracker and Access Tracker, to prevent use-before-restore in restored enclaves, and evaluate ConstMig against MigSGX and (where available) TDX, showing downtime reductions of 77–96% and end-to-end throughput that often matches native once migration completes. The results indicate significant practical impact for data-center operators seeking high availability, low-latency migrations, and seamless patching of SGX-based services without hardware changes. Overall, ConstMig enables scalable, secure live migration for large SGX enclaves, advancing practical deployment of TEEs in cloud environments.
Abstract
Cloud service providers are adopting Trusted Execution Environments (TEEs) to provide hardware-guaranteed security to applications running on remote, untrusted data centers. However, migrating such applications still relies on the decade-old stop-and-copy method, which introduces large downtimes. Modern live-migration approaches such as pre-copy and post-copy do not work for TEE-based applications due to hardware-enforced restrictions. We propose ConstMig, a near-zero-downtime live-migration mechanism for large memory-footprint TEE-based applications. ConstMig is fully compatible with containers, virtual machines (VMs), and microVMs. Our prototype, built on Intel SGX, achieves near-zero downtime irrespective of enclave size and requires no additional hardware support. ConstMig reduces total downtime by 77 - 96% for a suite of SGX applications with multi-gigabyte memory footprints compared to state-of-the-art TEE-based migration solutions such as MigSGX.
