Scaling Homomorphic Applications in Deployment
Ryan Marinelli, Angelica Chowdhury
TL;DR
This work tackles scalable deployment of privacy-preserving computation by integrating Fully Homomorphic Encryption ($FHE$) with a production-like movie-recommendation pipeline built on Concrete-ML. It demonstrates a complete pipeline: $FHE$-enabled inference in a containerized Flask app, orchestrated on Kubernetes, with an RL agent (PPO) learning to tune replica deployment under encryption-induced latency constraints. The study introduces an explicit MDP for deployment, robust self-healing mechanisms, and stress-testing to train a resilient controller. Key finding shows a practical replica range around $3$–$6$ balances latency and resource usage, signaling the viability of scalable, secure inference in real-world deployments.
Abstract
In this endeavor, a proof-of-concept homomorphic application is developed to determine the production readiness of encryption ecosystems. A movie recommendation app is implemented for this purpose and productionized through containerization and orchestration. By tuning deployment configurations, the computational limitations of Fully Homomorphic Encryption (FHE) are mitigated through additional infrastructure optimizations Index Terms: Reinforcement Learning, Orchestration, Homomorphic Encryption
