How To Think About End-To-End Encryption and AI: Training, Processing, Disclosure, and Consent
Mallory Knodel, Andrés Fábrega, Daniella Ferrari, Jacob Leiken, Betty Li Hou, Derek Yen, Sam de Alfaro, Kyunghyun Cho, Sunoo Park
TL;DR
This paper interrogates whether end-to-end encryption (E2EE) can coexist with contemporary AI integrations in messaging platforms. It develops a framework that simultaneously analyzes cryptographic guarantees, practical deployment, and legal/regulatory considerations, concluding that training on E2EE data is incompatible with E2EE, while inference can be compatible only under strict, endpoint-local or tightly controlled setups. The authors catalog real-world deployments (Apple, Samsung, Meta) to illustrate current practices and gaps, and they propose four core recommendations: preserve E2EE in processing, avoid unqualified E2EE claims when third parties access data, require opt-in consent for AI features, and apply rigorous, transparent disclosures. The work emphasizes that unlocking responsible AI in E2EE contexts demands coordinated technical design and robust regulatory and consumer-protection frameworks to prevent systemic erosion of confidentiality and user trust.
Abstract
End-to-end encryption (E2EE) has become the gold standard for securing communications, bringing strong confidentiality and privacy guarantees to billions of users worldwide. However, the current push towards widespread integration of artificial intelligence (AI) models, including in E2EE systems, raises some serious security concerns. This work performs a critical examination of the (in)compatibility of AI models and E2EE applications. We explore this on two fronts: (1) the integration of AI "assistants" within E2EE applications, and (2) the use of E2EE data for training AI models. We analyze the potential security implications of each, and identify conflicts with the security guarantees of E2EE. Then, we analyze legal implications of integrating AI models in E2EE applications, given how AI integration can undermine the confidentiality that E2EE promises. Finally, we offer a list of detailed recommendations based on our technical and legal analyses, including: technical design choices that must be prioritized to uphold E2EE security; how service providers must accurately represent E2EE security; and best practices for the default behavior of AI features and for requesting user consent. We hope this paper catalyzes an informed conversation on the tensions that arise between the brisk deployment of AI and the security offered by E2EE, and guides the responsible development of new AI features.
