Designing Transport-Level Encryption for Datacenter Networks
Tianyi Gao, Xinshu Ma, Suhas Narreddy, Eugenio Luo, Steven W. D. Chien, Michio Honda
TL;DR
This work introduces SMT, a secure, transport-level encryption framework for datacenter networks that integrates TLS-based encryption with a native, message-based transport (initially Homa/NDP) to support RPC workloads. SMT preserves TLS security properties while enabling per-message record sequences and unordered message delivery, leveraging NIC offloads and TSO-compatible framing. The authors implement SMT as a Linux kernel patch to Homa/Linux and demonstrate up to 41% throughput gains and up to 35% latency reductions compared with TLS/TCP, along with favorable results for Redis and NVMe-oF workloads. A thorough analysis covers threat models, key exchange mechanisms (including 0-RTT via SMT-tickets), and security properties, arguing that SMT offers a practical path to replacing TLS/TCP in datacenters without sacrificing hardware offload or security guarantees. The work also discusses trade-offs, limitations, and future directions toward broader applicability to other datacenter transports like NDP and beyond.
Abstract
Cloud applications need network data encryption to isolate from other tenants and protect their data from potential eavesdroppers in the network infrastructure. This paper presents SMT, a protocol design for emerging datacenter transport protocols, such as NDP and Homa, to integrate data encryption. SMT integrates TLS-based encryption with a message-based transport protocol that supports efficient Remote Procedure Calls (RPCs), a common workload in datacenters. This architecture enables the use of per-message record sequence number spaces in a secure session, while ensuring unique message identities to prevent replay attacks. It also enables the use of existing NIC offloads designed for TLS over TCP, while being a native transport protocol alongside TCP and UDP. We implement SMT in the Linux kernel by extending Homa/Linux and improve RPC throughput by up to 41 % and latency by up to 35 % in comparison to TLS/TCP.
