High-Performance DBMSs with io_uring: When and How to use it
Matthias Jasny, Muhammad El-Hindi, Tobias Ziegler, Viktor Leis, Carsten Binnig
TL;DR
The paper investigates when and how modern DBMSs can gain from Linux io_uring by examining storage-bound and network-bound scenarios. It shows that naive substitution yields modest gains, while end-to-end architectural design that exploits batching, asynchronous execution, and device-specific optimizations yields substantial throughput improvements. Through two case studies—a buffer-managed storage engine and a high-speed data shuffle over 400 Gbit/s networks—the authors derive practical guidelines and validate them with a PostgreSQL integration that achieves 11–15% improvements in table-scan performance. The work emphasizes that gains depend on workload characteristics and system bottlenecks, offering actionable steps for DBMS builders to leverage io_uring effectively. Overall, the study provides a concrete roadmap for designing I/O-intensive systems that harness io_uring’s unified, asynchronous, and batched I/O capabilities while remaining production-friendly.
Abstract
We study how modern database systems can leverage the Linux io_uring interface for efficient, low-overhead I/O. io_uring is an asynchronous system call batching interface that unifies storage and network operations, addressing limitations of existing Linux I/O interfaces. However, naively replacing traditional I/O interfaces with io_uring does not necessarily yield performance benefits. To demonstrate when io_uring delivers the greatest benefits and how to use it effectively in modern database systems, we evaluate it in two use cases: Integrating io_uring into a storage-bound buffer manager and using it for high-throughput data shuffling in network-bound analytical workloads. We further analyze how advanced io_uring features, such as registered buffers and passthrough I/O, affect end-to-end performance. Our study shows when low-level optimizations translate into tangible system-wide gains and how architectural choices influence these benefits. Building on these insights, we derive practical guidelines for designing I/O-intensive systems using io_uring and validate their effectiveness in a case study of PostgreSQL's recent io_uring integration, where applying our guidelines yields a performance improvement of 14%.
