Femur: A Flexible Framework for Fast and Secure Querying from Public Key-Value Store
Jiaoyi Zhang, Liqiang Peng, Mo Sha, Weiran Liu, Xiang Li, Sheng Wang, Feifei Li, Mingyu Gao, Huanchen Zhang
TL;DR
Femur addresses the challenge of privately querying large public key-value stores without incurring the prohibitive cost of full PIR. It introduces distance-based indistinguishability, which relaxes privacy guarantees in a controlled and provable way, and combines a learned index (PGM-index) with an offline initialization phase and an online query phase. The framework supports two retrieval modes, plaintext download and a novel variable-range PIR, and uses a lightweight cost model to select the cheaper scheme for each query. Experiments on a 200M-record dataset show speedups over state-of-the-art PIR systems of up to 163.9X under relaxed privacy, with a practical offline initialization time, which suggests the approach can scale to real-world deployments such as Redis integrations.
Abstract
With increasing demands for privacy, it has become necessary to protect sensitive user query data when accessing public key-value databases. Existing Private Information Retrieval (PIR) schemes provide full security but scale poorly, which limits their applicability in large-scale deployments. We argue that in many real-world scenarios, a more practical solution should let users choose the privacy level of their queries in a theoretically guided way, balancing security and performance according to their specific needs. To provide provable guarantees, we introduce a novel concept of distance-based indistinguishability, which allows users to relax their security requirements with confidence. We then design Femur, an efficient framework for securely querying public key-value stores with flexible security and performance trade-offs. It uses a space-efficient learned index to convert query keys into storage locations, obfuscates these locations with extra noise provably derived from the distance-based indistinguishability theory, and sends the expanded range to the server. The server then adaptively selects the best scheme to retrieve the data. We also propose a novel variable-range PIR scheme optimized for bandwidth-constrained environments. Experiments show that Femur outperforms state-of-the-art designs even when ensuring the same full security level. When users are willing to relax their privacy requirements, Femur raises the speedup to as much as 163.9X, demonstrating an effective trade-off between security and performance.
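The query flow described above (key to position, position to obfuscated range, range to retrieval scheme) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: `predict_position` uses binary search as a stand-in for the learned index, `obfuscated_range` hides the true position at a uniformly random offset inside a fixed-width window rather than using the noise distribution derived from distance-based indistinguishability, and `choose_scheme` is a hypothetical linear cost model with made-up parameters.

```python
import bisect
import random


def predict_position(sorted_keys, key):
    """Stand-in for the learned index: rank of `key` via binary search."""
    return bisect.bisect_left(sorted_keys, key)


def obfuscated_range(pos, width, n, rng):
    """Return a half-open window [lo, hi) of `width` slots that contains `pos`.

    Placeholder noise: the true position sits at a uniformly random offset
    inside the window. Clamping at the array edges biases that offset, so
    this is NOT the provable noise mechanism, only an illustration of
    "expand the location into a range before sending it".
    """
    width = min(width, n)
    offset = rng.randrange(width)
    lo = max(0, min(pos - offset, n - width))
    return lo, lo + width


def choose_scheme(width, record_bytes, bandwidth_bytes_per_s,
                  pir_seconds_per_record, pir_response_bytes):
    """Hypothetical cost model: pick the retrieval mode with lower latency.

    Plaintext download pays bandwidth for the whole range; PIR pays server
    compute per record in the range plus a small fixed-size response.
    """
    plaintext_cost = width * record_bytes / bandwidth_bytes_per_s
    pir_cost = (width * pir_seconds_per_record
                + pir_response_bytes / bandwidth_bytes_per_s)
    return "plaintext" if plaintext_cost <= pir_cost else "pir"


# Example: 500 even keys, query key 100, window of 16 slots.
keys = list(range(0, 1000, 2))
rng = random.Random(0)
pos = predict_position(keys, 100)
lo, hi = obfuscated_range(pos, 16, len(keys), rng)
scheme = choose_scheme(hi - lo, 256, 1e6, 1e-3, 1e4)
```

In this toy model a wider window means stronger hiding of the true position but more work per query, which is the trade-off the framework exposes to the user. Small ranges tend to favor plaintext download, while large ranges on slow links tend to favor PIR.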
