Table of Contents
Fetching ...

ELMo-Tune-V2: LLM-Assisted Full-Cycle Auto-Tuning to Optimize LSM-Based Key-Value Stores

Viraj Thakkar, Qi Lin, Kenanya Keandra Adriel Prasetyo, Raden Haryosatyo Wisjnunandono, Achmad Imam Kistijantoro, Reza Fuad Rachmadi, Zhichao Cao

TL;DR

The paper tackles the complexity of tuning LSM-KVS systems with hundreds of configurable options under dynamic workloads. It introduces ELMo-Tune-V2, an LLM-driven framework that jointly characterizes workloads, models benchmarks, and tunes configurations across offline and real-time settings. The approach combines LLM-based workload synthesis, iterative macro/micro tuning, and adaptive real-time adjustments via a JSON-based interface, demonstrating substantial end-to-end gains on RocksDB and upper-layer applications. These results suggest a practical path to automated, cross-domain optimization of data storage systems in production environments, with robust performance across storage media and workload shifts.

Abstract

Log-Structured Merge-tree-based Key-Value Store (LSM-KVS) is a foundational storage engine serving diverse modern workloads, systems, and applications. To suit varying use cases, LSM-KVS allows a vast configuration space that controls core parameters like compaction, flush, and cache sizes, each consuming a shared pool of CPU, Memory, and Storage resources. Navigating the LSM-KVS configuration space necessitates knowledge of the impact of each configuration on the expected workload and underlying hardware. Beyond expensive and time-intensive human-expert-based tuning, existing LSM-KVS tuning solutions focus on tuning with specific workload expectations while limited to a narrow subset of parameters. This paper introduces ELMo-Tune-V2, a framework that integrates Large Language Models (LLMs) at its foundation to demonstrate the potential of applying modern LLMs in data system optimization problems. ELMo-Tune-V2 leverages the contextual reasoning, cross-domain, and generative capabilities of LLMs to perform 1) self-navigated characterization and modeling of LSM-KVS workloads, 2) automatic tuning across a broad parameter space using cross-domain knowledge, and 3) real-time dynamic configuration adjustments for LSM-KVS. ELMo-Tune-V2 integrates three innovations: LLM-based workload synthesis for adaptive benchmark generation, feedback-driven iterative fine-tuning for configuration refinement, and real-time tuning to handle evolving workloads. Through detailed evaluation using RocksDB under several real-world applications across diverse scenarios, ELMo-Tune-V2 achieves performance improvements up to ~14X our YCSB benchmarks compared against default RocksDB configurations, and our end-to-end tests with upper-level applications, NebulaGraph and Kvrocks, demonstrate performance gains of 34% and 26%, respectively.

ELMo-Tune-V2: LLM-Assisted Full-Cycle Auto-Tuning to Optimize LSM-Based Key-Value Stores

TL;DR

The paper tackles the complexity of tuning LSM-KVS systems with hundreds of configurable options under dynamic workloads. It introduces ELMo-Tune-V2, an LLM-driven framework that jointly characterizes workloads, models benchmarks, and tunes configurations across offline and real-time settings. The approach combines LLM-based workload synthesis, iterative macro/micro tuning, and adaptive real-time adjustments via a JSON-based interface, demonstrating substantial end-to-end gains on RocksDB and upper-layer applications. These results suggest a practical path to automated, cross-domain optimization of data storage systems in production environments, with robust performance across storage media and workload shifts.

Abstract

Log-Structured Merge-tree-based Key-Value Store (LSM-KVS) is a foundational storage engine serving diverse modern workloads, systems, and applications. To suit varying use cases, LSM-KVS allows a vast configuration space that controls core parameters like compaction, flush, and cache sizes, each consuming a shared pool of CPU, Memory, and Storage resources. Navigating the LSM-KVS configuration space necessitates knowledge of the impact of each configuration on the expected workload and underlying hardware. Beyond expensive and time-intensive human-expert-based tuning, existing LSM-KVS tuning solutions focus on tuning with specific workload expectations while limited to a narrow subset of parameters. This paper introduces ELMo-Tune-V2, a framework that integrates Large Language Models (LLMs) at its foundation to demonstrate the potential of applying modern LLMs in data system optimization problems. ELMo-Tune-V2 leverages the contextual reasoning, cross-domain, and generative capabilities of LLMs to perform 1) self-navigated characterization and modeling of LSM-KVS workloads, 2) automatic tuning across a broad parameter space using cross-domain knowledge, and 3) real-time dynamic configuration adjustments for LSM-KVS. ELMo-Tune-V2 integrates three innovations: LLM-based workload synthesis for adaptive benchmark generation, feedback-driven iterative fine-tuning for configuration refinement, and real-time tuning to handle evolving workloads. Through detailed evaluation using RocksDB under several real-world applications across diverse scenarios, ELMo-Tune-V2 achieves performance improvements up to ~14X our YCSB benchmarks compared against default RocksDB configurations, and our end-to-end tests with upper-level applications, NebulaGraph and Kvrocks, demonstrate performance gains of 34% and 26%, respectively.

Paper Structure

This paper contains 22 sections, 18 figures, 2 tables.

Figures (18)

  • Figure 1: Different Components and their Configurable Options in LSM-KVS.
  • Figure 2: LSM-KVS performance on different Hardware, Workload, and Configuration.
  • Figure 3: Workflow to achieve performant tuning configuration for LSM-KVS
  • Figure 4: LLM Generational and Analytical Capability.
  • Figure 5: ELMo-Tune-V2 Framework
  • ...and 13 more figures