Lyra: A Hardware-Accelerated RISC-V Verification Framework with Generative Model-Based Processor Fuzzing
Juncheng Huo, Yunfan Gao, Xinxin Liu, Sa Wang, Yungang Bao, Xitong Gao, Kan Shi
TL;DR
Lyra introduces a GPU-CPU-FPGA co-verification framework for RISC-V that overcomes semantic blindness in traditional fuzzing by training a domain-specialized ISA-aware generator, LyraGen, and executing both DUT and reference on FPGA with on-chip coverage. The training phase builds LyraGen from scratch using a novel token-based RISC-V encoding and supervised <instruction, coverage> data, while the inference phase enforces legality and memory-correctness before hardware-accelerated verification. Empirically, Lyra achieves higher coverage and orders-of-magnitude faster end-to-end verification than state-of-the-art software fuzzers, with lower convergence difficulty as coverage grows. The work demonstrates the practicality of combining ISA-aware generation with FPGA-accelerated differential checking to dramatically accelerate processor verification workflows.
Abstract
As processor designs grow more complex, verification remains bottlenecked by slow software simulation and low-quality random test stimuli. Recent research has applied software fuzzers to hardware verification, but these rely on semantically blind random mutations that may generate shallow, low-quality stimuli unable to explore complex behaviors. These limitations result in slow coverage convergence and prohibitively high verification costs. In this paper, we present Lyra, a heterogeneous RISC-V verification framework that addresses both challenges by pairing hardware-accelerated verification with an ISA-aware generative model. Lyra executes the DUT and reference model concurrently on an FPGA SoC, enabling high-throughput differential checking and hardware-level coverage collection. Instead of creating verification stimuli randomly or through simple mutations, we train a domain-specialized generative model, LyraGen, with inherent semantic awareness to generate high-quality, semantically rich instruction sequences. Empirical results show Lyra achieves up to $1.27\times$ higher coverage and accelerates end-to-end verification by up to $107\times$ to $3343\times$ compared to state-of-the-art software fuzzers, while consistently demonstrating lower convergence difficulty.
