PEFA-AI: Advancing Open-source LLMs for RTL generation using Progressive Error Feedback Agentic-AI
Athma Narayanan, Mahesh Subedar, Omesh Tickoo
TL;DR
The paper tackles autonomous RTL code generation by introducing PEFA-AI, a Progressive Error Feedback Agentic-AI framework that orchestrates multiple specialized LLMs and hardware simulators to iteratively self-correct RTL designs. It leverages a four-loop progressive feedback mechanism to validate compilation and functional correctness, while enabling synthesizable results and IP protection through black-box test benches. Benchmarking on VerilogEval and RTLLM1.1 across open- and closed-source models demonstrates state-of-the-art pass rates and improved token efficiency, significantly reducing total LLM calls compared to non-agentic baselines. The approach offers a scalable, modular path toward privacy-preserving, autonomous hardware design augmentation and highlights future work in broader RTL tasks and PPA-aware optimization integration.
Abstract
We present an agentic flow consisting of multiple agents that combine specialized LLMs and hardware simulation tools to collaboratively complete the complex task of Register Transfer Level (RTL) generation without human intervention. A key feature of the proposed flow is the progressive error feedback system of agents (PEFA), a self-correcting mechanism that leverages iterative error feedback to progressively increase the complexity of the approach. The generated RTL includes checks for compilation, functional correctness, and synthesizable constructs. To validate this adaptive approach to code generation, benchmarking is performed using two opensource natural language-to-RTL datasets. We demonstrate the benefits of the proposed approach implemented on an open source agentic framework, using both open- and closed-source LLMs, effectively bridging the performance gap between them. Compared to previously published methods, our approach sets a new benchmark, providing state-of-the-art pass rates while being efficient in token counts.
