Masala-CHAI: A Large-Scale SPICE Netlist Dataset for Analog Circuits by Harnessing AI

Jitendra Bhandari; Vineet Bhat; Yuheng He; Hamed Rahmani; Siddharth Garg; Ramesh Karri

Masala-CHAI: A Large-Scale SPICE Netlist Dataset for Analog Circuits by Harnessing AI

Jitendra Bhandari, Vineet Bhat, Yuheng He, Hamed Rahmani, Siddharth Garg, Ramesh Karri

TL;DR

Masala-CHAI tackles automated SPICE netlist generation for analog circuits by building the largest open dataset of labeled schematics and SPICE netlists from textbooks. It introduces a multi-stage pipeline combining YOLOv8-based component detection, Deep Hough Transform-based net detection, targeted prompt tuning, and an LLM-based verification loop, enabling end-to-end netlist generation from schematic images. Fine-tuning LLMs on this dataset within the AnalogCoder framework yields substantial improvements in Pass@1 and broad task coverage, including notable gains on challenging benchmarks. The work provides an open-source resource and a scalable approach to accelerate analog circuit design automation.

Abstract

Masala-CHAI is a fully automated framework leveraging large language models (LLMs) to generate Simulation Programs with Integrated Circuit Emphasis (SPICE) netlists. It addresses a long-standing challenge in circuit design automation: automating netlist generation for analog circuits. Automating this workflow could accelerate the creation of fine-tuned LLMs for analog circuit design and verification. In this work, we identify key challenges in automated netlist generation and evaluate multimodal capabilities of state-of-the-art LLMs, particularly GPT-4, in addressing them. We propose a three-step workflow to overcome existing limitations: labeling analog circuits, prompt tuning, and netlist verification. This approach enables end-to-end SPICE netlist generation from circuit schematic images, tackling the persistent challenge of accurate netlist generation. We utilize Masala-CHAI to collect a corpus of 7,500 schematics that span varying complexities in 10 textbooks and benchmark various open source and proprietary LLMs. Models fine-tuned on Masala-CHAI when used in LLM-agentic frameworks such as AnalogCoder achieve a notable 46% improvement in Pass@1 scores. We open-source our dataset and code for community-driven development.

Masala-CHAI: A Large-Scale SPICE Netlist Dataset for Analog Circuits by Harnessing AI

TL;DR

Abstract

Masala-CHAI: A Large-Scale SPICE Netlist Dataset for Analog Circuits by Harnessing AI

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)