ChipMind: Retrieval-Augmented Reasoning for Long-Context Circuit Design Specifications
Changwen Xing, SamZaak Wong, Xinlai Wan, Yanfeng Lu, Mengli Zhang, Zebin Ma, Lei Qi, Zhengxiong Li, Nan Guan, Zhe Jiang, Xi Wang, Jun Yang
TL;DR
This work addresses the challenge of performing deep, multi-hop reasoning over long industrial IC specifications, which typically exceed standard LLM context windows. It introduces ChipMind, a knowledge graph-augmented framework that constructs a Circuit KG (ChipKG) from specifications and uses adaptive, CSA-guided retrieval to enable iterative, verifiable reasoning. Evaluated on SpecEval-QA, ChipMind achieves state-of-the-art performance with an average improvement of 34.59% and up to 72.73% over strong baselines, and introduces Atomic-ROUGE as a factuality-focused evaluation metric with high correlation to human judgments. The approach demonstrates a practical path to deploying LLM-aided hardware design in industry by improving context coverage, reasoning reliability, and evaluation rigor.
Abstract
While Large Language Models (LLMs) demonstrate immense potential for automating integrated circuit (IC) development, their practical deployment is fundamentally limited by restricted context windows. Existing context-extension methods struggle to achieve effective semantic modeling and thorough multi-hop reasoning over extensive, intricate circuit specifications. To address this, we introduce ChipMind, a novel knowledge graph-augmented reasoning framework specifically designed for lengthy IC specifications. ChipMind first transforms circuit specifications into a domain-specific knowledge graph ChipKG through the Circuit Semantic-Aware Knowledge Graph Construction methodology. It then leverages the ChipKG-Augmented Reasoning mechanism, combining information-theoretic adaptive retrieval to dynamically trace logical dependencies with intent-aware semantic filtering to prune irrelevant noise, effectively balancing retrieval completeness and precision. Evaluated on an industrial-scale specification reasoning benchmark, ChipMind significantly outperforms state-of-the-art baselines, achieving an average improvement of 34.59% (up to 72.73%). Our framework bridges a critical gap between academic research and practical industrial deployment of LLM-aided Hardware Design (LAD).
