Kolmogorov-Arnold Network for Transistor Compact Modeling
Rodion Novkin, Hussam Amrouch
TL;DR
The paper tackles the interpretability bottleneck in neural-network–based transistor compact modeling by introducing Kolmogorov-Arnold Network (KAN) and Fourier KAN (FKAN) to achieve both precision and symbolic interpretability. It benchmarks these architectures against the industry-standard BSIM-CMG and MLP on 7 nm FinFET data, examining ID, QD, and QS predictions and exploring symbolic regression capabilities. Key contributions include the design of edge-activated, spline- and Fourier-based KANs, a comprehensive experimental framework with multiple datasets, and an iterative symbolic-regression approach to extract interpretable formulas. While KANs can deliver high prediction accuracy, derivative behavior poses challenges for circuit-level simulations, highlighting a need for domain-knowledge-guided refinements to realize fully interpretable, scalable transistor models for advanced technology nodes.
Abstract
Neural network (NN)-based transistor compact modeling has recently emerged as a transformative solution for accelerating device modeling and SPICE circuit simulations. However, conventional NN architectures, despite their widespread adoption in state-of-the-art methods, primarily function as black-box problem solvers. This lack of interpretability significantly limits their capacity to extract and convey meaningful insights into learned data patterns, posing a major barrier to their broader adoption in critical modeling tasks. This work introduces, for the first time, Kolmogorov-Arnold network (KAN) for the transistor - a groundbreaking NN architecture that seamlessly integrates interpretability with high precision in physics-based function modeling. We systematically evaluate the performance of KAN and Fourier KAN for FinFET compact modeling, benchmarking them against the golden industry-standard compact model and the widely used MLP architecture. Our results reveal that KAN and FKAN consistently achieve superior prediction accuracy for critical figures of merit, including gate current, drain charge, and source charge. Furthermore, we demonstrate and improve the unique ability of KAN to derive symbolic formulas from learned data patterns - a capability that not only enhances interpretability but also facilitates in-depth transistor analysis and optimization. This work highlights the transformative potential of KAN in bridging the gap between interpretability and precision in NN-driven transistor compact modeling. By providing a robust and transparent approach to transistor modeling, KAN represents a pivotal advancement for the semiconductor industry as it navigates the challenges of advanced technology scaling.
