DeepOHeat-v1: Efficient Operator Learning for Fast and Trustworthy Thermal Simulation and Optimization in 3D-IC Design
Xinling Yu, Ziyue Liu, Hai Li, Yixing Li, Xin Ai, Zhiyu Zeng, Ian Young, Zheng Zhang
TL;DR
DeepOHeat-v1 advances operator learning for 3D-IC thermals by (1) replacing fixed-activation trunk nets with Kolmogorov–Arnold Networks to capture multi-scale patterns, (2) introducing separable training to dramatically cut physics-informed training cost, and (3) adding a confidence-based hybrid optimization that couples fast neural predictions with GMRES refinement for reliability. The approach yields accuracy comparable to high-fidelity finite-difference solvers while delivering substantial speedups in optimization workflows—up to 70.6x in floorplanning—and enables high-resolution analyses previously limited by memory. The combination of adaptive basis learning, scalable training, and trustworthiness assessment makes DeepOHeat-v1 a practical tool for thermal-aware design exploration in complex 3D-IC architectures. Open-source code is provided to facilitate adoption and further development.
Abstract
Thermal analysis is crucial in 3D-IC design due to increased power density and complex heat dissipation paths. Although operator learning frameworks such as DeepOHeat~\cite{liu2023deepoheat} have demonstrated promising preliminary results in accelerating thermal simulation, they face critical limitations in prediction capability for multi-scale thermal patterns, training efficiency, and trustworthiness of results during design optimization. This paper presents DeepOHeat-v1, an enhanced physics-informed operator learning framework that addresses these challenges through three key innovations. First, we integrate Kolmogorov-Arnold Networks with learnable activation functions as trunk networks, enabling an adaptive representation of multi-scale thermal patterns. This approach achieves a 1.25x and 6.29x reduction in error in two representative test cases. Second, we introduce a separable training method that decomposes the basis function along the coordinate axes, achieving 62x training speedup and 31x GPU memory reduction in our baseline case, and enabling thermal analysis at resolutions previously infeasible due to GPU memory constraints. Third, we propose a confidence score to evaluate the trustworthiness of the predicted results, and further develop a hybrid optimization workflow that combines operator learning with finite difference (FD) using Generalized Minimal Residual (GMRES) method for incremental solution refinement, enabling efficient and trustworthy thermal optimization. Experimental results demonstrate that DeepOHeat-v1 achieves accuracy comparable to optimization using high-fidelity finite difference solvers, while speeding up the entire optimization process by $70.6\times$ in our test cases, effectively minimizing the peak temperature through optimal placement of heat-generating components. Open source code is available at https://github.com/xlyu0127/DeepOHeat-v1.
