IR-Aware ECO Timing Optimization Using Reinforcement Learning

Wenjing Jiang; Vidya A. Chhabria; Sachin S. Sapatnekar

IR-Aware ECO Timing Optimization Using Reinforcement Learning

Wenjing Jiang, Vidya A. Chhabria, Sachin S. Sapatnekar

TL;DR

This work addresses IR-induced timing failures in late-stage IC design by coupling IR-aware timing analysis with ECO gate sizing through a reinforcement learning framework guided by Lagrangian relaxation. A relational graph convolutional network (R-GCN) agent sequentially upsizes or downsizes gates on a problem-constrained, post-P&R netlist, using a state that encompasses negative slack regions and an action mask that limits changes to critical-area cells. The LR-based reward and LM updates enable efficient constrained optimization, while problem-specific strategies (action-space reduction, clock-schedule, and LM adaptation) enhance training efficiency and transferability. Evaluation on 45nm benchmarks shows the approach shifts the power-delay Pareto frontier left, reduces runtime, and achieves consistent timing recovery with smaller placement perturbations, with zero-shot and fine-tuning transferability across designs. Overall, the method offers a practical, transferable framework for IR-aware ECO timing that blends traditional LR insights with modern graph-based RL for constrained, low-perturbation optimization.

Abstract

Engineering change orders (ECOs) in late stages make minimal design fixes to recover from timing shifts due to excessive IR drops. This paper integrates IR-drop-aware timing analysis and ECO timing optimization using reinforcement learning (RL). The method operates after physical design and power grid synthesis, and rectifies IR-drop-induced timing degradation through gate sizing. It incorporates the Lagrangian relaxation (LR) technique into a novel RL framework, which trains a relational graph convolutional network (R-GCN) agent to sequentially size gates to fix timing violations. The R-GCN agent outperforms a classical LR-only algorithm: in an open 45nm technology, it (a) moves the Pareto front of the delay-power tradeoff curve to the left (b) saves runtime over the prior approaches by running fast inference using trained models, and (c) reduces the perturbation to placement by sizing fewer cells. The RL model is transferable across timing specifications and to unseen designs with fine tuning.

IR-Aware ECO Timing Optimization Using Reinforcement Learning

TL;DR

Abstract

IR-Aware ECO Timing Optimization Using Reinforcement Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (7)