FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning
Shang Wang, Deepak Ranganatha Sastry Mamillapalli, Tianpei Yang, Matthew E. Taylor
TL;DR
This work tackles FPGA placement by framing it as a Markov decision process and applying deep reinforcement learning to minimize wirelength. It introduces a two-branch architecture with board and netlist encoders and uses Proximal Policy Optimization (PPO) with invalid-action masking to learn a placement policy, while tackling the large search space via a divide-and-conquer decomposition into subtasks. The key contributions are a novel state representation that fuses board observations with netlist context and a systematic decomposition paradigm with multiple weight-sharing configurations, plus empirical evidence showing improved learning efficiency over a non-decomposed baseline and informative findings on weight reuse and exploration. Although not yet surpassing the VTR baseline in all cases, the approach demonstrates feasibility and lays groundwork for multi-objective optimization and broader netlist testing in FPGA placement, with potential impact on faster, more scalable EDA workflows.
Abstract
This paper introduces the problem of learning to place logic blocks in Field-Programmable Gate Arrays (FPGAs) and a learning-based method. In contrast to previous search-based placement algorithms, we instead employ Reinforcement Learning (RL) with the goal of minimizing wirelength. In addition to our preliminary learning results, we also evaluated a novel decomposition to address the nature of large search space when placing many blocks on a chipboard. Empirical experiments evaluate the effectiveness of the learning and decomposition paradigms on FPGA placement tasks.
