Optimization over Trained Neural Networks: Difference-of-Convex Algorithm and Application to Data Center Scheduling

Xinwei Liu; Vladimir Dvorkin

Optimization over Trained Neural Networks: Difference-of-Convex Algorithm and Application to Data Center Scheduling

Xinwei Liu, Vladimir Dvorkin

TL;DR

The paper tackles optimization problems where objectives or constraints are learned by neural networks, focusing on ReLU-based networks. It introduces a bilinear reformulation that penalizes ReLU violations in the objective and solves the resulting problem via a difference-of-convex (DC) algorithm, with a principled method to select the penalty parameter $ ho$. The approach is applied to data-center demand allocation in power grids, replacing a difficult bilevel/OPF model with an NN-embedded optimization that yields significant potential savings, demonstrated on small and large test systems. The results show effective convergence to stationary points and practical improvements in electricity costs under congested grid conditions. The work provides a practical framework for optimization over trained neural networks with strong potential impact on grid operations and other decision-making problems leveraging data-driven cost models.

Abstract

When solving decision-making problems with mathematical optimization, some constraints or objectives may lack analytic expressions but can be approximated from data. When an approximation is made by neural networks, the underlying problem becomes optimization over trained neural networks. Despite recent improvements with cutting planes, relaxations, and heuristics, the problem remains difficult to solve in practice. We propose a new solution based on a bilinear problem reformulation that penalizes ReLU constraints in the objective function. This reformulation makes the problem amenable to efficient difference-of-convex algorithms (DCA), for which we propose a principled approach to penalty selection that facilitates convergence to stationary points of the original problem. We apply the DCA to the problem of the least-cost allocation of data center electricity demand in a power grid, reporting significant savings in congested cases.

Optimization over Trained Neural Networks: Difference-of-Convex Algorithm and Application to Data Center Scheduling

TL;DR

Abstract

Optimization over Trained Neural Networks: Difference-of-Convex Algorithm and Application to Data Center Scheduling

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)

Theorems & Definitions (1)