Table of Contents
Fetching ...

Enforcing Hard Linear Constraints in Deep Learning Models with Decision Rules

Gonzalo E. Constante-Flores, Hao Chen, Can Li

TL;DR

This work tackles enforcing hard, input-dependent linear constraints on neural network outputs by introducing a two-subnetwork architecture: a task network for prediction and a safe network that guarantees feasibility. The outputs are merged via a data-dependent convex combination $f_\psi(x) = (1-\alpha_\psi(x)) f^{\mathrm{TN}}_\theta(x) + \alpha_\psi(x) f^{\mathrm{SN}}_\phi(x)$, with equality constraints ensured by projection and inequality constraints enforced for all inputs by the safe network learned through decision rules. The authors prove a universal approximation result for continuous constraint-satisfying functions and provide tractable offline formulations (SDP/LP) for linear decision rules to compute the safe network. Empirical results on DC-OPF and portfolio optimization demonstrate zero constraint violations, competitive objective values, and latency under 3 ms, significantly outperforming iterative methods in speed while maintaining feasibility. The framework offers a scalable, model-agnostic path to deploy constraint-compliant neural predictors in safety-critical domains, with clear directions for extending beyond linear and convex constraint families.

Abstract

Deep learning models are increasingly deployed in safety-critical tasks where predictions must satisfy hard constraints, such as physical laws, fairness requirements, or safety limits. However, standard architectures lack built-in mechanisms to enforce such constraints, and existing approaches based on regularization or projection are often limited to simple constraints, computationally expensive, or lack feasibility guarantees. This paper proposes a model-agnostic framework for enforcing input-dependent linear equality and inequality constraints on neural network outputs. The architecture combines a task network trained for prediction accuracy with a safe network trained using decision rules from the stochastic and robust optimization literature to ensure feasibility across the entire input space. The final prediction is a convex combination of the two subnetworks, guaranteeing constraint satisfaction during both training and inference without iterative procedures or runtime optimization. We prove that the architecture is a universal approximator of constrained functions and derive computationally tractable formulations based on linear decision rules. Empirical results on benchmark regression tasks show that our method consistently satisfies constraints while maintaining competitive accuracy and low inference latency.

Enforcing Hard Linear Constraints in Deep Learning Models with Decision Rules

TL;DR

This work tackles enforcing hard, input-dependent linear constraints on neural network outputs by introducing a two-subnetwork architecture: a task network for prediction and a safe network that guarantees feasibility. The outputs are merged via a data-dependent convex combination , with equality constraints ensured by projection and inequality constraints enforced for all inputs by the safe network learned through decision rules. The authors prove a universal approximation result for continuous constraint-satisfying functions and provide tractable offline formulations (SDP/LP) for linear decision rules to compute the safe network. Empirical results on DC-OPF and portfolio optimization demonstrate zero constraint violations, competitive objective values, and latency under 3 ms, significantly outperforming iterative methods in speed while maintaining feasibility. The framework offers a scalable, model-agnostic path to deploy constraint-compliant neural predictors in safety-critical domains, with clear directions for extending beyond linear and convex constraint families.

Abstract

Deep learning models are increasingly deployed in safety-critical tasks where predictions must satisfy hard constraints, such as physical laws, fairness requirements, or safety limits. However, standard architectures lack built-in mechanisms to enforce such constraints, and existing approaches based on regularization or projection are often limited to simple constraints, computationally expensive, or lack feasibility guarantees. This paper proposes a model-agnostic framework for enforcing input-dependent linear equality and inequality constraints on neural network outputs. The architecture combines a task network trained for prediction accuracy with a safe network trained using decision rules from the stochastic and robust optimization literature to ensure feasibility across the entire input space. The final prediction is a convex combination of the two subnetworks, guaranteeing constraint satisfaction during both training and inference without iterative procedures or runtime optimization. We prove that the architecture is a universal approximator of constrained functions and derive computationally tractable formulations based on linear decision rules. Empirical results on benchmark regression tasks show that our method consistently satisfies constraints while maintaining competitive accuracy and low inference latency.

Paper Structure

This paper contains 23 sections, 4 theorems, 43 equations, 1 figure, 6 tables.

Key Result

Theorem 1

Let $\mathcal{X} \subset \mathbb{R}^k$ be compact, and let $f^*: \mathcal{X} \rightarrow \mathbb{R}^n$ be a continuous function such that $f^*(x) \in \mathcal{C}(x)$ for all $x \in \mathcal{X}$, where $\mathcal{C}(x) \subset \mathbb{R}^n$ is a convex, input-dependent constraint set. Suppose Assumpti satisfies $f_\psi(x) \in \mathcal{C}(x)$ for all $x \in \mathcal{X}$, and

Figures (1)

  • Figure 1: Proposed framework.

Theorems & Definitions (7)

  • Theorem 1: Universal Approximation with Constrained Output
  • Lemma 1: Homogeneous S-Lemma Ben-Tal1998
  • Proposition 1: Kuhn2011
  • Proposition 2: Kuhn2011
  • proof : Proof of Theorem \ref{['thm:uat']}
  • proof : Proof of Proposition \ref{['prop:sdp_duality']}
  • proof : Proof of Proposition \ref{['prop:lp_duality']}