Improving Performance in Combinatorial Optimization Problems with Inequality Constraints: An Evaluation of the Unbalanced Penalization Method on D-Wave Advantage
J. A. Montanez-Barrera, Pim van den Heuvel, Dennis Willsch, Kristel Michielsen
TL;DR
The paper addresses the challenge of encoding inequality constraints in combinatorial optimization for quantum hardware by evaluating unbalanced penalization as an alternative to slack variables. It applies the method to the DFJ traveling salesman problem, mapping to QUBO and Ising formulations and testing on D-Wave Advantage, the D-Wave hybrid solver, and classical solvers. Results show that unbalanced penalization outperforms slack variables on quantum hardware and remains competitive on classical solvers, enabling larger instances (up to 45 cities) and reducing qubit-connectivity demands. This work strengthens the case for practical quantum optimization by providing a robust inequality-constraint encoding that improves scalability and accelerates benchmarking of quantum devices.
Abstract
Combinatorial optimization problems are one of the target applications of current quantum technology, mainly because of their industrial relevance, the difficulty of solving large instances of them classically, and their equivalence to Ising Hamiltonians using the quadratic unconstrained binary optimization (QUBO) formulation. Many of these applications have inequality constraints, usually encoded as penalization terms in the QUBO formulation using additional variables known as slack variables. The slack variables have two disadvantages: (i) these variables extend the search space of optimal and suboptimal solutions, and (ii) the variables add extra qubits and connections to the quantum algorithm. Recently, a new method known as unbalanced penalization has been presented to avoid using slack variables. This method offers a trade-off between additional slack variables to ensure that the optimal solution is given by the ground state of the Ising Hamiltonian, and using an unbalanced heuristic function to penalize the region where the inequality constraint is violated with the only certainty that the optimal solution will be in the vicinity of the ground state. This work tests the unbalanced penalization method using real quantum hardware on D-Wave Advantage for the traveling salesman problem (TSP). The results show that the unbalanced penalization method outperforms the solutions found using slack variables and sets a new record for the largest TSP solved with quantum technology.
