The Generalized Matrix Norm Problem

Adrian Kulmburg

The Generalized Matrix Norm Problem

Adrian Kulmburg

TL;DR

It is shown that the dual formulation of the problem of computing the operator norm of a matrix with respect to norms induced by linear operators can be expressed as a max-min problem which can be solved in polynomial time.

Abstract

We study the computability of the operator norm of a matrix with respect to norms induced by linear operators. Our findings reveal that this problem can be solved exactly in polynomial time in certain situations, and we discuss how it can be approximated in other cases. Along the way, we investigate the concept of push-forward and pull-back of seminorms, which leads us to uncover novel duality principles that come into play when optimizing over the unit ball of norms.

The Generalized Matrix Norm Problem

TL;DR

Abstract

Paper Structure (21 sections, 19 theorems, 117 equations, 5 figures)

This paper contains 21 sections, 19 theorems, 117 equations, 5 figures.

Introduction
Preliminaries
Basic Notation
Extended Seminorms
Duality
Entry-Wise Matrix Norms
Banach Spaces and the Banach Space Adjoint
Constrained Norm Optimization
Push-Forward and Pull-Back Seminorms
The Dual of the Push-Forward and Pull-Back
Tractable Matrix Norms
The Case p ∈ [1,oo], q = oo
The Case p = q = 2
Approximable Matrix Norms
The Case 1 < q < 2 < p < oo
...and 6 more sections

Key Result

Lemma 2.3

Let $\bm{\underline{{X}}}\in\mathbb{R}^{n\times m}$. For $p, q\in[1,\infty]$,

Figures (5)

Figure 1: An overview of the hardness results presented in Kulmburg2024, adapted to reflect the complexity of computing generalized $p\mapsto q;\bm{\underline{{B}}}$-norms. The complexity class $\mathsf{FPTAS}$ is the class of problems that can be approximated within arbitrary accuracy in polynomial time with respect to the representation size of $\bm{\underline{{A}}}$ and $\bm{\underline{{B}}}$ and the accuracy parameter. A solid boundary line means that the boundary is included in the region. For more information, we refer to Kulmburg2024. This graph was inspired by Bhattiprolu2023.
Figure 1: Numerical results for $q=1$ and $p=1.5$. The solid line corresponds to the theoretical worst case approximation ratio from Theorem \ref{['thm:main']}, while the dashed line corresponds to that of Corollary \ref{['cor:main_nesterov']}. Both theoretical bounds over-approximate the worst-case approximation ratio we measured experimentally.
Figure 2: An overview of our results from Sections \ref{['sec:tractable']} and \ref{['sec:approximable']} for the computability/approximability of generalized matrix norms. A solid boundary line means that the boundary is included in the region. Here, $C(p,l,m) = \min\{\gamma_p\sqrt{m},l^{1/p-1/2}\}/\gamma_1$, where $m$ and $l$ are the number of rows of $\bm{\underline{{A}}}$ and $\bm{\underline{{B}}}$, respectively, and $\gamma_s$ for $s\in [1,\infty)$ is the $s$-th root of the $s$-th absolute moment of a Gaussian random variable. This graph was inspired by Bhattiprolu2023.
Figure 2: Numerical results for $q=1$ and $p=2$. The semidefinite relaxation yields, as expected, a worst-case approximation ratio better than $\gamma_2/\gamma_1 = \sqrt{\pi/2}$. It also performs significantly better than the linear relaxation.
Figure 3: Numerical results for $q=1$ and $p=1$. Since, in this case, the exact result can be formulated as a zonotope containment problem, a larger number of cases could be evaluated.

Theorems & Definitions (48)

Definition 2.1: Extended Seminorms and Norms
Definition 2.2: $L_{p,q}$- and $L_{p,q}^\top$-norms
Lemma 2.3: Dual of $L_{p,q}$-norms
Proof 1
Definition 2.4: Banach Space Adjoint
Lemma 2.5
Proof 2
Proposition 3.1: Constrained Matrix Norm Optimization
Corollary 3.2
Proof 3
...and 38 more

The Generalized Matrix Norm Problem

TL;DR

Abstract

The Generalized Matrix Norm Problem

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (48)