Table of Contents
Fetching ...

Horospherically Convex Optimization on Hadamard Manifolds Part I: Analysis and Algorithms

Christopher Criscitiello, Jungbin Kim

TL;DR

This work introduces horospherical convexity (h-convexity) as a curvature-tolerant generalization of Euclidean convexity for Hadamard manifolds, built from horoballs and Busemann functions. It provides an outer characterization, smoothness notions (L-h-smoothness), and Moreau envelopes, establishing a foundation for first-order optimization on $M$ with curvature-independent guarantees. The paper develops gradient, subgradient, and accelerated methods for sums of h-convex functions, achieving Euclidean-like rates and showing substantially faster behavior in hyperbolic spaces. It also demonstrates a curvature-independent ellipsoid-style approach via localization in hyperbolic space, and discusses practical applications to problems like Tyler’s M-estimator and Horn’s problem. The results suggest promising directions for curvature-insensitive optimization in scaling-type problems, with Part II extending applications and interpolation theory.

Abstract

Geodesic convexity (g-convexity) is a natural generalization of convexity to Riemannian manifolds. However, g-convexity lacks many desirable properties satisfied by Euclidean convexity. For instance, the natural notions of half-spaces and affine functions are themselves not g-convex. Moreover, recent studies have shown that the oracle complexity of geodesically convex optimization necessarily depends on the curvature of the manifold (Criscitiello and Boumal, 2022; Criscitiello and Boumal, 2023; Hamilton and Moitra, 2021), a computational bottleneck for several problems, e.g., tensor scaling. Recently, Lewis et al. (2024) addressed this challenge by proving curvature-independent convergence of subgradient descent, assuming horospherical convexity of the objective's sublevel sets. Using a similar idea, we introduce a generalization of convex functions to Hadamard manifolds, utilizing horoballs and Busemann functions as building blocks (as proxies for half-spaces and affine functions). We refer to this new notion as horospherical convexity (h-convexity). We provide algorithms for both nonsmooth and smooth h-convex optimization, which have curvature-independent guarantees exactly matching those from Euclidean space; this includes generalizations of subgradient descent and Nesterov's accelerated method. Motivated by applications, we extend these algorithms and their convergence rates to minimizing a sum of horospherically convex functions, assuming access to a weighted-Fréchet-mean oracle.

Horospherically Convex Optimization on Hadamard Manifolds Part I: Analysis and Algorithms

TL;DR

This work introduces horospherical convexity (h-convexity) as a curvature-tolerant generalization of Euclidean convexity for Hadamard manifolds, built from horoballs and Busemann functions. It provides an outer characterization, smoothness notions (L-h-smoothness), and Moreau envelopes, establishing a foundation for first-order optimization on with curvature-independent guarantees. The paper develops gradient, subgradient, and accelerated methods for sums of h-convex functions, achieving Euclidean-like rates and showing substantially faster behavior in hyperbolic spaces. It also demonstrates a curvature-independent ellipsoid-style approach via localization in hyperbolic space, and discusses practical applications to problems like Tyler’s M-estimator and Horn’s problem. The results suggest promising directions for curvature-insensitive optimization in scaling-type problems, with Part II extending applications and interpolation theory.

Abstract

Geodesic convexity (g-convexity) is a natural generalization of convexity to Riemannian manifolds. However, g-convexity lacks many desirable properties satisfied by Euclidean convexity. For instance, the natural notions of half-spaces and affine functions are themselves not g-convex. Moreover, recent studies have shown that the oracle complexity of geodesically convex optimization necessarily depends on the curvature of the manifold (Criscitiello and Boumal, 2022; Criscitiello and Boumal, 2023; Hamilton and Moitra, 2021), a computational bottleneck for several problems, e.g., tensor scaling. Recently, Lewis et al. (2024) addressed this challenge by proving curvature-independent convergence of subgradient descent, assuming horospherical convexity of the objective's sublevel sets. Using a similar idea, we introduce a generalization of convex functions to Hadamard manifolds, utilizing horoballs and Busemann functions as building blocks (as proxies for half-spaces and affine functions). We refer to this new notion as horospherical convexity (h-convexity). We provide algorithms for both nonsmooth and smooth h-convex optimization, which have curvature-independent guarantees exactly matching those from Euclidean space; this includes generalizations of subgradient descent and Nesterov's accelerated method. Motivated by applications, we extend these algorithms and their convergence rates to minimizing a sum of horospherically convex functions, assuming access to a weighted-Fréchet-mean oracle.

Paper Structure

This paper contains 62 sections, 32 theorems, 136 equations, 1 figure, 3 tables.

Key Result

Lemma 1

For any $p,x,y\in M$, the following inequality holds:

Figures (1)

  • Figure 1: The set $C$ is shaded gray.

Theorems & Definitions (57)

  • Lemma 1: Triangle comparison
  • proof
  • Example 1: Hyperbolic space
  • Example 2: Positive definite matrices.
  • Definition 1: Horospherically convex functions
  • Proposition 1
  • Proposition 2: Properties of horospherically convex functions
  • Lemma 2
  • proof
  • Remark 1
  • ...and 47 more