Table of Contents
Fetching ...

Nonparametric Exponential Family Regression Under Star-Shaped Constraints

Guanghong Yi, Matey Neykov

TL;DR

This work studies the minimax estimation problem for the natural parameter $\boldsymbol{\theta}$ in nonparametric exponential family regression under star-shaped parameter constraints $K \subset [-M,M]^n$. It develops a universal tree-based estimator, deriving matching lower and upper bounds via KL-divergence/Fano arguments and local entropy, culminating in a minimax rate of $\varepsilon^{*2} \wedge d^2$ with $\varepsilon^{*}=\sup\{\varepsilon: \varepsilon^2 \kappa(M) \le \log N^{\operatorname{loc}}(\varepsilon, c)\}$. An illustrative Bernoulli monotone example demonstrates regime-specific rates: $\varepsilon^2 \asymp n^{1/3}$ for $q=1$, $\varepsilon^2 \asymp \sqrt{n}\log n$ for $q=2$, and $\varepsilon^2 \asymp n^{1-1/q}$ for $q\ge 3$. The results extend minimax theory from Gaussian sequence models to general smooth exponential families under star-shaped constraints, with implications for nonparametric regression under complex shape constraints and local-geometric complexity.

Abstract

We study the minimax rate of estimation in nonparametric exponential family regression under star-shaped constraints. Specifically, the parameter space $K$ is a star-shaped set contained within a bounded box $[-M, M]^n$, where $M$ is a known positive constant. Moreover, we assume that the exponential family is nonsingular and that its cumulant function is twice continuously differentiable. Our main result shows that the minimax rate for this problem is $\varepsilon^{*2} \wedge \operatorname{diam}(K)^2$, up to absolute constants, where $\varepsilon^*$ is defined as \[ \varepsilon^* = \sup \{\varepsilon: \varepsilon^2 κ(M) \leq \log N^{\operatorname{loc}}(\varepsilon)\}, \] with $N^{\operatorname{loc}}(\varepsilon)$ denoting the local entropy and $κ(M)$ is an absolute constant allowed to depend on $M$. We also provide an example and derive its corresponding minimax optimal rate.

Nonparametric Exponential Family Regression Under Star-Shaped Constraints

TL;DR

This work studies the minimax estimation problem for the natural parameter in nonparametric exponential family regression under star-shaped parameter constraints . It develops a universal tree-based estimator, deriving matching lower and upper bounds via KL-divergence/Fano arguments and local entropy, culminating in a minimax rate of with . An illustrative Bernoulli monotone example demonstrates regime-specific rates: for , for , and for . The results extend minimax theory from Gaussian sequence models to general smooth exponential families under star-shaped constraints, with implications for nonparametric regression under complex shape constraints and local-geometric complexity.

Abstract

We study the minimax rate of estimation in nonparametric exponential family regression under star-shaped constraints. Specifically, the parameter space is a star-shaped set contained within a bounded box , where is a known positive constant. Moreover, we assume that the exponential family is nonsingular and that its cumulant function is twice continuously differentiable. Our main result shows that the minimax rate for this problem is , up to absolute constants, where is defined as with denoting the local entropy and is an absolute constant allowed to depend on . We also provide an example and derive its corresponding minimax optimal rate.

Paper Structure

This paper contains 18 sections, 16 theorems, 86 equations, 1 algorithm.

Key Result

Lemma 2.1

The KL divergence between two parameters $\theta, \theta' \in [-M,M]^n$ in any nonsingular exponential family distributions with twice continuously differentiable cumulant function satisfies: Here $c(M)$ and $C(M)$ are positive constants allowed to depend on $M$.

Theorems & Definitions (24)

  • Definition 1.1: $\delta$-covering Set
  • Definition 1.2: $\delta$-packing Set
  • Definition 1.3: Local Entropy
  • Definition 1.4: Star-Shaped Set
  • Lemma 2.1
  • Theorem 2.2: Fano's Inequality
  • Lemma 2.3
  • Definition 2.4
  • Definition 2.5
  • Lemma 2.6: Restatement of prasadan2024informationtheoreticlimitsrobust Lemma 3.1
  • ...and 14 more