Table of Contents
Fetching ...

Empirical risk minimization algorithm for multiclass classification of S.D.E. paths

Christophe Denis, Eddy Ella Mintsa

TL;DR

This work develops a diffusion-path multiclass classifier that treats the drift as class-discriminative and the diffusion as shared but unknown. By formulating an empirical risk minimization problem with a squared (L2) loss and modeling drift and diffusion via B-splines, the authors obtain consistent ERM predictors and derive rates that depend on Hölder smoothness; a margin condition yields fast rates in the binary case. They also introduce an adaptive version to select spline dimensions data-dependently, and demonstrate strong numerical performance on simulated diffusion data, often surpassing plug-in and depth-based baselines. The results advance nonparametric diffusion-path classification and provide practical tools for prediction with diffusion-process features, with potential extensions to time-inhomogeneous and high-dimensional settings.

Abstract

We address the multiclass classification problem for stochastic diffusion paths, assuming that the classes are distinguished by their drift functions, while the diffusion coefficient remains common across all classes. In this setting, we propose a classification algorithm that relies on the minimization of the L 2 risk. We establish rates of convergence for the resulting predictor. Notably, we introduce a margin assumption under which we show that our procedure can achieve fast rates of convergence. Finally, a simulation study highlights the numerical performance of our classification algorithm.

Empirical risk minimization algorithm for multiclass classification of S.D.E. paths

TL;DR

This work develops a diffusion-path multiclass classifier that treats the drift as class-discriminative and the diffusion as shared but unknown. By formulating an empirical risk minimization problem with a squared (L2) loss and modeling drift and diffusion via B-splines, the authors obtain consistent ERM predictors and derive rates that depend on Hölder smoothness; a margin condition yields fast rates in the binary case. They also introduce an adaptive version to select spline dimensions data-dependently, and demonstrate strong numerical performance on simulated diffusion data, often surpassing plug-in and depth-based baselines. The results advance nonparametric diffusion-path classification and provide practical tools for prediction with diffusion-process features, with potential extensions to time-inhomogeneous and high-dimensional settings.

Abstract

We address the multiclass classification problem for stochastic diffusion paths, assuming that the classes are distinguished by their drift functions, while the diffusion coefficient remains common across all classes. In this setting, we propose a classification algorithm that relies on the minimization of the L 2 risk. We establish rates of convergence for the resulting predictor. Notably, we introduce a margin assumption under which we show that our procedure can achieve fast rates of convergence. Finally, a simulation study highlights the numerical performance of our classification algorithm.

Paper Structure

This paper contains 38 sections, 14 theorems, 201 equations, 1 figure, 4 tables.

Key Result

Proposition 2.4

Under Assumption ass:Novikov, we obtain from denis2020consistent that with, for each $k \in [K]$,

Figures (1)

  • Figure 1: Visual description of the considered models. Model 1 on the left, Model 2 in the middle, and Model 3 on the right.

Theorems & Definitions (22)

  • Proposition 2.4
  • Proposition 2.5: zhang2004statistical
  • Theorem 3.1
  • Corollary 3.2
  • Corollary 4.1
  • Proposition 4.3
  • Theorem 4.4
  • Theorem 5.1
  • Proposition A.1: denis2024nonparametric
  • Lemma A.2: denis2024nonparametric
  • ...and 12 more