A unified framework for hard and soft clustering with regularized optimal transport

Jean-Frédéric Diebold; Nicolas Papadakis; Arnaud Dessein; Charles-Alban Deledalle

A unified framework for hard and soft clustering with regularized optimal transport

Jean-Frédéric Diebold, Nicolas Papadakis, Arnaud Dessein, Charles-Alban Deledalle

TL;DR

The convergence property of the generalized $\lambda-\text{EM}$ algorithms is studied and it is shown that each algorithmic step has a closed form solution when inferring finite mixture models of exponential families.

Abstract

In this paper, we formulate the problem of inferring a Finite Mixture Model from discrete data as an optimal transport problem with entropic regularization of parameter $λ\geq 0$. Our method unifies hard and soft clustering, the Expectation-Maximization (EM) algorithm being exactly recovered for $λ=1$. The family of clustering algorithm we propose rely on the resolution of nonconvex problems using alternating minimization. We study the convergence property of our generalized $λ-$EM algorithms and show that each step in the minimization process has a closed form solution when inferring finite mixture models of exponential families. Experiments highlight the benefits of taking a parameter $λ>1$ to improve the inference performance and $λ\to 0$ for classification.

A unified framework for hard and soft clustering with regularized optimal transport

TL;DR

The convergence property of the generalized

algorithms is studied and it is shown that each algorithmic step has a closed form solution when inferring finite mixture models of exponential families.

Abstract

In this paper, we formulate the problem of inferring a Finite Mixture Model from discrete data as an optimal transport problem with entropic regularization of parameter

. Our method unifies hard and soft clustering, the Expectation-Maximization (EM) algorithm being exactly recovered for

. The family of clustering algorithm we propose rely on the resolution of nonconvex problems using alternating minimization. We study the convergence property of our generalized

EM algorithms and show that each step in the minimization process has a closed form solution when inferring finite mixture models of exponential families. Experiments highlight the benefits of taking a parameter

to improve the inference performance and

for classification.

A unified framework for hard and soft clustering with regularized optimal transport

TL;DR

Abstract

A unified framework for hard and soft clustering with regularized optimal transport

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (9)