Scalable Approximate Algorithms for Optimal Transport Linear Models
Tomasz Kacprzak, Francois Kamper, Michael W. Heiss, Gianluca Janka, Ann M. Dillner, Satoshi Takahama
TL;DR
This work tackles regression problems where the fit is evaluated through an entropy-regularized optimal transport loss, enabling a focus on shape perturbations rather than pure per-sample residuals. It introduces OTLM, a general, scalable framework that extends generalized linear models to include transport-based datafits and convex penalties, solved via Sinkhorn-like scaling with majorization-minimization steps. The authors derive practical MM updates for common penalties and demonstrate the method's scalability on large synthetic problems and real spectroscopic datasets, including muonic X-ray spectra and infrared spectroscopy. The approach offers a flexible, robust, and efficient alternative for linear modeling in contexts where mass transportation provides a more meaningful similarity than conventional Euclidean losses.
Abstract
Recently, linear regression models incorporating an optimal transport (OT) loss have been explored for applications such as supervised unmixing of spectra, music transcription, and mass spectrometry. However, these task-specific approaches often do not generalize readily to a broader class of linear models. In this work, we propose a novel algorithmic framework for solving a general class of non-negative linear regression models with an entropy-regularized OT datafit term, based on Sinkhorn-like scaling iterations. Our framework accommodates convex penalty functions on the weights (e.g. squared-$\ell_2$ and $\ell_1$ norms), and admits additional convex loss terms between the transported marginal and target distribution (e.g. squared error or total variation). We derive simple multiplicative updates for common penalty and datafit terms. This method is suitable for large-scale problems due to its simplicity of implementation and straightforward parallelization.
