Training-set-free two-stage deep learning for spectroscopic data de-noising
Dongchen Huang, Junde Liu, Tian Qian, Hongming Weng
TL;DR
This paper tackles the challenge of denoising spectroscopic data without using training data by introducing a training-set-free, two-stage unsupervised framework. It combines an adaptive prior from principal component pursuit (low-rank plus sparse decomposition with $L+S=I$) and a CNN-based encoder–decoder to model clean spectra, complemented by a small network for sparse noise and trained with an AdamW optimizer using a discrepant learning-rate schedule. Empirically, it achieves substantial speedups (around 4×) and preserves key spectral features on ARPES data (e.g., FeSe at the M point), with second-derivative and MDC analyses showing clearer band structures. The authors also provide theoretical insight, showing a benign landscape for the linearized model where all stationary points are either global minima or strict saddles, supporting efficient convergence and suggesting broader relevance to non-convex optimization in scientific imaging.
Abstract
De-noising is a prominent step in the spectra post-processing procedure. Previous machine learning-based methods are fast but mostly based on supervised learning and require a training set that may be typically expensive in real experimental measurements. Unsupervised learning-based algorithms are slow and require many iterations to achieve convergence. Here, we bridge this gap by proposing a training-set-free two-stage deep learning method. We show that the fuzzy fixed input in previous methods can be improved by introducing an adaptive prior. Combined with more advanced optimization techniques, our approach can achieve five times acceleration compared to previous work. Theoretically, we study the landscape of a corresponding non-convex linear problem, and our results indicates that this problem has benign geometry for first-order algorithms to converge.
