Machine Learning Analysis of Anomalous Diffusion
Wenjie Cai, Yi Hu, Xiang Qu, Hui Zhao, Gongyi Wang, Jing Li, Zihan Huang
TL;DR
This paper surveys the integration of machine learning with anomalous diffusion analysis, emphasizing two core avenues: single-trajectory characterization (parameter inference and segmentation) and representation learning (predefined feature-based fingerprints, penultimate-layer embeddings, and autoencoder latent representations). It contrasts classical ML with deep learning, illustrates diverse architectures (CNNs, RNNs, GNNs), and discusses benchmarking through the AnDi Challenge, including its emphasis on segmentation in the 2024 version. The work highlights the strengths and limitations of each representation strategy, demonstrates how learned representations can generalize across models and conditions, and advocates a move toward open datasets, richer simulators, and hybrid, interpretable approaches. Collectively, the review maps a roadmap for applying AI to statistical physics and biophysics, with practical impact on accurately decoding diffusion mechanisms in complex systems.
Abstract
The rapid advancements in machine learning have made its application to anomalous diffusion analysis both essential and inevitable. This review systematically introduces the integration of machine learning techniques for enhanced analysis of anomalous diffusion, focusing on two pivotal aspects: single trajectory characterization via machine learning and representation learning of anomalous diffusion. We extensively compare various machine learning methods, including both classical machine learning and deep learning, used for the inference of diffusion parameters and trajectory segmentation. Additionally, platforms such as the Anomalous Diffusion Challenge that serve as benchmarks for evaluating these methods are highlighted. On the other hand, we outline three primary strategies for representing anomalous diffusion: the combination of predefined features, the feature vector from the penultimate layer of neural network, and the latent representation from the autoencoder, analyzing their applicability across various scenarios. This investigation paves the way for future research, offering valuable perspectives that can further enrich the study of anomalous diffusion and advance the application of artificial intelligence in statistical physics and biophysics.
