FakET: Simulating Cryo-Electron Tomograms with Neural Style Transfer
Pavol Harar, Lukas Herrmann, Philipp Grohs, David Haselbach
TL;DR
This work tackles the scarcity of labeled cryoET data for particle localization and classification by introducing FakET, a Neural Style Transfer–based method that simulates the TEM forward operator using unlabeled reference data. FakET achieves data quality comparable to the SHREC physics-based simulator while offering a 750-fold speedup and 33-fold memory reduction, enabling large tilt-series generation without calibration. The authors validate FakET by training DeepFinder on faket-generated data, obtaining near-benchmark localization and substantially close classification performance, with further gains achievable via limited fine-tuning. The approach provides a practical, open-source solution for pre-training and evaluating neural networks in cryoET, with potential extensions to experimental data validation and domain-specific pre-training.
Abstract
In cryo-electron microscopy, accurate particle localization and classification are imperative. Recent deep learning solutions, though successful, require extensive training data sets. The protracted generation time of physics-based models, often employed to produce these data sets, limits their broad applicability. We introduce FakET, a method based on Neural Style Transfer, capable of simulating the forward operator of any cryo transmission electron microscope. It can be used to adapt a synthetic training data set according to reference data producing high-quality simulated micrographs or tilt-series. To assess the quality of our generated data, we used it to train a state-of-the-art localization and classification architecture and compared its performance with a counterpart trained on benchmark data. Remarkably, our technique matches the performance, boosts data generation speed 750 times, uses 33 times less memory, and scales well to typical transmission electron microscope detector sizes. It leverages GPU acceleration and parallel processing. The source code is available at https://github.com/paloha/faket.
