Struc2mapGAN: improving synthetic cryo-EM density maps with generative adversarial networks
Chenwei Zhang, Anne Condon, Khanh Dao Duc
TL;DR
This work tackles the challenge of generating high-fidelity synthetic cryo-EM density maps from molecular structures, where traditional simulation-based methods fail to reproduce complex features such as SSEs. It introduces struc2mapGAN, a data-driven 3D GAN with a nested U-Net++ generator trained against curated experimental-like targets using a SmoothL1Loss term, achieving superior correlation and structural similarity to real maps while remaining computationally efficient. Through a carefully designed preprocessing, augmentation, and tile-based training pipeline, the model outperforms conventional simulators across multiple metrics (SSIM, ChimeraX correlation, PCC) and demonstrates robust performance across resolutions, with map generation times suitable for real-time workflows. The work suggests future directions in resolution-conditioned generation and diffusion/attention-based enhancements, and highlights potential applications in template-based particle picking and integration with structure predictions from AlphaFold.
Abstract
Generating synthetic cryogenic electron microscopy 3D density maps from molecular structures has potential important applications in structural biology. Yet existing simulation-based methods cannot mimic all the complex features present in experimental maps, such as secondary structure elements. As an alternative, we propose struc2mapGAN, a novel data-driven method that employs a generative adversarial network to produce improved experimental-like density maps from molecular structures. More specifically, struc2mapGAN uses a nested U-Net architecture as the generator, with an additional L1 loss term and further processing of raw training experimental maps to enhance learning efficiency. While struc2mapGAN can promptly generate maps after training, we demonstrate that it outperforms existing simulation-based methods for a wide array of tested maps and across various evaluation metrics.
