SynCellFactory: Generative Data Augmentation for Cell Tracking
Moritz Sturm, Lorenzo Cerrone, Fred A. Hamprecht
TL;DR
SynCellFactory tackles data scarcity in cell tracking by decoupling appearance and dynamics and using diffusion-based rendering to generate photorealistic, annotated cell videos. It employs a motion model to simulate cell populations and two specialized ControlNets (CN-Pos for accurate positioning and CN-Mov for temporal evolution) to produce sequences with pseudo ground-truth segmentation. Automated training minimizes domain knowledge, enabling scalable augmentation from a single annotated timelapse; experiments on seven 2D CTC datasets show TRA improvements for most cases and three official CTC results surpassing prior methods. The approach demonstrates the practical potential of generative AI for boosting deep learning-based cell tracking, while acknowledging limitations in complex scenes and long sequences that guide future enhancements.
Abstract
Cell tracking remains a pivotal yet challenging task in biomedical research. The full potential of deep learning for this purpose is often untapped due to the limited availability of comprehensive and varied training data sets. In this paper, we present SynCellFactory, a generative cell video augmentation. At the heart of SynCellFactory lies the ControlNet architecture, which has been fine-tuned to synthesize cell imagery with photorealistic accuracy in style and motion patterns. This technique enables the creation of synthetic yet realistic cell videos that mirror the complexity of authentic microscopy time-lapses. Our experiments demonstrate that SynCellFactory boosts the performance of well-established deep learning models for cell tracking, particularly when original training data is sparse.
