TopoCellGen: Generating Histopathology Cell Topology with a Diffusion Model
Meilong Xu, Saumya Gupta, Xiaoling Hu, Chen Li, Shahira Abousamra, Dimitris Samaras, Prateek Prasanna, Chao Chen
TL;DR
TopoCellGen introduces a topology-aware diffusion framework for generating multi-class histopathology cell layouts by enforcing topological constraints via persistent homology. It combines a cell-count conditioned diffusion process with intra-class and inter-class topological regularizations and introduces TopoFD, a Fréchet-based metric on persistence diagrams, to quantify topological fidelity beyond traditional image-based metrics. The method is extended to layout-guided image generation and validated on BRCA-M2C and Lizard datasets, where it achieves superior FID, TopoFD, and downstream task performance, while reducing count errors. By aligning synthetic layouts with biologically meaningful topology and counts, the approach provides more realistic data augmentation and enhances interpretability for tumor microenvironment analyses.
Abstract
Accurately modeling multi-class cell topology is crucial in digital pathology, as it provides critical insights into tissue structure and pathology. The synthetic generation of cell topology enables realistic simulations of complex tissue environments, enhances downstream tasks by augmenting training data, aligns more closely with pathologists' domain knowledge, and offers new opportunities for controlling and generalizing the tumor microenvironment. In this paper, we propose a novel approach that integrates topological constraints into a diffusion model to improve the generation of realistic, contextually accurate cell topologies. Our method refines the simulation of cell distributions and interactions, increasing the precision and interpretability of results in downstream tasks such as cell detection and classification. To assess the topological fidelity of generated layouts, we introduce a new metric, Topological Frechet Distance (TopoFD), which overcomes the limitations of traditional metrics like FID in evaluating topological structure. Experimental results demonstrate the effectiveness of our approach in generating multi-class cell layouts that capture intricate topological relationships. Code is available at https://github.com/Melon-Xu/TopoCellGen.
