Recognizing Pneumonia in Real-World Chest X-rays with a Classifier Trained with Images Synthetically Generated by Nano Banana
Jiachuan Peng, Kyle Lam, Jianing Qiu
TL;DR
The paper investigates recognizing pneumonia in chest X-rays by training a classifier on synthetically generated CXRs produced by Nano Banana, testing its generalization on real-world datasets. A ResNet-50 model fine-tuned on cropped synthetic images achieves strong external validation metrics (AUROC up to 0.923 and 0.824; AUPR up to 0.913 and 0.900 on two datasets), demonstrating feasibility of synthetic data for medical AI development. The study highlights the importance of post-generation processing (cropping to remove watermarks) and shows that synthetic-data-trained models focus on clinically relevant lung regions, with additional analyses supporting discriminative feature learning. Nevertheless, limitations in prompt control, generalization to other domains, and regulatory/ethical considerations indicate that substantial validation and oversight are required before clinical deployment.
Abstract
We trained a classifier with synthetic chest X-ray (CXR) images generated by Nano Banana, the latest AI model for image generation and editing, released by Google. When directly applied to real-world CXRs having only been trained with synthetic data, the classifier achieved an AUROC of 0.923 (95% CI: 0.919 - 0.927), and an AUPR of 0.900 (95% CI: 0.894 - 0.907) in recognizing pneumonia in the 2018 RSNA Pneumonia Detection dataset (14,863 CXRs), and an AUROC of 0.824 (95% CI: 0.810 - 0.836), and an AUPR of 0.913 (95% CI: 0.904 - 0.922) in the Chest X-Ray dataset (5,856 CXRs). These external validation results on real-world data demonstrate the feasibility of this approach and suggest potential for synthetic data in medical AI development. Nonetheless, several limitations remain at present, including challenges in prompt design for controlling the diversity of synthetic CXR data and the requirement for post-processing to ensure alignment with real-world data. However, the growing sophistication and accessibility of medical intelligence will necessitate substantial validation, regulatory approval, and ethical oversight prior to clinical translation.
