Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting
Adrian B. Chłopowiec, Adam R. Chłopowiec, Krzysztof Galus, Wojciech Cebula, Martin Tabakov
TL;DR
The paper tackles limited data and severe class imbalance in capsule endoscopy by introducing two local lesion generation techniques: PBDA, which uses Poisson Blending to compositely insert lesions, and IIDA, which fine-tunes LaMa for lesion inpainting within healthy tissue. Together, these methods yield substantial performance gains on the Kvasir Capsule Dataset, achieving a macro F1-score of 33.07% and surpassing prior work by up to 7.84 percentage points, with IIDA providing strong standalone improvements and PBDA offering complementary gains. The combination of PBDA and IIDA sets a new benchmark for generative data augmentation in medical imaging, outperforming de novo GANs and diffusion models by focusing edits on localized regions and maintaining label integrity. This approach is particularly impactful for clinical contexts where data are scarce, as it enables robust augmentation with annotated synthetic lesions, improving lesion detection and classification while preserving interpretability and annotation ease.
Abstract
Limited medical imaging datasets challenge deep learning models by increasing risks of overfitting and reduced generalization, particularly in Generative Adversarial Networks (GANs), where discriminators may overfit, leading to training divergence. This constraint also impairs classification models trained on small datasets. Generative Data Augmentation (GDA) addresses this by expanding training datasets with synthetic data, although it requires training a generative model. We propose and evaluate two local lesion generation approaches to address the challenge of augmenting small medical image datasets. The first approach employs the Poisson Image Editing algorithm, a classical image processing technique, to create realistic image composites that outperform current state-of-the-art methods. The second approach introduces a novel generative method, leveraging a fine-tuned Image Inpainting GAN to synthesize realistic lesions within specified regions of real training images. A comprehensive comparison of the two proposed methods demonstrates that effective local lesion generation in a data-constrained setting allows for reaching new state-of-the-art results in capsule endoscopy lesion classification. Combination of our techniques achieves a macro F1-score of 33.07%, surpassing the previous best result by 7.84 percentage points (p.p.) on the highly imbalanced Kvasir Capsule Dataset, a benchmark for capsule endoscopy. To the best of our knowledge, this work is the first to apply a fine-tuned Image Inpainting GAN for GDA in medical imaging, demonstrating that an image-conditional GAN can be adapted effectively to limited datasets to generate high-quality examples, facilitating effective data augmentation. Additionally, we show that combining this GAN-based approach with classical image processing techniques further improves the results.
