Synthetic Art Generation and DeepFake Detection A Study on Jamini Roy Inspired Dataset
Kushal Agrawal, Romi Banerjee
TL;DR
This study investigates diffusion-based art generation within the Jamini Roy style and its implications for deepfake detection in non-Western art. By fine-tuning Stable Diffusion 3 with DreamBooth and augmenting generation with ControlNet and IPAdapter, the authors create a Jamini Roy–inspired dataset of real and AI-generated works and conduct thorough qualitative, quantitative, and frequency-domain analyses. They evaluate three state-of-the-art detectors, finding that diffusion-induced artifacts can be obscured by advanced generation techniques, though residual frequency-domain cues persist. The work highlights challenges in detecting culturally specific AI-generated art and lays groundwork for hybrid detection approaches and broader cross-cultural forensic research.
Abstract
The intersection of generative AI and art is a fascinating area that brings both exciting opportunities and significant challenges, especially when it comes to identifying synthetic artworks. This study takes a unique approach by examining diffusion-based generative models in the context of Indian art, specifically focusing on the distinctive style of Jamini Roy. To explore this, we fine-tuned Stable Diffusion 3 and used techniques like ControlNet and IPAdapter to generate realistic images. This allowed us to create a new dataset that includes both real and AI-generated artworks, which is essential for a detailed analysis of what these models can produce. We employed various qualitative and quantitative methods, such as Fourier domain assessments and autocorrelation metrics, to uncover subtle differences between synthetic images and authentic pieces. A key takeaway from recent research is that existing methods for detecting deepfakes face considerable challenges, especially when the deepfakes are of high quality and tailored to specific cultural contexts. This highlights a critical gap in current detection technologies, particularly in light of the challenges identified above, where high-quality and culturally specific deepfakes are difficult to detect. This work not only sheds light on the increasing complexity of generative models but also sets a crucial foundation for future research aimed at effective detection of synthetic art.
