Exploring Model Quantization in GenAI-based Image Inpainting and Detection of Arable Plants
Sourav Modak, Ahmet Oğuz Saltık, Anthony Stein
TL;DR
The paper addresses data diversity and on-device compute constraints in GenAI-based weed detection. It introduces a progressive Stable Diffusion inpainting data-augmentation pipeline, augmented by post-training quantization (FP16/INT8), to expand training samples by up to 200%. Evaluations on YOLO11(l) and RT-DETR(l) show quantization effects are model- and augmentation-dependent, with inpainting helping recover accuracy under lower precision and enabling edge deployment on Jetson Orin Nano. The work demonstrates practical feasibility and highlights directions to expand quantization strategies and annotation strategies for robust field deployment.
Abstract
Deep learning-based weed control systems often suffer from limited training data diversity and constrained on-board computation, impacting their real-world performance. To overcome these challenges, we propose a framework that leverages Stable Diffusion-based inpainting to augment training data progressively in 10% increments -- up to an additional 200%, thus enhancing both the volume and diversity of samples. Our approach is evaluated on two state-of-the-art object detection models, YOLO11(l) and RT-DETR(l), using the mAP50 metric to assess detection performance. We explore quantization strategies (FP16 and INT8) for both the generative inpainting and detection models to strike a balance between inference speed and accuracy. Deployment of the downstream models on the Jetson Orin Nano demonstrates the practical viability of our framework in resource-constrained environments, ultimately improving detection accuracy and computational efficiency in intelligent weed management systems.
