Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning
Nitin Rai, Daeun, Choi, Nathan S. Boyd, Arnold W. Schumann
TL;DR
This review surveys how foundation models (FMs) are reshaping site-specific disease management in crops, with an emphasis on large language models (LLMs) and vision-language models (VLMs). It discusses how FMs enable multimodal perception, text-based reasoning, and interactive guidance, enabling new capabilities for targeted spraying and extension services. The paper highlights trends in the adoption of FMs, the rising prominence of VLMs over LLMs, and the nascent integration of reinforcement learning (RL), adaptive learning (AL), and digital twin (DT) frameworks to create closed-loop, feedback-driven SSDM. It also outlines key challenges—data quality, sim-to-real transfer, and safety—and opportunities for practical deployment through domain-adapted models, adapters, few-shot learning, and multi-farm DT networks. Collectively, the work maps a trajectory toward autonomous, data-driven crop protection that can adapt to diverse crops, diseases, and field conditions while reducing chemical inputs.
Abstract
Site-specific disease management (SSDM) in crops has advanced rapidly through machine and deep learning (ML and DL) for real-time computer vision. Research evolved from handcrafted feature extraction to large-scale automated feature learning. With foundation models (FMs), crop disease datasets are now processed in fundamentally new ways. Unlike traditional neural networks, FMs integrate visual and textual data, interpret symptoms in text, reason about symptom-management relationships, and support interactive QA for growers and educators. Adaptive and imitation learning in robotics further enables field-based disease management. This review screened approx. 40 articles on FM applications for SSDM, focusing on large-language models (LLMs) and vision-language models (VLMs), and discussing their role in adaptive learning (AL), reinforcement learning (RL), and digital twin frameworks for targeted spraying. Key findings: (a) FMs are gaining traction with surging literature in 2023-24; (b) VLMs outpace LLMs, with a 5-10x increase in publications; (c) RL and AL are still nascent for smart spraying; (d) digital twins with RL can simulate targeted spraying virtually; (e) addressing the sim-to-real gap is critical for real-world deployment; (f) human-robot collaboration remains limited, especially in human-in-the-loop approaches where robots detect early symptoms and humans validate uncertain cases; (g) multi-modal FMs with real-time feedback will drive next-gen SSDM. For updates, resources, and contributions, visit, https://github.com/nitin-dominic/AgriPathogenDatabase, to submit papers, code, or datasets.
