Synthesizing Forestry Images Conditioned on Plant Phenotype Using a Generative Adversarial Network
Debasmita Pal, Arun Ross
TL;DR
This work tackles the challenge of visualizing forest sites with controllable plant phenotypes by generating synthetic forestry imagery conditioned on a continuous GCC attribute over a ROI describing a vegetation type. It introduces a CGAN framework with ROI masking and continuous conditioning, augmented by self-attention and spectral normalization, and evaluates the realism and phenotypic fidelity of generated images using SSIM and FID, while measuring GCC and predicting RCC via RMSPE. The study demonstrates site-specific modeling on NEON PhenoCam data (Harvard and Bartlett), cross-site transfer, and scalability to other vegetation types, showing the model can produce diverse, phenotype-consistent images and infer additional attributes from synthetic samples. The proposed approach offers a pathway for visualizing phenology-driven forestry scenarios and could inform ecological analysis and remote sensing interpretation, with future work aimed at diffusion-based generation and incorporating broader environmental parameters.
Abstract
Plant phenology and phenotype prediction using remote sensing data are increasingly gaining attention within the plant science community as a promising approach to enhance agricultural productivity. This work focuses on generating synthetic forestry images that satisfy certain phenotypic attributes, viz. canopy greenness. We harness a Generative Adversarial Network (GAN) to synthesize biologically plausible and phenotypically stable forestry images conditioned on the greenness of vegetation (a continuous attribute) over a specific region of interest, describing a particular vegetation type in a mixed forest. The training data is based on the automated digital camera imagery provided by the National Ecological Observatory Network (NEON) and processed by the PhenoCam Network. Our method helps render the appearance of forest sites specific to a greenness value. The synthetic images are subsequently utilized to predict another phenotypic attribute, viz., redness of plants. The quality of the synthetic images is assessed using the Structural SIMilarity (SSIM) index and Fréchet Inception Distance (FID). Further, the greenness and redness indices of the synthetic images are compared against those of the original images using Root Mean Squared Percentage Error (RMSPE) to evaluate their accuracy and integrity. The generalizability and scalability of our proposed GAN model are established by effectively transforming it to generate synthetic images for other forest sites and vegetation types. From a broader perspective, this approach could be leveraged to visualize forestry based on different phenotypic attributes in the context of various environmental parameters.
