MaizeEar-SAM: Zero-Shot Maize Ear Phenotyping

Hossein Zaremehrjerdi; Lisa Coffey; Talukder Jubery; Huyu Liu; Jon Turkus; Kyle Linders; James C. Schnable; Patrick S. Schnable; Baskar Ganapathysubramanian

MaizeEar-SAM: Zero-Shot Maize Ear Phenotyping

Hossein Zaremehrjerdi, Lisa Coffey, Talukder Jubery, Huyu Liu, Jon Turkus, Kyle Linders, James C. Schnable, Patrick S. Schnable, Baskar Ganapathysubramanian

TL;DR

MaizeEar-SAM tackles the labor-intensive measurement of maize yield components by automating kernels-per-row counting with a zero-shot segmentation framework. The method employs the Segment Anything Model (SAM) for kernel masking and a graph-theoretic shortest-path approach to identify an in-ear kernel row, combined with a multi-path averaging strategy to improve robustness. The contributions include an annotation-free workflow, formalizing kernels-per-row through a graph-based definition and releasing open-source code; evaluated on the High-Intensity Phenotyping Sites (HIPS) dataset with sub-second to second-level per-ear timing on a high-end GPU, enabling thousands of ears phenotyped per day. This work reduces subjectivity in trait measurement, supports scalable data collection for GWAS and breeding, and broadens accessibility of frugal, high-throughput phenotyping.

Abstract

Quantifying the variation in yield component traits of maize (Zea mays L.), which together determine the overall productivity of this globally important crop, plays a critical role in plant genetics research, plant breeding, and the development of improved farming practices. Grain yield per acre is calculated by multiplying the number of plants per acre, ears per plant, number of kernels per ear, and the average kernel weight. The number of kernels per ear is determined by the number of kernel rows per ear multiplied by the number of kernels per row. Traditional manual methods for measuring these two traits are time-consuming, limiting large-scale data collection. Recent automation efforts using image processing and deep learning encounter challenges such as high annotation costs and uncertain generalizability. We tackle these issues by exploring Large Vision Models for zero-shot, annotation-free maize kernel segmentation. By using an open-source large vision model, the Segment Anything Model (SAM), we segment individual kernels in RGB images of maize ears and apply a graph-based algorithm to calculate the number of kernels per row. Our approach successfully identifies the number of kernels per row across a wide range of maize ears, showing the potential of zero-shot learning with foundation vision models combined with image processing techniques to improve automation and reduce subjectivity in agronomic data collection. All our code is open-sourced to make these affordable phenotyping methods accessible to everyone.

MaizeEar-SAM: Zero-Shot Maize Ear Phenotyping

TL;DR

Abstract

MaizeEar-SAM: Zero-Shot Maize Ear Phenotyping

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)