Few-Shot Fruit Segmentation via Transfer Learning
Jordan A. James, Heather K. Manching, Amanda M. Hulse-Kemp, William J. Beksi
TL;DR
The paper addresses the challenge of few-shot semantic segmentation for in-field fruits under limited labeled data. It introduces a specialized pre-training strategy using the CitDet citrus dataset to bootstrap learning for apples and transfers this knowledge to the MinneApple target task. A light-weight three-branch decoder (Spatial, Context, ADB) with BAG fusion learns fruit shapes and boundaries to enable effective transfer across domains. Evaluations on MinneApple show improved few-shot and zero-shot performance and highlight boundary refinement, signaling practical potential for automated fruit harvesting and yield estimation in agriculture.
Abstract
Advancements in machine learning, computer vision, and robotics have paved the way for transformative solutions in various domains, particularly in agriculture. For example, accurate identification and segmentation of fruits from field images plays a crucial role in automating jobs such as harvesting, disease detection, and yield estimation. However, achieving robust and precise infield fruit segmentation remains a challenging task since large amounts of labeled data are required to handle variations in fruit size, shape, color, and occlusion. In this paper, we develop a few-shot semantic segmentation framework for infield fruits using transfer learning. Concretely, our work is aimed at addressing agricultural domains that lack publicly available labeled data. Motivated by similar success in urban scene parsing, we propose specialized pre-training using a public benchmark dataset for fruit transfer learning. By leveraging pre-trained neural networks, accurate semantic segmentation of fruit in the field is achieved with only a few labeled images. Furthermore, we show that models with pre-training learn to distinguish between fruit still on the trees and fruit that have fallen on the ground, and they can effectively transfer the knowledge to the target fruit dataset.
