Data-Driven Worker Activity Recognition and Efficiency Estimation in Manual Fruit Harvesting
Uddhav Bhattarai, Rajkishan Arikapudi, Steven A. Fennimore, Frank N Martin, Stavros G. Vougioukas
TL;DR
The paper addresses inefficiencies in manual strawberry harvesting by deploying instrumented picking carts that record harvested mass, location, and motion data, and by developing a CNN-LSTM-based activity recognizer that labels periods as 'Pick' or 'NoPick'. The recognizer's output is used to estimate picker efficiency and tray-fill times. On season-long data, the average picker efficiency (the share of time spent actively harvesting) was about 75%, estimated with about 95% accuracy, and tray-fill time was estimated with about 96% accuracy. Key contributions include a robust data annotation pipeline, a compact CNN-LSTM architecture with a U-shaped encoder and temporal LSTM modules, and a publicly released, richly annotated dataset for further research. The practical impact lies in improved labor management, optimized collection-station logistics, and enhanced yield-aware harvest planning in commercial strawberry production.
Abstract
Manual fruit harvesting is common in agriculture, but the amount of time pickers spend on non-productive activities can make it very inefficient. Accurately identifying picking vs. non-picking activity is crucial for estimating picker efficiency and optimising labour management and harvest processes. In this study, a practical system was developed to calculate the efficiency of pickers in commercial strawberry harvesting. Instrumented picking carts were developed to record the harvested fruit weight, geolocation, and cart movement in real time. These carts were deployed during the commercial strawberry harvest season in Santa Maria, CA. The collected data was then used to train a CNN-LSTM-based deep neural network to classify a picker's activity into "Pick" and "NoPick" classes. Experimental evaluations showed that the CNN-LSTM model achieved promising activity recognition performance, with an F1 score of over 0.97. The recognition results were then used to compute picker efficiency and the time required to fill a tray. Analysis of the season-long harvest data showed that the average picker efficiency was 75.07%, with an estimation accuracy of 95.22%. Furthermore, the average tray fill time was 6.79 minutes, with an estimation accuracy of 96.43%. When integrated into commercial harvesting, the proposed technology can aid growers in automatically monitoring worker activity and optimising harvests to reduce non-productive time and enhance overall harvest efficiency.
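As a rough illustration of how "Pick"/"NoPick" labels could be turned into the two reported quantities, the sketch below treats picker efficiency as the percentage of recorded time labelled "Pick" and tray fill time as the "Pick" time accumulated per tray. Both definitions are assumptions made for this example rather than the paper's exact formulas, and the function names, the `tray_ids` input, and the fixed sampling period are hypothetical.

```python
from collections import defaultdict
from typing import Hashable, Sequence


def picker_efficiency(labels: Sequence[str], sample_period_s: float = 1.0) -> float:
    """Percentage of recorded time labelled 'Pick' (assumed definition)."""
    if not labels:
        raise ValueError("labels must be non-empty")
    pick_time_s = sum(sample_period_s for label in labels if label == "Pick")
    total_time_s = sample_period_s * len(labels)
    return 100.0 * pick_time_s / total_time_s


def mean_tray_fill_minutes(
    labels: Sequence[str],
    tray_ids: Sequence[Hashable],
    sample_period_s: float = 1.0,
) -> float:
    """Mean 'Pick' time per tray, in minutes (assumed definition).

    tray_ids[i] is a hypothetical identifier of the tray being filled
    when sample i was recorded.
    """
    if len(labels) != len(tray_ids):
        raise ValueError("labels and tray_ids must have the same length")
    pick_seconds = defaultdict(float)
    for label, tray in zip(labels, tray_ids):
        if label == "Pick":
            pick_seconds[tray] += sample_period_s
    if not pick_seconds:
        raise ValueError("no 'Pick' samples found")
    mean_seconds = sum(pick_seconds.values()) / len(pick_seconds)
    return mean_seconds / 60.0


# Toy input: four samples taken one minute apart across two trays.
labels = ["Pick", "Pick", "Pick", "NoPick"]
tray_ids = [1, 1, 2, 2]
efficiency_pct = picker_efficiency(labels, sample_period_s=60.0)
fill_minutes = mean_tray_fill_minutes(labels, tray_ids, sample_period_s=60.0)
```

On the toy input, three of the four samples are "Pick", so the efficiency is 75%. Tray 1 accumulates two minutes of picking and tray 2 one minute, so the mean fill time is 1.5 minutes.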
