RowDetr: End-to-End Crop Row Detection Using Polynomials

Rahul Harsha Cheppally; Ajay Sharda

RowDetr: End-to-End Crop Row Detection Using Polynomials

Rahul Harsha Cheppally, Ajay Sharda

TL;DR

RowDetr tackles GPS-denied under-canopy crop-row navigation by introducing a end-to-end transformer-based framework that represents rows as polynomials. It couples a lightweight backbone, a hybrid encoder, polynomial proposals, PolySampler, and a multi-scale deformable attention decoder, optimized with the PolyOptLoss energy-based objective and Hungarian one-to-one matching. The method achieves state-of-the-art accuracy (up to TuSimple F1 = 0.74) and low latency (as fast as 3.5 ms with INT8 on Jetson Orin AGX) across diverse crops and conditions, enabling robust edge deployment. The work demonstrates the practical impact of polynomial parameterization for real-time, occlusion-prone, and curved crop rows in autonomous agricultural robotics, with strong potential for deployment in GPS-denied field operations.

Abstract

Crop row detection enables autonomous robots to navigate in gps denied environments. Vision based strategies often struggle in the environments due to gaps, curved crop rows and require post-processing steps. Furthermore, labeling crop rows in under the canopy environments accurately is very difficult due to occlusions. This study introduces RowDetr, an efficient end-to-end transformer-based neural network for crop row detection in precision agriculture. RowDetr leverages a lightweight backbone and a hybrid encoder to model straight, curved, or occluded crop rows with high precision. Central to the architecture is a novel polynomial representation that enables direct parameterization of crop rows, eliminating computationally expensive post-processing. Key innovations include a PolySampler module and multi-scale deformable attention, which work together with PolyOptLoss, an energy-based loss function designed to optimize geometric alignment between predicted and the annotated crop rows, while also enhancing robustness against labeling noise. RowDetr was evaluated against other state-of-the-art end-to-end crop row detection methods like AgroNav and RolColAttention on a diverse dataset of 6,962 high-resolution images, used for training, validation, and testing across multiple crop types with annotated crop rows. The system demonstrated superior performance, achieved an F1 score up to 0.74 and a lane position deviation as low as 0.405. Furthermore, RowDetr achieves a real-time inference latency of 6.7ms, which was optimized to 3.5ms with INT8 quantization on an NVIDIA Jetson Orin AGX. This work highlighted the critical efficiency of polynomial parameterization, making RowDetr particularly suitable for deployment on edge computing devices in agricultural robotics and autonomous farming equipment. Index terms > Crop Row Detection, Under Canopy Navigation, Transformers, RT-DETR, RT-DETRv2

RowDetr: End-to-End Crop Row Detection Using Polynomials

TL;DR

Abstract

RowDetr: End-to-End Crop Row Detection Using Polynomials

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)