Keypoint Semantic Integration for Improved Feature Matching in Outdoor Agricultural Environments

Rajitha de Silva; Jonathan Cox; Marija Popovic; Cesar Cadena; Cyrill Stachniss; Riccardo Polvara

Keypoint Semantic Integration for Improved Feature Matching in Outdoor Agricultural Environments

Rajitha de Silva, Jonathan Cox, Marija Popovic, Cesar Cadena, Cyrill Stachniss, Riccardo Polvara

TL;DR

This work addresses perceptual aliasing in outdoor vineyard perception by embedding semantic context into keypoint descriptors (KSI). A modular pipeline combines semantic-instance embeddings from panoptic masks with standard descriptors, enabling a single-pass, heterogeneous descriptor matching that improves robustness for relative pose estimation and long-term visual localisation across seasonal changes. The SemanticBLT dataset supports training segmentation models for vineyard objects, and comprehensive ablations justify design choices such as addition-based semantic integration, selective normalisation, and heterogeneous matching. Findings show consistent improvements in semantically meaningful regions and practical runtime on both desktop and embedded hardware, highlighting substantial impact for vineyard robotics and similar repetitive-domain environments.

Abstract

Robust robot navigation in outdoor environments requires accurate perception systems capable of handling visual challenges such as repetitive structures and changing appearances. Visual feature matching is crucial to vision-based pipelines but remains particularly challenging in natural outdoor settings due to perceptual aliasing. We address this issue in vineyards, where repetitive vine trunks and other natural elements generate ambiguous descriptors that hinder reliable feature matching. We hypothesise that semantic information tied to keypoint positions can alleviate perceptual aliasing by enhancing keypoint descriptor distinctiveness. To this end, we introduce a keypoint semantic integration technique that improves the descriptors in semantically meaningful regions within the image, enabling more accurate differentiation even among visually similar local features. We validate this approach in two vineyard perception tasks: (i) relative pose estimation and (ii) visual localisation. Across all tested keypoint types and descriptors, our method improves matching accuracy by 12.6%, demonstrating its effectiveness over multiple months in challenging vineyard conditions.

Keypoint Semantic Integration for Improved Feature Matching in Outdoor Agricultural Environments

TL;DR

Abstract

Keypoint Semantic Integration for Improved Feature Matching in Outdoor Agricultural Environments

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)