Enhancing Ecological Monitoring with Multi-Objective Optimization: A Novel Dataset and Methodology for Segmentation Algorithms
Sophia J. Abraham, Jin Huang, Brandon RichardWebster, Michael Milford, Jonathan D. Hauenstein, Walter Scheirer
TL;DR
The paper introduces a 6,096-image aerial dataset of native and invasive grasses from the Bega Valley and a homotopy-based multi-objective fine-tuning framework that balances segmentation accuracy with contextual coherence, demonstrated on SAM. By integrating DiceCELoss and a smoothness loss with a gradually evolving homotopy parameter, it achieves robust segmentation in noisy ecological imagery and provides strong baselines across multiple architectures. The dataset includes field-collected imagery, auxiliary metadata, and a simplified grass/non-grass annotation protocol, enabling large-scale ecological monitoring research. Results show the proposed Multi-Objective SAM achieves top performance with efficient training, highlighting its practical potential for early detection and management of invasive grasses like African lovegrass. The work advances ecological CV by offering a high-quality dataset, a novel training paradigm, and actionable insights for real-world environmental monitoring and sustainable land management.
Abstract
We introduce a unique semantic segmentation dataset of 6,096 high-resolution aerial images capturing indigenous and invasive grass species in Bega Valley, New South Wales, Australia, designed to address the underrepresented domain of ecological data in the computer vision community. This dataset presents a challenging task due to the overlap and distribution of grass species, which is critical for advancing models in ecological and agronomical applications. Our study features a homotopy-based multi-objective fine-tuning approach that balances segmentation accuracy and contextual consistency, applicable to various models. By integrating DiceCELoss for pixel-wise classification and a smoothness loss for spatial coherence, this method evolves during training to enhance robustness against noisy data. Performance baselines are established through a case study on the Segment Anything Model (SAM), demonstrating its effectiveness. Our annotation methodology, emphasizing pen size, zoom control, and memory management, ensures high-quality dataset creation. The dataset and code will be made publicly available, aiming to drive research in computer vision, machine learning, and ecological studies, advancing environmental monitoring and sustainable development.
