Towards AI-Guided Open-World Ecological Taxonomic Classification
Cheng Yaw Low, Heejoon Koo, Jaewoo Park, Kaleb Mesfin Asfaw, Meeyoung Cha
TL;DR
The paper tackles open-world ecological taxonomy by addressing the intertwined challenges of long-tailed distributions, fine-grained differentiation, spatiotemporal domain shifts, and open-set taxa in plant classification. It introduces TaxoNet, a domain-specific embedding backbone equipped with a dual-margin penalization loss and a norm-guided sampling strategy to balance learning signals across head and tail taxa, supplemented by adaptive margin scaling and open-set awareness. The approach yields significant macro-recall gains on three diverse plant datasets, improves domain generalization, and enhances unseen-taxa rejection, while also demonstrating limitations of general-purpose multimodal models in fine-grained plant taxonomy. Together, these contributions advance scalable, open-world plant biodiversity monitoring and motivate future development of domain-specialized multimodal foundations for ecological reasoning.
Abstract
AI-guided classification of ecological families, genera, and species underpins global sustainability efforts such as biodiversity monitoring, conservation planning, and policy-making. Progress toward this goal is hindered by long-tailed taxonomic distributions from class imbalance, along with fine-grained taxonomic variations, test-time spatiotemporal domain shifts, and closed-set assumptions that can only recognize previously seen taxa. We introduce the Open-World Ecological Taxonomy Classification, a unified framework that captures the co-occurrence of these challenges in realistic ecological settings. To address them, we propose TaxoNet, an embedding-based encoder with a dual-margin penalization loss that strengthens learning signals from rare underrepresented taxa while mitigating the dominance of overrepresented ones, directly confronting interrelated challenges. We evaluate our method on diverse ecological domains: Google Auto-Arborist (urban trees), iNat-Plantae (Plantae observations from various ecosystems in iNaturalist-2019), and NAFlora-Mini (a curated herbarium collection). Our model consistently outperforms baselines, particularly for rare taxa, establishing a strong foundation for open-world plant taxonomic monitoring. Our findings further show that general-purpose multimodal foundation models remain constrained in plant-domain applications.
