Classification of freshwater snails of the genus Radomaniola with multimodal triplet networks
Dennis Vetter, Muhammad Ahsan, Diana Delicado, Thomas A. Neubauer, Thomas Wilke, Gemma Roig
TL;DR
This study tackles the problem of classifying Radomaniola freshwater snails under a small, imbalanced, multi-class setting with subtle visual differences. It introduces a multimodal triplet-network framework that fuses images, shell measurements, and genetic distances, employing offline triplet mining and a dynamic margin to learn meaningful embeddings that support accurate classification. The approach achieves expert-level accuracy (mean >98.5%) and remains effective with limited data, suggesting practical utility for fieldwork and taxonomic workflows. By combining transfer learning, multimodal fusion, and similarity-based learning, the paper demonstrates a scalable tool that can accelerate species identification while preserving biological relevance and potential explainability in collaboration with domain experts.
Abstract
In this paper, we present our first proposal of a machine learning system for the classification of freshwater snails of the genus Radomaniola. We elaborate on the specific challenges encountered during system design, and how we tackled them; namely a small, very imbalanced dataset with a high number of classes and high visual similarity between classes. We then show how we employed triplet networks and the multiple input modalities of images, measurements, and genetic information to overcome these challenges and reach a performance comparable to that of a trained domain expert.
