DogFLW: Dog Facial Landmarks in the Wild Dataset
George Martvel, Greta Abele, Annika Bremhorst, Chiara Canori, Nareed Farhat, Giulia Pedretti, Ilan Shimshoni, Anna Zamansky
TL;DR
DogFLW introduces a 46-point canine facial landmark dataset—DogFLW—comprising 3,274 in-the-wild images across 120 breeds, with bounding boxes and visibility flags for each landmark. The landmark scheme is grounded in dog facial musculature and DogFACS, and annotations were produced via a human-in-the-loop approach to drastically reduce labeling time. Benchmarks using the Ensemble Landmark Detector (ELD) with YOLOv8 and EfficientNetV2S show a full-training $NME_{iod}$ of 6.52, with ear regions and certain breeds posing substantial challenges. The work demonstrates the feasibility and value of robust canine facial analysis for emotion and welfare research, while identifying data diversity—particularly ear types and long-fur breeds—as essential directions for future improvements. Overall, DogFLW provides a foundational resource to advance canine affective computing and welfare monitoring through AI-driven landmark detection.
Abstract
Affective computing for animals is a rapidly expanding research area that is going deeper than automated movement tracking to address animal internal states, like pain and emotions. Facial expressions can serve to communicate information about these states in mammals. However, unlike human-related studies, there is a significant shortage of datasets that would enable the automated analysis of animal facial expressions. Inspired by the recently introduced Cat Facial Landmarks in the Wild dataset, presenting cat faces annotated with 48 facial anatomy-based landmarks, in this paper, we develop an analogous dataset containing 3,274 annotated images of dogs. Our dataset is based on a scheme of 46 facial anatomy-based landmarks. The DogFLW dataset is available from the corresponding author upon a reasonable request.
