OregairuChar: A Benchmark Dataset for Character Appearance Frequency Analysis in My Teen Romantic Comedy SNAFU
Qi Sun, Dingju Zhou, Lina Zhang
TL;DR
Understanding narrative dynamics in anime through character appearance frequency, especially in stylized domains, is the core problem. The paper introduces OregairuChar, a dataset of 1600 frames with 2860 bounding boxes for $K=11$ main characters from My Teen Romantic Comedy SNAFU Season 3, and provides a baseline demo using YOLOv5 with an embedding-based identity refinement to enable temporal appearance analysis. Key contributions include the high-quality, temporally coherent dataset, comprehensive model evaluation on stylized content, and a demonstration of automated appearance-frequency analysis across episodes. This resource enables computational narrative studies, supporting development of temporally aware, character-centric detection and deeper exploration of narrative dynamics in stylized media.
Abstract
The analysis of character appearance frequency is essential for understanding narrative structure, character prominence, and story progression in anime. In this work, we introduce OregairuChar, a benchmark dataset designed for appearance frequency analysis in the anime series My Teen Romantic Comedy SNAFU. The dataset comprises 1600 manually selected frames from the third season, annotated with 2860 bounding boxes across 11 main characters. OregairuChar captures diverse visual challenges, including occlusion, pose variation, and inter-character similarity, providing a realistic basis for appearance-based studies. To enable quantitative research, we benchmark several object detection models on the dataset and leverage their predictions for fine-grained, episode-level analysis of character presence over time. This approach reveals patterns of character prominence and their evolution within the narrative. By emphasizing appearance frequency, OregairuChar serves as a valuable resource for exploring computational narrative dynamics and character-centric storytelling in stylized media.
