Oral Imaging for Malocclusion Issues Assessments: OMNI Dataset, Deep Learning Baselines and Benchmarking
Pujun Xue, Junyi Ge, Xiaotong Jiang, Siyang Song, Zijian Wu, Yupeng Huo, Weicheng Xie, Linlin Shen, Xiaoqin Zhou, Xiaofeng Liu, Min Gu
TL;DR
This work introduces the OMNI dataset, the first public multi-view dental image collection tailored for automated malocclusion diagnosis, comprising 4,166 RGB images from 384 participants across five views with ten annotated malocclusion categories. It benchmarks six deep learning baselines—three CNNs, two Transformers, and one GNN-based GraphTeethNet—using a COCO-format annotation pipeline and standard $mAP$ evaluation, providing a comprehensive baseline and revealing the potential of inter-tooth relational modeling. Key findings show Deformable DETR achieving strong performance, GraphTeethNet benefiting from multi-dimensional edge features, and the overall feasibility of DL methods for malocclusion localization and diagnosis on OMNI, albeit with dataset imbalance as a limitation. The dataset and benchmarks offer a valuable foundation for future dental AI research and pave the way for multi-view, automated, and scalable malocclusion assessment in clinical practice.
Abstract
Malocclusion is a major challenge in orthodontics, and its complex presentation and diverse clinical manifestations make accurate localization and diagnosis particularly important. Currently, one of the major shortcomings facing the field of dental image analysis is the lack of large-scale, accurately labeled datasets dedicated to malocclusion issues, which limits the development of automated diagnostics in the field of dentistry and leads to a lack of diagnostic accuracy and efficiency in clinical practice. Therefore, in this study, we propose the Oral and Maxillofacial Natural Images (OMNI) dataset, a novel and comprehensive dental image dataset aimed at advancing the study of analyzing dental images for issues of malocclusion. Specifically, the dataset contains 4166 multi-view images with 384 participants in data collection and annotated by professional dentists. In addition, we performed a comprehensive validation of the created OMNI dataset, including three CNN-based methods, two Transformer-based methods, and one GNN-based method, and conducted automated diagnostic experiments for malocclusion issues. The experimental results show that the OMNI dataset can facilitate the automated diagnosis research of malocclusion issues and provide a new benchmark for the research in this field. Our OMNI dataset and baseline code are publicly available at https://github.com/RoundFaceJ/OMNI.
