Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark

Jinhao Li; Zijian Chen; Runze Jiang; Tingzhu Chen; Changbo Wang; Guangtao Zhai

Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark

Jinhao Li, Zijian Chen, Runze Jiang, Tingzhu Chen, Changbo Wang, Guangtao Zhai

TL;DR

This work tackles long-tail bias in oracle bone inscription recognition by constructing Oracle-P15K, a structure-aligned dataset with 14,542 images and expert-crafted glyphs. It then introduces OBIDiff, a diffusion-based generator with glyph and style encoders and a CLIP-based style representation to transfer rubbing textures while preserving glyph structure. The dataset and model are shown to improve OBI generation quality, enhance downstream recognition and denoising performance, and provide a realistic benchmark for four noise types. The results, along with user preference studies, support the viability of structure-aligned data and controllable synthesis for cultural heritage restoration and misinformation mitigation.

Abstract

The oracle bone inscription (OBI) recognition plays a significant role in understanding the history and culture of ancient China. However, the existing OBI datasets suffer from a long-tail distribution problem, leading to biased performance of OBI recognition models across majority and minority classes. With recent advancements in generative models, OBI synthesis-based data augmentation has become a promising avenue to expand the sample size of minority classes. Unfortunately, current OBI datasets lack large-scale structure-aligned image pairs for generative model training. To address these problems, we first present the Oracle-P15K, a structure-aligned OBI dataset for OBI generation and denoising, consisting of 14,542 images infused with domain knowledge from OBI experts. Second, we propose a diffusion model-based pseudo OBI generator, called OBIDiff, to achieve realistic and controllable OBI generation. Given a clean glyph image and a target rubbing-style image, it can effectively transfer the noise style of the original rubbing to the glyph image. Extensive experiments on OBI downstream tasks and user preference studies show the effectiveness of the proposed Oracle-P15K dataset and demonstrate that OBIDiff can accurately preserve inherent glyph structures while transferring authentic rubbing styles effectively.

Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark

TL;DR

Abstract

Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (14)