NeuroPapyri: A Deep Attention Embedding Network for Handwritten Papyri Retrieval

Giuseppe De Gregorio; Simon Perrin; Rodrigo C. G. Pena; Isabelle Marthot-Santaniello; Harold Mouchère

TL;DR

This paper addresses interpretable retrieval of historical papyri images by introducing NeuroPapyri, an embedding network for handwritten Greek papyri that combines a CNN with multi-head attention. The model is trained with a weighted combination of an attention loss and a triplet loss, $\mathrm{Loss} = w_1 \mathrm{Loss}_A + w_2 \mathrm{Loss}_T$ with $w_1 + w_2 = 1$, to produce discriminative embeddings and visualizable attention maps that help paleographers understand model decisions. Across the synthetic AL-PUBv2 and ICDAR2023 Iliad-based datasets, NeuroPapyri shows strong character-identification performance and state-of-the-art document retrieval (Top-1 accuracy up to 96.57% and F1@1 around 94.00). The attention visualizations support interpretability and collaboration between historians and computer scientists; extensions to writer identification and dating of papyri are left to future work.
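The weighted combination above can be illustrated with a minimal sketch. It assumes a standard Euclidean triplet margin loss for $\mathrm{Loss}_T$ and treats $\mathrm{Loss}_A$ as an already-computed scalar; the function names, the margin value, and the exact form of the triplet loss are illustrative assumptions, not details taken from the paper.

```python
import math


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Triplet margin loss on Euclidean distances (assumed form, not the paper's)."""
    d_pos = math.dist(anchor, positive)  # anchor to same-class sample
    d_neg = math.dist(anchor, negative)  # anchor to different-class sample
    return max(0.0, d_pos - d_neg + margin)


def combined_loss(loss_a, loss_t, w1):
    """Loss = w1 * Loss_A + w2 * Loss_T, with w2 fixed by w1 + w2 = 1."""
    if not 0.0 <= w1 <= 1.0:
        raise ValueError("w1 must lie in [0, 1] so that w1 + w2 = 1")
    w2 = 1.0 - w1
    return w1 * loss_a + w2 * loss_t


# Hypothetical values: the attention loss is simply a given number here.
loss_t = triplet_loss([0.0, 0.0], [3.0, 4.0], [0.0, 0.0])
total = combined_loss(loss_a=2.0, loss_t=loss_t, w1=0.25)
```

Setting `w1` to 0 or 1 recovers training on only one of the two losses, which is how a single weight can interpolate between the two objectives.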

Abstract

The intersection of computer vision and machine learning has emerged as a promising avenue for advancing historical research, facilitating a more profound exploration of our past. However, the application of machine learning approaches in historical palaeography is often met with criticism due to their perceived "black box" nature. In response to this challenge, we introduce NeuroPapyri, an innovative deep learning-based model specifically designed for the analysis of images containing ancient Greek papyri. To address concerns related to transparency and interpretability, the model incorporates an attention mechanism. This attention mechanism not only enhances the model's performance but also provides a visual representation of the image regions that contribute significantly to the decision-making process. Specifically calibrated for processing images of papyrus documents with lines of handwritten text, the model uses individual attention maps to indicate the presence or absence of specific characters in the input image. This paper presents the NeuroPapyri model, including its architecture and training methodology. Results from the evaluation demonstrate NeuroPapyri's efficacy in document retrieval, showcasing its potential to advance the analysis of historical manuscripts.
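As an illustration of how embedding-based document retrieval is commonly scored, the sketch below computes a leave-one-out Top-1 accuracy: each embedding queries all the others, and a hit is counted when the nearest neighbour carries the same document label. The leave-one-out protocol, the Euclidean distance, and the toy embeddings are assumptions for illustration and are not confirmed by the text above.

```python
import math


def top1_accuracy(embeddings, labels):
    """Leave-one-out Top-1 accuracy; needs at least two embeddings."""
    hits = 0
    for i, query in enumerate(embeddings):
        # Nearest neighbour among all other items (the query itself is excluded).
        best_j = min(
            (j for j in range(len(embeddings)) if j != i),
            key=lambda j: math.dist(query, embeddings[j]),
        )
        hits += labels[best_j] == labels[i]
    return hits / len(embeddings)


# Toy data: two well-separated documents plus one outlier labelled "A".
toy_embeddings = [[0.0, 0.0], [0.0, 1.0], [5.0, 5.0], [5.0, 6.0], [9.0, 9.0]]
toy_labels = ["A", "A", "B", "B", "A"]
score = top1_accuracy(toy_embeddings, toy_labels)
```

In this toy set the outlier's nearest neighbour belongs to document "B", so it is the only miss.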

Paper Structure (16 sections, 1 equation, 7 figures, 5 tables)

Introduction
Related Works
The Model
Retrieval of Original Papyrus
Datasets
Synthetic Dataset
ICDAR2023 Competition Dataset
Experimentation and Results
Character Identification
Synthetic dataset
ICDAR2023 competition dataset
Document Retrieval
Comparative Investigation
Ablative Study
Discussion
...and 1 more sections

Figures (7)

Figure 1: Main architecture of the NeuroPapyri model.
Figure 2: NeuroPapyri can be trained using two different losses. Attention loss focuses on attention maps while Triplet loss aims to train the model to solve the main target problem.
Figure 3: Image from the synthetic dataset. Transcription: $\Phi O O \Pi \Sigma M \Lambda \Lambda \Gamma N$.
Figure 4: a) Character distribution in the ICDAR2023 competition training set; b) Identification rate for each letter in the ICDAR2023 test set.
Figure 5: Attention maps for some text lines of the ICDAR2023 dataset (Seuret et al., 2023).
...and 2 more figures
