Segmenting Dead Sea Scroll Fragments for a Scientific Image Set
Bronson Brown-deVost, Berat Kurar-Barakat, Nachum Dershowitz
TL;DR
The paper tackles the challenge of segmenting Dead Sea Scroll fragments from IAA image archives, where bars, labels, a black background, and backing substrates complicate standard segmentation. It introduces a multi-step, tailor-made pipeline that combines Faster R-CNN bar detection with image-alignment, infrared thresholding, and backing-substrate masking to produce accurate fragment masks, followed by contour extraction and filtering. A public evaluation dataset of 139 fragments with train/validation/test splits and ground-truth WKT annotations enables reproducible performance assessment, yielding high mean IoU ($0.97$) and precision/recall around $0.98$–$0.99$. The workflow is designed to be generalizable to other manuscript collections and provides a practical foundation for digital joins, reconstruction, and glyph-level analysis while promoting standards for benchmarking in fragment segmentation.
Abstract
This paper presents a customized pipeline for segmenting manuscript fragments from images curated by the Israel Antiquities Authority (IAA). The images present challenges for standard segmentation methods due to the presence of the ruler, color, and plate number bars, as well as a black background that resembles the ink and varying backing substrates. The proposed pipeline, consisting of four steps, addresses these challenges by isolating and solving each difficulty using custom tailored methods. Further, the usage of a multi-step pipeline will surely be helpful from a conceptual standpoint for other image segmentation projects that encounter problems that have proven intractable when applying any of the more commonly used segmentation techniques. In addition, we create a dataset with bar detection and fragment segmentation ground truth and evaluate the pipeline steps qualitatively and quantitatively on it. This dataset is publicly available to support the development of the field. It aims to address the lack of standard sets of fragment images and evaluation metrics and enable researchers to evaluate their methods in a reliable and reproducible manner.
