Machine learning applications in archaeological practices: a review
Mathias Bellat, Jordy D. Orellana Figueroa, Jonathan S. Reeves, Ruhollah Taghizadeh-Mehrjardi, Claudio Tennie, Thomas Scholten
TL;DR
Problem: limited understanding of how ML methods are applied across archaeological subfields and the methodological pitfalls that accompany their use. Approach: a rapid systematic review of 135 publications (1997–2022) with a broad taxonomy of ML families, archaeological subfields, inputs, and evaluation metrics, plus a proposed workflow for coherent, transparent practice. Findings: ANNs and ensemble methods dominate, with automatic structure detection and artefact classification as the most common tasks; ML uptake varies by subfield and data type, and methodological issues—especially around reporting, data availability, and interpretability—are repeatedly highlighted. Significance: the study provides a practical framework and best-practice recommendations to improve rigor, reproducibility, and collaboration in applying ML to archaeology, while acknowledging limitations and ethical considerations.
Abstract
Artificial intelligence and machine learning applications in archaeology have increased significantly in recent years, and these now span all subfields, geographical regions, and time periods. The prevalence and success of these applications have remained largely unexamined, as recent reviews on the use of machine learning in archaeology have only focused only on specific subfields of archaeology. Our review examined an exhaustive corpus of 135 articles published between 1997 and 2022. We observed a significant increase in the number of publications from 2019 onwards. Automatic structure detection and artefact classification were the most represented tasks in the articles reviewed, followed by taphonomy, and archaeological predictive modelling. From the review, clustering and unsupervised methods were underrepresented compared to supervised models. Artificial neural networks and ensemble learning account for two thirds of the total number of models used. However, if machine learning models are gaining in popularity they remain subject to misunderstanding. We observed, in some cases, poorly defined requirements and caveats of the machine learning methods used. Furthermore, the goals and the needs of machine learning applications for archaeological purposes are in some cases unclear or poorly expressed. To address this, we proposed a workflow guide for archaeologists to develop coherent and consistent methodologies adapted to their research questions, project scale and data. As in many other areas, machine learning is rapidly becoming an important tool in archaeological research and practice, useful for the analyses of large and multivariate data, although not without limitations. This review highlights the importance of well-defined and well-reported structured methodologies and collaborative practices to maximise the potential of applications of machine learning methods in archaeology.
