Curating art exhibitions using machine learning
Eurico Covas
TL;DR
The paper tackles the problem of curating art exhibitions by teaching AI to imitate human curators using a Met Museum–based dataset of past exhibitions. It compares four modeling approaches, ranging from self-contained text statistics to OpenAI embeddings and a GPT‑4o-mini fine-tuned system, all trained on ($x$, $y$) pairs where $x$ is the exhibition text and $y$ encodes artworks metadata. Across 80/20 train/validation splits, embedding-based methods—especially the full OpenAI GPT‑style mapping—show the strongest signal, achieving high generalised-tag accuracy for several metadata fields and improved, though imperfect, artwork hit rates. The results demonstrate that modest-size models can approach, and in some metrics exceed, random baselines and that richer embeddings and fine-tuning yield meaningful improvements, with potential applications in AI-assisted or virtual exhibition curation. Limitations include hallucinations from large language models and data constraints, underscoring the need for robust validation when deploying in real galleries.
Abstract
Here we present a series of artificial models - a total of four related models - based on machine learning techniques that attempt to learn from existing exhibitions which have been curated by human experts, in order to be able to do similar curatorship work. Out of our four artificial intelligence models, three achieve a reasonable ability at imitating these various curators responsible for all those exhibitions, with various degrees of precision and curatorial coherence. In particular, we can conclude two key insights: first, that there is sufficient information in these exhibitions to construct an artificial intelligence model that replicates past exhibitions with an accuracy well above random choices; and second, that using feature engineering and carefully designing the architecture of modest size models can make them almost as good as those using the so-called large language models such as GPT in a brute force approach.
