Table of Contents
Fetching ...

Approximating Spatial Distance Through Confront Networks: Application to the Segmentation of Medieval Avignon

Margot Ferrand, Vincent Labatut

TL;DR

The paper tackles the challenge of analyzing medieval urban space when source data are partial and imprecise by modeling relative spatial relations as confront networks and extracting graph representations from terrier-type land records. It develops a modular, graph-based pipeline with systematic and optional steps, and two objective criteria (data coverage and distance fidelity) to select the best graph among 16 variants. The optimal approach (extended, flat relations, with splitting of long streets and selective removal of non-punctual objects) demonstrates improved proximity encoding and yields a meaningful community structure that aligns with historical urban organization. The resulting insights into Avignon's urban segmentation demonstrate the method's value for historical spatial analysis and provide openly available data and code for reuse and extension.

Abstract

In historical studies, the older the sources, the more common it is to have access to data that are only partial, and/or unreliable or imprecise. This can make it difficult, or even impossible, to perform certain tasks of interest, such as the segmentation of some urban space based on the location of its constituting elements. Indeed, traditional approaches to tackle this specific task require knowing the position of all these elements before clustering them. Yet, alternative information is sometimes available, which can be leveraged to address this challenge. For instance, in the Middle Ages, land registries typically do not provide exact addresses, but rather locate spatial objects relative to each other, e.g. x being to the North of y. Spatial graphs are particularly adapted to model such spatial relationships, called confronts, which is why we propose their use over standard tabular databases. However, historical data are rich and allow extracting confront networks in many ways, making the process non-trivial. In this article, we propose several extraction methods and compare them to identify the most appropriate. We postulate that the best candidate must constitute an optimal trade-off between covering as much of the original data as possible, and providing the best graph-based approximation of spatial distance. Leveraging a dataset that describes Avignon during its papal period, we show empirically that the best results require ignoring some of the information present in the original historical sources, and that including additional information from secondary sources significantly improves the confront network. We illustrate the relevance of our method by partitioning the best graph that we extracted, and discussing its community structure in terms of urban space organization, from a historical perspective. Our data and source code are both publicly available online.

Approximating Spatial Distance Through Confront Networks: Application to the Segmentation of Medieval Avignon

TL;DR

The paper tackles the challenge of analyzing medieval urban space when source data are partial and imprecise by modeling relative spatial relations as confront networks and extracting graph representations from terrier-type land records. It develops a modular, graph-based pipeline with systematic and optional steps, and two objective criteria (data coverage and distance fidelity) to select the best graph among 16 variants. The optimal approach (extended, flat relations, with splitting of long streets and selective removal of non-punctual objects) demonstrates improved proximity encoding and yields a meaningful community structure that aligns with historical urban organization. The resulting insights into Avignon's urban segmentation demonstrate the method's value for historical spatial analysis and provide openly available data and code for reuse and extension.

Abstract

In historical studies, the older the sources, the more common it is to have access to data that are only partial, and/or unreliable or imprecise. This can make it difficult, or even impossible, to perform certain tasks of interest, such as the segmentation of some urban space based on the location of its constituting elements. Indeed, traditional approaches to tackle this specific task require knowing the position of all these elements before clustering them. Yet, alternative information is sometimes available, which can be leveraged to address this challenge. For instance, in the Middle Ages, land registries typically do not provide exact addresses, but rather locate spatial objects relative to each other, e.g. x being to the North of y. Spatial graphs are particularly adapted to model such spatial relationships, called confronts, which is why we propose their use over standard tabular databases. However, historical data are rich and allow extracting confront networks in many ways, making the process non-trivial. In this article, we propose several extraction methods and compare them to identify the most appropriate. We postulate that the best candidate must constitute an optimal trade-off between covering as much of the original data as possible, and providing the best graph-based approximation of spatial distance. Leveraging a dataset that describes Avignon during its papal period, we show empirically that the best results require ignoring some of the information present in the original historical sources, and that including additional information from secondary sources significantly improves the confront network. We illustrate the relevance of our method by partitioning the best graph that we extracted, and discussing its community structure in terms of urban space organization, from a historical perspective. Our data and source code are both publicly available online.

Paper Structure

This paper contains 41 sections, 17 figures, 6 tables.

Figures (17)

  • Figure 1: Top: Example of declaration retrieved from a terrier, Vaucluse Departmental Archives, 1G10 f.9v. It includes the original text (first frame) and its English translation (second frame). Each color represents a different piece of information: tenant (red), property (orange), location (blue, 5 different confronts here), and fees (green). Italics denote entities of interest. Diagram available at http://doi.org/10.5281/zenodo.14175830 under CC-BY license. Bottom: Terrier of Bishop Anglic Grimoard, Vaucluse Departmental Archives, 1G10 f.1.
  • Figure 2: Left: Density map of properties (declared and undeclared) in our dataset; location by interpolation using the grid method. Right: seven parishes of Avignon, and main geological landmarks. Plots available at http://doi.org/10.5281/zenodo.14175830 under CC-BY license.
  • Figure 3: Straightforward extraction of a graph, from our database. Each colored shape represents a spatial object (e.g. a building). The edges depend on the spatial relations described in the historical sources. Figure available at http://doi.org/10.5281/zenodo.14175830 under CC-BY license.
  • Figure 4: The three strategies proposed to handle 1- and 2-dimensional objects such as the street in this figure (shown in brown): removing it by not representing it at all in the graph, keeping it as it is by modeling it through a single vertex, or splitting it and representing its constituting pieces with several linearly connected vertices. Figure available at http://doi.org/10.5281/zenodo.14175830 under CC-BY license.
  • Figure 5: The two strategies proposed to handle hierarchical relationships, such as a parish containing eight buildings in this figure: keep them and make the graph hierarchical, or remove them and make it flat. The dotted edges show the hierarchical relationships. Figure available at http://doi.org/10.5281/zenodo.14175830 under CC-BY license.
  • ...and 12 more figures