Coordinates from Context: Using LLMs to Ground Complex Location References

Tessa Masis; Brendan O'Connor

Coordinates from Context: Using LLMs to Ground Complex Location References

Tessa Masis, Brendan O'Connor

TL;DR

This paper tackles grounding compositional location references by ground-truthing bounding boxes through a two-part system: a recaller that maps mentioned places to coordinates and a reasoner that predicts the final bounding box $b_l$ from a description $d_l$. It demonstrates that LLMs possess stronger geospatial reasoning than knowledge, and introduces a geoparsed-augmented end-to-end approach that leverages external coordinates to improve grounding, achieving state-of-the-art results on the GeoCoDe dataset with relatively small fine-tuned models. Bounding boxes provide a practical grounding for locations not present in geographic databases, enabling meaningful area-overlap metrics and more robust grounding of large-scale regions. Limitations include evaluation on a single, English-language dataset and reliance on third-party geoparsers, underscoring the need for broader, multilingual benchmarks and privacy-preserving tooling.

Abstract

Geocoding is the task of linking a location reference to an actual geographic location and is essential for many downstream analyses of unstructured text. In this paper, we explore the challenging setting of geocoding compositional location references. Building on recent work demonstrating LLMs' abilities to reason over geospatial data, we evaluate LLMs' geospatial knowledge versus reasoning skills relevant to our task. Based on these insights, we propose an LLM-based strategy for geocoding compositional location references. We show that our approach improves performance for the task and that a relatively small fine-tuned LLM can achieve comparable performance with much larger off-the-shelf models.

Coordinates from Context: Using LLMs to Ground Complex Location References

TL;DR

from a description

. It demonstrates that LLMs possess stronger geospatial reasoning than knowledge, and introduces a geoparsed-augmented end-to-end approach that leverages external coordinates to improve grounding, achieving state-of-the-art results on the GeoCoDe dataset with relatively small fine-tuned models. Bounding boxes provide a practical grounding for locations not present in geographic databases, enabling meaningful area-overlap metrics and more robust grounding of large-scale regions. Limitations include evaluation on a single, English-language dataset and reliance on third-party geoparsers, underscoring the need for broader, multilingual benchmarks and privacy-preserving tooling.

Coordinates from Context: Using LLMs to Ground Complex Location References

TL;DR

Abstract

Coordinates from Context: Using LLMs to Ground Complex Location References

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (2)