Temporal Image Caption Retrieval Competition -- Description and Results

Jakub Pokrywka, Piotr Wierzchoń, Kornel Weryszko, Krzysztof Jassem

TL;DR

This paper addresses the multimodal challenge of text-image retrieval, introduces a novel task that extends the modalities to include temporal data, and analyzes the delivered dataset and the process of its creation.

Abstract

Multimodal models, which combine visual and textual information, have recently gained significant recognition. This paper addresses the multimodal challenge of Text-Image retrieval and introduces a novel task that extends the modalities to include temporal data. The Temporal Image Caption Retrieval Competition (TICRC) presented in this paper is based on the Chronicling America and Challenging America projects, which offer access to an extensive collection of digitized historic American newspapers spanning 274 years. In addition to the competition results, we provide an analysis of the delivered dataset and the process of its creation.

Paper Structure (18 sections, 1 equation, 7 figures, 2 tables)


Figures (7)

  • Figure 1: Sample picture with a caption above. This picture comes from a newspaper issue dated Jan 11, 1928.
  • Figure 2: Sample input picture
  • Figure 3: Picture selected on the whole page.
  • Figure 4: Testing distribution over the years
  • Figure 6: Word and character counts per caption in the testing dataset
  • ...and 2 more figures