ICDAR2017 Competition on Reading Chinese Text in the Wild (RCTW-17)
Baoguang Shi, Cong Yao, Minghui Liao, Mingkun Yang, Pei Xu, Linyan Cui, Serge Belongie, Shijian Lu, Xiang Bai
TL;DR
This paper presents the ICDAR 2017 Competition on Reading Chinese Text in the Wild (RCTW-17) and the CTW-12k dataset, a large-scale Chinese scene-text corpus annotated with four-point polygons and UTF-8 transcripts. It defines two tasks—text localization and end-to-end recognition—and introduces polygon-based IoU and edit-distance–based evaluation protocols (AED/NED) to assess performance. The study reports submissions from 19 teams, analyzes top-performing approaches, and discusses common challenges such as long-text detection, perspective distortion, and confusable characters. The work establishes a foundation for Chinese text reading research in natural images and outlines plans for ongoing online evaluation and dataset refinement.
Abstract
Chinese is the most widely used language in the world. Algorithms that read Chinese text in natural images facilitate applications of various kinds. Despite the large potential value, datasets and competitions in the past primarily focus on English, which bares very different characteristics than Chinese. This report introduces RCTW, a new competition that focuses on Chinese text reading. The competition features a large-scale dataset with 12,263 annotated images. Two tasks, namely text localization and end-to-end recognition, are set up. The competition took place from January 20 to May 31, 2017. 23 valid submissions were received from 19 teams. This report includes dataset description, task definitions, evaluation protocols, and results summaries and analysis. Through this competition, we call for more future research on the Chinese text reading problem. The official website for the competition is http://rctw.vlrlab.net
