CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge

Chen Chen, Zehua Liu, Xiaolou Li, Lantian Li, Dong Wang

TL;DR

This paper comprehensively reviews the challenge, encompassing the data profile, task specifications, and baseline system construction, and summarises the representative techniques employed by the submitted systems, highlighting the most effective approaches.

Abstract

The first Chinese Continuous Visual Speech Recognition Challenge aimed to probe the performance of Large Vocabulary Continuous Visual Speech Recognition (LVC-VSR) on two tasks: (1) Single-speaker VSR for a particular speaker and (2) Multi-speaker VSR for a set of registered speakers. The challenge yielded highly successful results, with the best submission significantly outperforming the baseline, particularly in the single-speaker task. This paper comprehensively reviews the challenge, encompassing the data profile, task specifications, and baseline system construction. It also summarises the representative techniques employed by the submitted systems, highlighting the most effective approaches. Additional information and resources about this challenge can be accessed through the official website at http://cnceleb.org/competition.
