Table of Contents
Fetching ...

Active Learning for Multilingual Fingerspelling Corpora

Shuai Wang, Eric Nalisnick

TL;DR

The paper tackles data scarcity in sign-language recognition by coupling active learning with cross-language transfer on multilingual fingerspelling corpora. It demonstrates that active learning with variation ratios consistently yields gains over random sampling, with near full-data performance reachable from a small data fraction for most datasets. Transfer active learning shows more nuanced benefits: pre-training on related fingerspelling data can help, particularly when visual similarity aligns (e.g., GSL to ISL), but benefits are not uniformly guaranteed and may be driven by visual rather than linguistic factors. These results underscore the importance of disentangling visual and linguistic similarities in transfer learning for sign-language tasks and point to future directions for more robust, data-efficient sign-language systems.

Abstract

We apply active learning to help with data scarcity problems in sign languages. In particular, we perform a novel analysis of the effect of pre-training. Since many sign languages are linguistic descendants of French sign language, they share hand configurations, which pre-training can hopefully exploit. We test this hypothesis on American, Chinese, German, and Irish fingerspelling corpora. We do observe a benefit from pre-training, but this may be due to visual rather than linguistic similarities

Active Learning for Multilingual Fingerspelling Corpora

TL;DR

The paper tackles data scarcity in sign-language recognition by coupling active learning with cross-language transfer on multilingual fingerspelling corpora. It demonstrates that active learning with variation ratios consistently yields gains over random sampling, with near full-data performance reachable from a small data fraction for most datasets. Transfer active learning shows more nuanced benefits: pre-training on related fingerspelling data can help, particularly when visual similarity aligns (e.g., GSL to ISL), but benefits are not uniformly guaranteed and may be driven by visual rather than linguistic factors. These results underscore the importance of disentangling visual and linguistic similarities in transfer learning for sign-language tasks and point to future directions for more robust, data-efficient sign-language systems.

Abstract

We apply active learning to help with data scarcity problems in sign languages. In particular, we perform a novel analysis of the effect of pre-training. Since many sign languages are linguistic descendants of French sign language, they share hand configurations, which pre-training can hopefully exploit. We test this hypothesis on American, Chinese, German, and Irish fingerspelling corpora. We do observe a benefit from pre-training, but this may be due to visual rather than linguistic similarities
Paper Structure (16 sections, 1 equation, 6 figures)

This paper contains 16 sections, 1 equation, 6 figures.

Figures (6)

  • Figure 1: Example Images. The above figure shows the letters 'A' and 'P' from the ASL, CSL, GSL, and ISL data sets respectively.
  • Figure 2: Single Corpus Active Learning: The figure above shows AL results for each fingerspelling corpus. Using variation ratios (blue) as an acquisition function is clearly superior to random sampling (yellow).
  • Figure 3: Transfer Active Learning: The figure above shows AL results for each fingerspelling corpus with pre-training and no pre-training. Pre-training with a non-fingerspelling corpus degrades performance in all cases.
  • Figure 4: Per Class Results for Transfer Active Learning: Generally, we see the gains from pre-training are had by shared letters (red) but not exclusively ('X' in ISL is not shared by GSL).
  • Figure 5: Transfer Active Learning: The figure above shows AL results for each fingerspelling corpus with pre-training and no pre-training for resolution = 96×96. Pre-training with a double hand-fingerspelling(Indian_SL) exists as a reference.
  • ...and 1 more figures