Table of Contents
Fetching ...

FFSTC: Fongbe to French Speech Translation Corpus

D. Fortune Kponou, Frejus A. A. Laleye, Eugene C. Ezin

TL;DR

This work introduces FFSTC, the first public end-to-end Fongbe–French speech translation corpus, totaling about $31$ hours of spoken Fongbe with French text. The dataset is created through a multi-source collection and a rigorous, multi-tier validation process, resulting in a high-quality resource with $16{,}447$ sentences and diverse vocabulary. Baseline end-to-end translations are established using Fairseq transformer_s and conformer models, achieving BLEU scores of $8.96$ and $8.14$, respectively, providing a benchmark for future improvements. The paper demonstrates the feasibility and value of targeted, end-to-end SL datasets for under-resourced, tonal languages and outlines clear directions for leveraging pre-training and transfer learning to advance performance in West African language translation scenarios.

Abstract

In this paper, we introduce the Fongbe to French Speech Translation Corpus (FFSTC) for the first time. This corpus encompasses approximately 31 hours of collected Fongbe language content, featuring both French transcriptions and corresponding Fongbe voice recordings. FFSTC represents a comprehensive dataset compiled through various collection methods and the efforts of dedicated individuals. Furthermore, we conduct baseline experiments using Fairseq's transformer_s and conformer models to evaluate data quality and validity. Our results indicate a score of 8.96 for the transformer_s model and 8.14 for the conformer model, establishing a baseline for the FFSTC corpus.

FFSTC: Fongbe to French Speech Translation Corpus

TL;DR

This work introduces FFSTC, the first public end-to-end Fongbe–French speech translation corpus, totaling about hours of spoken Fongbe with French text. The dataset is created through a multi-source collection and a rigorous, multi-tier validation process, resulting in a high-quality resource with sentences and diverse vocabulary. Baseline end-to-end translations are established using Fairseq transformer_s and conformer models, achieving BLEU scores of and , respectively, providing a benchmark for future improvements. The paper demonstrates the feasibility and value of targeted, end-to-end SL datasets for under-resourced, tonal languages and outlines clear directions for leveraging pre-training and transfer learning to advance performance in West African language translation scenarios.

Abstract

In this paper, we introduce the Fongbe to French Speech Translation Corpus (FFSTC) for the first time. This corpus encompasses approximately 31 hours of collected Fongbe language content, featuring both French transcriptions and corresponding Fongbe voice recordings. FFSTC represents a comprehensive dataset compiled through various collection methods and the efforts of dedicated individuals. Furthermore, we conduct baseline experiments using Fairseq's transformer_s and conformer models to evaluate data quality and validity. Our results indicate a score of 8.96 for the transformer_s model and 8.14 for the conformer model, establishing a baseline for the FFSTC corpus.
Paper Structure (11 sections, 1 figure, 2 tables)