Table of Contents
Fetching ...

Introduction to speech recognition

Gabriel Dauphin

Abstract

This document contains lectures and practical experimentations using Matlab and implementing a system which is actually correctly classifying three words (one, two and three) with the help of a very small database. To achieve this performance, it uses speech modeling specificities, powerful computer algorithms (dynamic time warping and Dijktra's algorithm) and machine learning (nearest neighbor). This document introduces also some machine learning evaluation metrics.

Introduction to speech recognition

Abstract

This document contains lectures and practical experimentations using Matlab and implementing a system which is actually correctly classifying three words (one, two and three) with the help of a very small database. To achieve this performance, it uses speech modeling specificities, powerful computer algorithms (dynamic time warping and Dijktra's algorithm) and machine learning (nearest neighbor). This document introduces also some machine learning evaluation metrics.
Paper Structure (84 sections, 158 equations, 31 figures)

This paper contains 84 sections, 158 equations, 31 figures.

Figures (31)

  • Figure 1: Schéma détaillé de l'oreille.
  • Figure 2: Courbe d'audibilité humaine.
  • Figure 3: Représentation temporelle du signal sonore.
  • Figure 4: Découpage d'une sinusoïde en trames qui ne se chevauchent pas.
  • Figure 5: Découpage d'une sinusoïde en trames qui se chevauchent.
  • ...and 26 more figures