Enhancing ASR Performance through OCR Word Frequency Analysis: Theoretical Foundations

Kyudan Jung, Nam-Joon Kim, Hyun Gon Ryu, Hyuk-Jae Lee

TL;DR

This work examines, through experiments and data analysis, whether the word frequency difference approach improves ASR performance on specialized terminology, and introduces the power law as the theoretical foundation for the relative frequency methodology used in that approach.

Abstract

As the interest in large language models grows, the importance of accuracy in automatic speech recognition has become more pronounced. This is especially true for lectures that include specialized terminology. In such cases, the success rate of traditional ASR models tends to be low, presenting a significant challenge. A method using the word frequency difference approach has been proposed to improve ASR performance for specialized terminology. We investigated this proposal through experiments and data analysis to determine if it effectively addresses the issue. In addition, we introduced the power law as the theoretical foundation for the relative frequency methodology mentioned in this approach.

Paper Structure

This paper contains 9 sections, 3 equations, and 1 figure.

Figures (1)

  • Figure 1: Log-log plots of relative frequency (RF), with log(RF rank) on the x-axis and log(RF) on the y-axis. (a) RF using the existing method (ref3). (b) RF for data processed using Method 1 in addition to the existing method. (c) RF for data processed using both Method 1 and Method 2 in addition to the existing method.
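
As a rough illustration of how a power law shows up in a plot like Figure 1, the sketch below ranks a list of positive values, takes logarithms of rank and value, and fits a straight line by ordinary least squares. A power law value = C * rank^(-alpha) appears as a line with slope -alpha on these axes. The function name fit_power_law and the synthetic input are assumptions made for this example; they are not the paper's RF definition, data, or fitting procedure.

```python
import math


def fit_power_law(values):
    """Fit log(value) = intercept + slope * log(rank) by least squares.

    Hypothetical helper for illustration only. Non-positive values are
    dropped because their logarithm is undefined.
    """
    ranked = sorted((v for v in values if v > 0), reverse=True)
    if len(ranked) < 2:
        raise ValueError("need at least two positive values")

    # Rank 1 is the largest value, matching a rank-ordered plot.
    xs = [math.log(rank) for rank in range(1, len(ranked) + 1)]
    ys = [math.log(v) for v in ranked]

    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Synthetic placeholder values (not data from the paper) that follow
# value = 5 * rank^(-1.2) exactly.
synthetic_rf = [5.0 * rank ** -1.2 for rank in range(1, 201)]
slope, intercept = fit_power_law(synthetic_rf)
print(f"fitted slope: {slope:.3f}, fitted C: {math.exp(intercept):.3f}")
```

On real word-frequency data the points only approximately follow a line, so the fitted slope is an estimate; a least-squares fit on log-log axes is a simple diagnostic rather than a rigorous test that the data follow a power law.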