Table of Contents
Fetching ...

Voice Communication Analysis in Esports

Aymeric Vinot, Nicolas Perez

TL;DR

This work investigates how voice communication affects team performance in esports, focusing on League of Legends. It proposes NLP-based metrics to quantify two critical issues: duplicate communications and parasite (unclear/irrelevant) communications, using semantic sentence embeddings and fixed-threshold detection. The authors implement an audio-processing pipeline that transcribes speech with Whisper, diarizes speakers, and aligns text to timestamps, then validate their methods through experiments showing that a general-purpose embedding model ($\text{mxbai-embed-large-v1}$) yields the most balanced performance across tasks, while acknowledging jargon-related limitations. Practical contributions include tools for coaches to profile and improve team communication, with a clear path toward specialized models and multimodal analyses to better correlate communication quality with in-game outcomes.

Abstract

In most team-based esports, voice communications are prominent in the team efficiency and synergy. In fact it has been observed that not only the skill aspect of the team but also the team effective voice communication comes into play when trying to have good performance in official matches. With the recent emergence of LLM (Large Language Models) tools regarding NLP (Natural Language Processing) (Vaswani et. al.), we decided to try applying them in order to have a better understanding on how to improve the effectiveness of the voice communications. In this paper the study has been made through the prism of League of Legends esport. However the main concepts and ideas can be easily applicable in any other team related esports.

Voice Communication Analysis in Esports

TL;DR

This work investigates how voice communication affects team performance in esports, focusing on League of Legends. It proposes NLP-based metrics to quantify two critical issues: duplicate communications and parasite (unclear/irrelevant) communications, using semantic sentence embeddings and fixed-threshold detection. The authors implement an audio-processing pipeline that transcribes speech with Whisper, diarizes speakers, and aligns text to timestamps, then validate their methods through experiments showing that a general-purpose embedding model () yields the most balanced performance across tasks, while acknowledging jargon-related limitations. Practical contributions include tools for coaches to profile and improve team communication, with a clear path toward specialized models and multimodal analyses to better correlate communication quality with in-game outcomes.

Abstract

In most team-based esports, voice communications are prominent in the team efficiency and synergy. In fact it has been observed that not only the skill aspect of the team but also the team effective voice communication comes into play when trying to have good performance in official matches. With the recent emergence of LLM (Large Language Models) tools regarding NLP (Natural Language Processing) (Vaswani et. al.), we decided to try applying them in order to have a better understanding on how to improve the effectiveness of the voice communications. In this paper the study has been made through the prism of League of Legends esport. However the main concepts and ideas can be easily applicable in any other team related esports.

Paper Structure

This paper contains 29 sections, 7 equations, 11 figures, 2 tables.

Figures (11)

  • Figure 1: Pipeline of audio transcription from bain2022whisperx
  • Figure 2: Duplicate communication scores on each sentence for each speaker. Each sentence is assigned a score ranging from 0 to 1 telling how much this said sentence is repetitive in terms of semantic similarity compared to the previous sentences spoken in a time frame of $W = 15s$
  • Figure 3: Duplicate communication scores on each sentence for each speaker on another game
  • Figure 4: Bottom : Overall interference score of the speaker's communication. Above : interference percentage on each parasite phrasings
  • Figure 5: Similarity score of each sentence spoken with eash parasite phrasing
  • ...and 6 more figures