VEMOCLAP: A video emotion classification web application

Serkan Sulun; Paula Viana; Matthew E. P. Davies

VEMOCLAP: A video emotion classification web application

Serkan Sulun, Paula Viana, Matthew E. P. Davies

TL;DR

This work improves the previous work, which exploits open-source pretrained models that work on video frames and audio, and then efficiently fuse the resulting pretrained features using multi-head cross-attention to increase classification accuracy on the Ekman-6 video emotion dataset.

Abstract

We introduce VEMOCLAP: Video EMOtion Classifier using Pretrained features, the first readily available and open-source web application that analyzes the emotional content of any user-provided video. We improve our previous work, which exploits open-source pretrained models that work on video frames and audio, and then efficiently fuse the resulting pretrained features using multi-head cross-attention. Our approach increases the state-of-the-art classification accuracy on the Ekman-6 video emotion dataset by 4.3% and offers an online application for users to run our model on their own videos or YouTube videos. We invite the readers to try our application at serkansulun.com/app.

VEMOCLAP: A video emotion classification web application

TL;DR

Abstract

VEMOCLAP: A video emotion classification web application

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (2)