Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques

Olga Iakovenko; Anna Andreeva; Anna Lapidus; Liana Mikaelyan

Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques

Olga Iakovenko, Anna Andreeva, Anna Lapidus, Liana Mikaelyan

TL;DR

This paper analyzes existing approaches to text and speech summarization, proposes an unsupervised summarization technique based on clustering and provides a pipeline that includes an adapted automatic speech recognition block able to run on real-life recordings.

Abstract

Remote communication through video or audio conferences has become more popular than ever because of the worldwide pandemic. These events, therefore, have provoked the development of systems for automatic minuting of spoken language leading to AutoMin 2021 challenge. The following paper illustrates the results of the research that team MTS has carried out while participating in the Automatic Minutes challenge. In particular, in this paper we analyze existing approaches to text and speech summarization, propose an unsupervised summarization technique based on clustering and provide a pipeline that includes an adapted automatic speech recognition block able to run on real-life recordings. The proposed unsupervised technique outperforms pre-trained summarization models on the automatic minuting task with Rouge 1, Rouge 2 and Rouge L values of 0.21, 0.02 and 0.2 on the dev set, with Rouge 1, Rouge 2, Rouge L, Adequacy, Grammatical correctness and Fluency values of 0.180, 0.035, 0.098, 1.857, 2.304, 1.911 on the test set accordingly

Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques

TL;DR

Abstract

Paper Structure (18 sections, 4 figures, 8 tables)

This paper contains 18 sections, 4 figures, 8 tables.

Introduction
Background
Experiments
Datasets
Automin-confidential-data
ICSI
AMI
Algorithms
Speech Recognition
Text Summarization
Text-To-Text Transfer Transformer
Pegasus
PreSumm
VecSumm
Evaluation
...and 3 more sections

Figures (4)

Figure 1: Conventional structure of a speech summarization system
Figure 2: Developed speech summarization pipeline
Figure 3: Flow of the VecSumm approach
Figure 4: Meaningful clusters and their cluster centers

Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques

TL;DR

Abstract

Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques

Authors

TL;DR

Abstract

Table of Contents

Figures (4)