Ensembling Finetuned Language Models for Text Classification

Sebastian Pineda Arango; Maciej Janowski; Lennart Purucker; Arber Zela; Frank Hutter; Josif Grabocka

Ensembling Finetuned Language Models for Text Classification

Sebastian Pineda Arango, Maciej Janowski, Lennart Purucker, Arber Zela, Frank Hutter, Josif Grabocka

TL;DR

A metadataset with predictions from five large finetuned models on six datasets is presented, and results of different ensembling strategies from these predictions are reported, shedding light on how ensembling can improve the performance of finetuned text classifiers and incentivize future adoption of ensembles in such tasks.

Abstract

Finetuning is a common practice widespread across different communities to adapt pretrained models to particular tasks. Text classification is one of these tasks for which many pretrained models are available. On the other hand, ensembles of neural networks are typically used to boost performance and provide reliable uncertainty estimates. However, ensembling pretrained models for text classification is not a well-studied avenue. In this paper, we present a metadataset with predictions from five large finetuned models on six datasets, and report results of different ensembling strategies from these predictions. Our results shed light on how ensembling can improve the performance of finetuned text classifiers and incentivize future adoption of ensembles in such tasks.

Ensembling Finetuned Language Models for Text Classification

TL;DR

Abstract

Ensembling Finetuned Language Models for Text Classification

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)