Table of Contents
Fetching ...

Ensembling Finetuned Language Models for Text Classification

Sebastian Pineda Arango, Maciej Janowski, Lennart Purucker, Arber Zela, Frank Hutter, Josif Grabocka

TL;DR

A metadataset with predictions from five large finetuned models on six datasets is presented, and results of different ensembling strategies from these predictions are reported, shedding light on how ensembling can improve the performance of finetuned text classifiers and incentivize future adoption of ensembles in such tasks.

Abstract

Finetuning is a common practice widespread across different communities to adapt pretrained models to particular tasks. Text classification is one of these tasks for which many pretrained models are available. On the other hand, ensembles of neural networks are typically used to boost performance and provide reliable uncertainty estimates. However, ensembling pretrained models for text classification is not a well-studied avenue. In this paper, we present a metadataset with predictions from five large finetuned models on six datasets, and report results of different ensembling strategies from these predictions. Our results shed light on how ensembling can improve the performance of finetuned text classifiers and incentivize future adoption of ensembles in such tasks.

Ensembling Finetuned Language Models for Text Classification

TL;DR

A metadataset with predictions from five large finetuned models on six datasets is presented, and results of different ensembling strategies from these predictions are reported, shedding light on how ensembling can improve the performance of finetuned text classifiers and incentivize future adoption of ensembles in such tasks.

Abstract

Finetuning is a common practice widespread across different communities to adapt pretrained models to particular tasks. Text classification is one of these tasks for which many pretrained models are available. On the other hand, ensembles of neural networks are typically used to boost performance and provide reliable uncertainty estimates. However, ensembling pretrained models for text classification is not a well-studied avenue. In this paper, we present a metadataset with predictions from five large finetuned models on six datasets, and report results of different ensembling strategies from these predictions. Our results shed light on how ensembling can improve the performance of finetuned text classifiers and incentivize future adoption of ensembles in such tasks.

Paper Structure

This paper contains 14 sections, 1 equation, 4 figures, 7 tables.

Figures (4)

  • Figure 1: Mean error across datasets for different hyperparameter combinations.
  • Figure 2: Error for different hyperparameters using 10 % of the data.
  • Figure 3: Error vs. LoRA Rank, extended version. The error variation is small across different LoRA rank values.
  • Figure 4: Error vs. LoRA Rank, mini version. The error variation is small across different LoRA rank values.