The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models

Xiliang Zhu; Shayna Gardiner; Tere Roldán; David Rossouw

The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models

Xiliang Zhu, Shayna Gardiner, Tere Roldán, David Rossouw

TL;DR

This work undertakes an empirical analysis to compare the cross-lingual transfer capability of public Small Multilingual Language Models like XLM-R, against English-centric LLMs such as Llama-3, in the context of sentiment analysis across English, Spanish, French and Chinese.

Abstract

Sentiment analysis serves as a pivotal component in Natural Language Processing (NLP). Advancements in multilingual pre-trained models such as XLM-R and mT5 have contributed to the increasing interest in cross-lingual sentiment analysis. The recent emergence in Large Language Models (LLM) has significantly advanced general NLP tasks, however, the capability of such LLMs in cross-lingual sentiment analysis has not been fully studied. This work undertakes an empirical analysis to compare the cross-lingual transfer capability of public Small Multilingual Language Models (SMLM) like XLM-R, against English-centric LLMs such as Llama-3, in the context of sentiment analysis across English, Spanish, French and Chinese. Our findings reveal that among public models, SMLMs exhibit superior zero-shot cross-lingual performance relative to LLMs. However, in few-shot cross-lingual settings, public LLMs demonstrate an enhanced adaptive potential. In addition, we observe that proprietary GPT-3.5 and GPT-4 lead in zero-shot cross-lingual capability, but are outpaced by public models in few-shot scenarios.

The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models

TL;DR

Abstract

Paper Structure (21 sections, 2 figures, 6 tables)

This paper contains 21 sections, 2 figures, 6 tables.

Introduction
Background
Cross-lingual Sentiment Analysis
Sentiment Analysis in Conversational Transcripts
Methodology
Supervised Fine-tuning
In-context Learning
Experiment
Dataset
Selected pre-trained Models
Experiment Setup
Results
Zero-shot Cross-lingual Transfer
Few-shot Cross-lingual Transfer
Conclusion
...and 6 more sections

Figures (2)

Figure 1: Diagram of zero- and few- shot cross-lingual sentiment analysis from English (EN) to French (FR) under Supervised Fine-tuning (left) and In-context learning (right).
Figure 2: Average F1 score performance comparison (across ES, FR and ZH) under N-shot settings. GPT-3.5 is not included in this 600-shot due to the context length limit.

The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models

TL;DR

Abstract

The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models

Authors

TL;DR

Abstract

Table of Contents

Figures (2)