Table of Contents
Fetching ...

dzStance at StanceEval2024: Arabic Stance Detection based on Sentence Transformers

Mohamed Lichouri, Khaled Lounnas, Khelil Rafik Ouaras, Mohamed Abi, Anis Guechtouli

TL;DR

This work assesses Arabic stance detection by comparing TF-IDF features with Sentence Transformers on Mawqif 2022 data across three topics. It demonstrates that embedding-based representations (xlm-r-bert-base-nli-stsb-mean-tokens) with Logistic Regression outperform traditional TF-IDF with LinearSVC, achieving an overall F1 of 68.48% in the experiments and placing competitively in the StanceEval2024 rankings (e.g., 13th overall). The findings underscore the effectiveness of transformer-based embeddings for capturing semantic nuances in Arabic text and suggest avenues for further improvements via hybrid feature strategies and targeted fine-tuning. The work contributes practical, reproducible approaches for Arabic stance detection with potential impact on social discourse analysis and monitoring.

Abstract

This study compares Term Frequency-Inverse Document Frequency (TF-IDF) features with Sentence Transformers for detecting writers' stances--favorable, opposing, or neutral--towards three significant topics: COVID-19 vaccine, digital transformation, and women empowerment. Through empirical evaluation, we demonstrate that Sentence Transformers outperform TF-IDF features across various experimental setups. Our team, dzStance, participated in a stance detection competition, achieving the 13th position (74.91%) among 15 teams in Women Empowerment, 10th (73.43%) in COVID Vaccine, and 12th (66.97%) in Digital Transformation. Overall, our team's performance ranked 13th (71.77%) among all participants. Notably, our approach achieved promising F1-scores, highlighting its effectiveness in identifying writers' stances on diverse topics. These results underscore the potential of Sentence Transformers to enhance stance detection models for addressing critical societal issues.

dzStance at StanceEval2024: Arabic Stance Detection based on Sentence Transformers

TL;DR

This work assesses Arabic stance detection by comparing TF-IDF features with Sentence Transformers on Mawqif 2022 data across three topics. It demonstrates that embedding-based representations (xlm-r-bert-base-nli-stsb-mean-tokens) with Logistic Regression outperform traditional TF-IDF with LinearSVC, achieving an overall F1 of 68.48% in the experiments and placing competitively in the StanceEval2024 rankings (e.g., 13th overall). The findings underscore the effectiveness of transformer-based embeddings for capturing semantic nuances in Arabic text and suggest avenues for further improvements via hybrid feature strategies and targeted fine-tuning. The work contributes practical, reproducible approaches for Arabic stance detection with potential impact on social discourse analysis and monitoring.

Abstract

This study compares Term Frequency-Inverse Document Frequency (TF-IDF) features with Sentence Transformers for detecting writers' stances--favorable, opposing, or neutral--towards three significant topics: COVID-19 vaccine, digital transformation, and women empowerment. Through empirical evaluation, we demonstrate that Sentence Transformers outperform TF-IDF features across various experimental setups. Our team, dzStance, participated in a stance detection competition, achieving the 13th position (74.91%) among 15 teams in Women Empowerment, 10th (73.43%) in COVID Vaccine, and 12th (66.97%) in Digital Transformation. Overall, our team's performance ranked 13th (71.77%) among all participants. Notably, our approach achieved promising F1-scores, highlighting its effectiveness in identifying writers' stances on diverse topics. These results underscore the potential of Sentence Transformers to enhance stance detection models for addressing critical societal issues.
Paper Structure (10 sections, 2 tables)