Statistical Measures for Explainable Aspect-Based Sentiment Analysis: A Case Study on Environmental Discourse in Reddit

Luisa Stracqualursi; Patrizia Agati

Statistical Measures for Explainable Aspect-Based Sentiment Analysis: A Case Study on Environmental Discourse in Reddit

Luisa Stracqualursi, Patrizia Agati

TL;DR

A statistical, model-agnostic framework to assess the behavioral transparency and trustworthiness of ABSA models, which relies on several metrics, such as the entropy of polarity distributions, soft-count-based dominance scores, and sentiment divergence between sources, whose robustness is validated through bootstrap resampling and sensitivity analysis.

Abstract

Aspect-Based Sentiment Analysis (ABSA) provides a fine-grained understanding of opinions by linking sentiment to specific aspects in text. While transformer-based models excel at this task, their black-box nature limits their interpretability, posing risks in real-world applications without labeled data. This paper introduces a statistical, model-agnostic framework to assess the behavioral transparency and trustworthiness of ABSA models. Our framework relies on several metrics, such as the entropy of polarity distributions, soft-count-based dominance scores, and sentiment divergence between sources, whose robustness is validated through bootstrap resampling and sensitivity analysis. A case study on environmentally focused Reddit communities illustrates how the proposed indicators provide interpretable diagnostics of model certainty, decisiveness, and cross-source variability. The results show that statistical indicators computed on soft outputs can complement traditional approaches, offering a computationally efficient methodology for validating, monitoring, and interpreting ABSA models in contexts where labeled data are unavailable.

Statistical Measures for Explainable Aspect-Based Sentiment Analysis: A Case Study on Environmental Discourse in Reddit

TL;DR

Abstract

Paper Structure (26 sections, 8 equations, 2 figures, 4 tables)

This paper contains 26 sections, 8 equations, 2 figures, 4 tables.

Introduction
Related Work
Methodology
Stage 1: Aspect Extraction and Probabilistic Sentiment Prediction
Stage 2: Computation of Statistical Indicators
Illustrative Example.
Polarity Entropy $H(A)$.
Sentiment Dominance $D(A)$.
Formal Definitions.
Illustrative Example.
Sentiment Divergence $JSD(A)$.
Stage 3: Robustness and Stability Assessment
Confidence-Based Filtering and Robustness Assessment.
Bootstrap Resampling for Stability Assessment.
Alignment with the SAFE Framework
...and 11 more sections

Figures (2)

Figure 1: Top 10 ABSA results, Confidence vs Dominance/Entropy
Figure 2: Explainability-Robustness Tradeoff via Confidence Filtering

Statistical Measures for Explainable Aspect-Based Sentiment Analysis: A Case Study on Environmental Discourse in Reddit

TL;DR

Abstract

Statistical Measures for Explainable Aspect-Based Sentiment Analysis: A Case Study on Environmental Discourse in Reddit

Authors

TL;DR

Abstract

Table of Contents

Figures (2)