Shannon Entropy is better Feature than Category and Sentiment in User Feedback Processing

Andres Rojas Paredes; Brenda Mareco

Shannon Entropy is better Feature than Category and Sentiment in User Feedback Processing

Andres Rojas Paredes, Brenda Mareco

TL;DR

This study tackles how to prioritize user reviews for app development by ranking them with a weighted feature function. It shows that Shannon Entropy, computed directly from review text, can replace traditional features such as category, sentiment, and length, yielding equal or better $NDCG$ scores than standard feature sets under an exhaustive two-decimal weight search. However, increasing weight precision reveals formidable computational limits, and fairness concerns arise as country bias affects rankings; bias mitigation reduces ranking quality, highlighting a trade-off between accuracy and fairness. Overall, entropy-based features emerge as a promising, computationally lightweight alternative to classifier-dependent attributes, warranting further exploration of entropy-related metrics in requirements engineering feedback processing.

Abstract

App reviews in mobile app stores contain useful information which is used to improve applications and promote software evolution. This information is processed by automatic tools which prioritize reviews. In order to carry out this prioritization, reviews are decomposed into features like category and sentiment. Then, a weighted function assigns a weight to each feature and a review ranking is calculated. Unfortunately, in order to extract category and sentiment from reviews, its is required at least a classifier trained in an annotated corpus. Therefore this task is computational demanding. Thus, in this work, we propose Shannon Entropy as a simple feature which can replace standard features. Our results show that a Shannon Entropy based ranking is better than a standard ranking according to the NDCG metric. This result is promising even if we require fairness by means of algorithmic bias. Finally, we highlight a computational limit which appears in the search of the best ranking.

Shannon Entropy is better Feature than Category and Sentiment in User Feedback Processing

TL;DR

scores than standard feature sets under an exhaustive two-decimal weight search. However, increasing weight precision reveals formidable computational limits, and fairness concerns arise as country bias affects rankings; bias mitigation reduces ranking quality, highlighting a trade-off between accuracy and fairness. Overall, entropy-based features emerge as a promising, computationally lightweight alternative to classifier-dependent attributes, warranting further exploration of entropy-related metrics in requirements engineering feedback processing.

Abstract

Paper Structure (25 sections, 4 equations, 2 figures, 6 tables)

This paper contains 25 sections, 4 equations, 2 figures, 6 tables.

Introduction
Research Questions
Methodology
Data exploration and preparation
Data collection and annotation process
Experiment design
Pipeline implementation
Preprocessing
Feature extraction
Ranking
Quality Testing
Bias testing
Statistics
Results
Experiment 1: looking for the best ranking using standard features.
...and 10 more sections

Figures (2)

Figure 1: User Feedback Processing Pipeline
Figure 2: Entropy by country

Shannon Entropy is better Feature than Category and Sentiment in User Feedback Processing

TL;DR

Abstract

Shannon Entropy is better Feature than Category and Sentiment in User Feedback Processing

Authors

TL;DR

Abstract

Table of Contents

Figures (2)