A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts
Bo Pang, Lillian Lee
TL;DR
The paper tackles improving sentiment polarity classification by isolating subjective content in reviews. It introduces a minimum-cut graph formulation to fuse per-sentence subjectivity signals with cross-sentence proximity constraints, producing compact subjectivity extracts. When these extracts are used with NB or SVM polarity classifiers, they achieve equal or better accuracy with far fewer words, and context-aware graph cuts offer additional gains. This approach enables efficient, context-sensitive sentiment analysis and contributes to effective sentiment summarization methods.
Abstract
Sentiment analysis seeks to identify the viewpoint(s) underlying a text span; an example application is classifying a movie review as "thumbs up" or "thumbs down". To determine this sentiment polarity, we propose a novel machine-learning method that applies text-categorization techniques to just the subjective portions of the document. Extracting these portions can be implemented using efficient techniques for finding minimum cuts in graphs; this greatly facilitates incorporation of cross-sentence contextual constraints.
