Targeted Distillation for Sentiment Analysis

Yice Zhang; Guangyu Xie; Jingjie Lin; Jianzhu Bao; Qianlong Wang; Xi Zeng; Ruifeng Xu

TL;DR

The paper addresses the practicality gap in sentiment analysis with a targeted distillation framework that decouples the distillation target into knowledge and alignment, handled by two stages: KnowDist and ICLDist. KnowDist transfers sentiment knowledge via multi-perspective prompting, while ICLDist trains the student to follow task instructions using diversified few-shot prompts, enabling strong generalization to unseen tasks. Experiments across multiple teacher–student pairs on the SentiBench benchmark show substantial gains over generic distillation across a broad range of tasks and indicate that smaller distilled models can outperform larger original models. The work also contributes a structured benchmark, ablations demonstrating the complementary roles of KnowDist and ICLDist, and a discussion of practical considerations and limitations.

Abstract

This paper explores targeted distillation methods for sentiment analysis, aiming to build compact and practical models that preserve strong and generalizable sentiment analysis capabilities. To this end, we conceptually decouple the distillation target into knowledge and alignment and accordingly propose a two-stage distillation framework. Moreover, we introduce SentiBench, a comprehensive and systematic sentiment analysis benchmark that covers a diverse set of tasks across 12 datasets. We evaluate a wide range of models on this benchmark. Experimental results show that our approach substantially enhances the performance of compact models across diverse sentiment analysis tasks, and the resulting models demonstrate strong generalization to unseen tasks, showcasing robust competitiveness against existing small-scale models.
