Can Pre-trained Language Models Understand Chinese Humor?
Yuyan Chen, Zhixu Li, Jiaqing Liang, Yanghua Xiao, Bang Liu, Yunwen Chen
TL;DR
This work systematically investigates whether pre-trained language models understand Chinese humor by introducing a three-step evaluation framework with four humor-related tasks and a large Chinese humor dataset. It shows that original PLMs have limited humor understanding in zero-shot settings, but fine-tuning yields substantial improvements, and linguistic knowledge—especially Chinese pinyin embeddings—significantly boosts performance when applied via explicit fusion. Interpretability analyses reveal that PLMs rely on recognizable clue words, with saliency evidence supporting partial alignment with human humor perception, though gaps remain. Downstream transfer to sentiment tasks is modest, indicating domain-specific benefits and the need for richer cultural knowledge to enhance generalization in humor understanding and generation.
Abstract
Humor understanding is an important and challenging research in natural language processing. As the popularity of pre-trained language models (PLMs), some recent work makes preliminary attempts to adopt PLMs for humor recognition and generation. However, these simple attempts do not substantially answer the question: {\em whether PLMs are capable of humor understanding?} This paper is the first work that systematically investigates the humor understanding ability of PLMs. For this purpose, a comprehensive framework with three evaluation steps and four evaluation tasks is designed. We also construct a comprehensive Chinese humor dataset, which can fully meet all the data requirements of the proposed evaluation framework. Our empirical study on the Chinese humor dataset yields some valuable observations, which are of great guiding value for future optimization of PLMs in humor understanding and generation.
