Exploring Chinese Humor Generation: A Study on Two-Part Allegorical Sayings
Rongwu Xu
TL;DR
This work investigates how language models can understand and generate Chinese humor through two-part allegorical sayings. It compares a three-stage fine-tuning pipeline on a Pinyin-augmented mT5 (PmT5) with contrastive learning against zero-shot and few-shot prompting of a large LLM (ChatGPT) across completion and from-scratch generation tasks. Evaluation combines automatic metrics (BLEU, ROUGE, BERTScore) with human judgments on coherency and humor, aided by a learned scoring model trained on annotated data. Findings show that prompting an LLM can match the fine-tuned model in many cases, but human creativity remains superior; Pinyin augmentation and contrastive learning improve generation quality, highlighting the potential and current limits of computational Chinese humor generation.
Abstract
Humor, a culturally nuanced aspect of human language, poses challenges for computational understanding and generation, especially in Chinese humor, which remains relatively unexplored in the NLP community. This paper investigates the capability of state-of-the-art language models to comprehend and generate Chinese humor, specifically focusing on training them to create allegorical sayings. We employ two prominent training methods: fine-tuning a medium-sized language model and prompting a large one. Our novel fine-tuning approach incorporates fused Pinyin embeddings to consider homophones and employs contrastive learning with synthetic hard negatives to distinguish humor elements. Human-annotated results show that these models can generate humorous allegorical sayings, with prompting proving to be a practical and effective method. However, there is still room for improvement in generating allegorical sayings that match human creativity.
