Evaluating Prompt Engineering Strategies for Sentiment Control in AI-Generated Texts
Kerstin Sahler, Sophie Jentzsch
TL;DR
This work systematically evaluates prompt engineering as a resource-efficient method to steer sentiment in LLM-generated text, using Ekman’s six emotions and a DistilRoBERTa emotion classifier to compare Vanilla, Zero-Shot, Zero-Shot CoT, Few-Shot, and CoT prompts against fine-tuning. The study finds that Few-Shot prompts with carefully crafted human-written examples yield the strongest emotion steering, outperforming even a fine-tuned baseline in several cases, while Zero-Shot approaches offer a lightweight but sometimes weaker alternative. CoT prompts show mixed results, with reasoning text occasionally aiding but often underperforming relative to Few-Shot and Zero-Shot strategies. The results highlight the practical value of prompt design for emotion-adaptive AI, especially in data-limited contexts, while acknowledging linguistic scope, evaluation methods, and stylistic transfer as important avenues for future work.
Abstract
The groundbreaking capabilities of Large Language Models (LLMs) offer new opportunities for enhancing human-computer interaction through emotion-adaptive Artificial Intelligence (AI). However, deliberately controlling the sentiment in these systems remains challenging. The present study investigates the potential of prompt engineering for controlling sentiment in LLM-generated text, providing a resource-sensitive and accessible alternative to existing methods. Using Ekman's six basic emotions (e.g., joy, disgust), we examine various prompting techniques, including Zero-Shot and Chain-of-Thought prompting using gpt-3.5-turbo, and compare it to fine-tuning. Our results indicate that prompt engineering effectively steers emotions in AI-generated texts, offering a practical and cost-effective alternative to fine-tuning, especially in data-constrained settings. In this regard, Few-Shot prompting with human-written examples was the most effective among other techniques, likely due to the additional task-specific guidance. The findings contribute valuable insights towards developing emotion-adaptive AI systems.
