Sarcasm Detection Using Deep Convolutional Neural Networks: A Modular Deep Learning Framework
Manas Zambre, Sarika Bobade
TL;DR
The paper tackles sarcasm detection in text, a task challenged by reliance on context and tone. It proposes a modular deep learning framework that fuses four detectors—Sentiment, Contextual Embedding, Linguistic Features, and Emotion Detection—via DCNNs and BERT to form a unified representation for sarcasm classification. A conceptual multimodal case study demonstrates that integrating visual cues with text (BERT+DenseNet) improves accuracy over text alone, illustrating the practical potential for applications in moderation and conversational agents. The approach emphasizes extensibility, explainability, and fairness, outlining a path toward multilingual and audio-visual enhancements in future work.
Abstract
Sarcasm is a nuanced and often misinterpreted form of communication, especially in text, where tone and body language are absent. This paper proposes a modular deep learning framework for sarcasm detection, leveraging Deep Convolutional Neural Networks (DCNNs) and contextual models such as BERT to analyze linguistic, emotional, and contextual cues. The system integrates sentiment analysis, contextual embeddings, linguistic feature extraction, and emotion detection through a multi-layer architecture. While the model is in the conceptual stage, it demonstrates feasibility for real-world applications such as chatbots and social media analysis.
