NLP-Based Extended Lexicon Model For Sarcasm Detection With Tweets And Emojis

M Sreenandan Reddy; Anitha Bukkacherla; Chowdeshwari Pujari; Baba Fareed Shaik; Neeraja Narla; Dileep Kumar Gosala

doi:10.33425/3066-1226.1088

Global Journal of Engineering Innovations and Interdisciplinary Research

NLP-Based Extended Lexicon Model For Sarcasm Detection With Tweets And Emojis

M Sreenandan Reddy, Anitha Bukkacherla, Chowdeshwari Pujari, Baba Fareed Shaik, Neeraja Narla, Dileep Kumar Gosala

10.33425/3066-1226.1088

Sarcasm detection in social media, especially on platforms like Twitter, poses a significant challenge due to the informal and context-dependent nature of language. This research presents an NLP-based extended lexicon model for sarcasm detection, leveraging both textual features and emojis to enhance interpretability and accuracy. The proposed model integrates a sentiment lexicon with syntactic and semantic cues, enriched by emoji sentiment analysis, to capture subtle contradictions and ironic tones in tweets. A dataset of labeled tweets is preprocessed and analyzed using natural language processing techniques, including tokenization, POS tagging, and feature extraction. Machine learning classifiers are then employed to detect sarcasm. Experimental results demonstrate that the inclusion of emoji sentiment and lexicon-based features significantly improves detection performance, highlighting the importance of multimodal analysis in understanding sarcasm in social media texts.

PDF