Publication
Title
Sarcasm detection using an ensemble approach
Author
Abstract
We present an ensemble approach for the detection of sarcasm in Reddit and Twitter responses in the context of The Second Workshop on Figurative Language Processing held in conjunction with ACL 2020(1). The ensemble is trained on the predicted sarcasm probabilities of four component models and on additional features, such as the sentiment of the comment, its length, and source (Reddit or Twitter) in order to learn which of the component models is the most reliable for which input. The component models consist of an LSTM with hashtag and emoji representations; a CNN-LSTM with casing, stop word, punctuation, and sentiment representations; an MLP based on Infersent embeddings; and an SVM trained on stylometric and emotion-based features. All component models use the two conversational turns preceding the response as context, except for the SVM, which only uses features extracted from the response. The ensemble itself consists of an adaboost classifier with the decision tree algorithm as base estimator and yields F1-scores of 67% and 74% on the Reddit and Twitter test data, respectively.
Language
English
Source (journal)
FIGURATIVE LANGUAGE PROCESSING
Source (book)
2nd Workshop on Figurative Language Processing, JUL09, 2020, ELECTR NETWORK
Publication
Stroudsburg : Assoc computational linguistics-acl , 2020
ISBN
978-1-952148-12-5
DOI
10.18653/V1/2020.FIGLANG-1.36
Volume/pages
(2020) , p. 264-269
ISI
000563422200036
Full text (Publisher's DOI)
Full text (publisher's version - intranet only)
UAntwerpen
Faculty/Department
Research group
Project info
Artificial intelligence for creative language use.
Publication type
Subject
Affiliation
Publications with a UAntwerp address
External links
Web of Science
Record
Identifier
Creation 19.10.2020
Last edited 13.11.2024
To cite this reference