DWAEF: a deep weighted average ensemble framework harnessing novel indicators for sarcasm detection

Tracking #: 755-1735


Responsible editor: 

Tobias Kuhn

Submission Type: 

Research Paper

Abstract: 

Sarcasm is a linguistic phenomenon often indicating a disparity between literal and inferred meanings. Due to its complexity, it is typically difficult to discern it within an online text message. Consequently, in recent years sarcasm detection has received considerable attention from both academia and industry. Nevertheless, the majority of current approaches simply model low-level indicators of sarcasm in various machine learning algorithms. This paper aims to present sarcasm in a new light by utilizing novel indicators in a deep weighted average ensemble-based framework (DWAEF). The novel indicators pertain to exploiting the presence of simile and metaphor in text and detecting the subtle shift in tone at a sentence’s structural level. A graph neural network (GNN) structure is implemented to detect the presence of simile, bidirectional encoder representations from transformers (BERT) embeddings are exploited to detect metaphorical instances and fuzzy logic is employed to account for the shift of tone. To account for the existence of sarcasm, the DWAEF integrates the inputs from the novel indicators. The performance of the framework is evaluated on a self-curated dataset of online text messages. A comparative report between the results acquired using primitive features and those obtained using a combination of primitive features and proposed indicators is provided. The highest accuracy of 92% was achieved after applying DWAEF, the proposed framework which combines the primitive features and novel indicators together as compared to 78.58% obtained using Support Vector Machine (SVM) which was the lowest among all classifiers.

Manuscript: 

Supplementary Files (optional): 

Previous Version: 

Tags: 

  • Reviewed

Data repository URLs: 

Date of Submission: 

Tuesday, April 11, 2023

Date of Decision: 

Wednesday, May 17, 2023


Nanopublication URLs:
http://ds.kpxl.org/RA6eSikzkbevekNVjaaamsCz-AFVrq-mCFT_qTcoHCxsk
http://ds.kpxl.org/RApZcpS_Ro-ZlpocsthO-DKyKLilpPrv2djkpJofyhLgU
http://ds.kpxl.org/RAmCgCNcwHh5NB4phto4lF8oFmRV_CflDgtcFiTOFb5iU
http://ds.kpxl.org/RAqiDn7GfKtIRRUPkgznE_hNk26RF1pPAkRkXfwUMQ1CQ
http://ds.kpxl.org/RAGjDt_jDgYZox7R7yQe3VH59avpgBJt_hXL6gA5-W5DQ
http://ds.kpxl.org/RAxFchwnoXbrufZ4B5x1U612WL5a_8jtng_QgtR2jotXM
http://ds.kpxl.org/RA7x4i_5elLD0qgWaslUofxWOBM5dqxpsQW335YArLJWk
http://ds.kpxl.org/RA6PE7E1Vzb9vBWtfVVmJ0i3d68rvFd4alYR7mEd_6Hes

Decision: 

Accept

Solicited Reviews:


2 Comments

Meta-Review by Editor

The reviewers agree that this manuscript should be accepted, with only one remaining issue still to be addressed. Moreover, for the final publication, the dataset should be made available in a persistent way through one of the established third-party data repositories. I recommend Zenodo.org for that. Apart from that, the paper is ready for publication.

Tobias Kuhn (https://orcid.org/0000-0002-1267-0234)