DWAEF: a deep weighted average ensemble framework harnessing novel indicators for sarcasm detection

Tracking #: 755-1735

Authors:

	Name	ORCID
	Richa Sharma	https://orcid.org/0000-0002-4472-1681
	Simrat Deol	https://orcid.org/0000-0002-6785-9691
	Udit Kaushish	https://orcid.org/0000-0003-0636-4000
	Prakher Pandey	https://orcid.org/0000-0002-3340-8112
	Vishal Maurya	https://orcid.org/0000-0002-5169-209X

Responsible editor:

Tobias Kuhn

Submission Type:

Research Paper

Abstract:

Sarcasm is a linguistic phenomenon often indicating a disparity between literal and inferred meanings. Due to its complexity, it is typically difficult to discern it within an online text message. Consequently, in recent years sarcasm detection has received considerable attention from both academia and industry. Nevertheless, the majority of current approaches simply model low-level indicators of sarcasm in various machine learning algorithms. This paper aims to present sarcasm in a new light by utilizing novel indicators in a deep weighted average ensemble-based framework (DWAEF). The novel indicators pertain to exploiting the presence of simile and metaphor in text and detecting the subtle shift in tone at a sentence’s structural level. A graph neural network (GNN) structure is implemented to detect the presence of simile, bidirectional encoder representations from transformers (BERT) embeddings are exploited to detect metaphorical instances and fuzzy logic is employed to account for the shift of tone. To account for the existence of sarcasm, the DWAEF integrates the inputs from the novel indicators. The performance of the framework is evaluated on a self-curated dataset of online text messages. A comparative report between the results acquired using primitive features and those obtained using a combination of primitive features and proposed indicators is provided. The highest accuracy of 92% was achieved after applying DWAEF, the proposed framework which combines the primitive features and novel indicators together as compared to 78.58% obtained using Support Vector Machine (SVM) which was the lowest among all classifiers.

Manuscript:

ds-paper-755.pdf

Supplementary Files (optional):

ds-supplementary-755-1205.pdf

Previous Version:

DWAEF: A Deep Weighted Average Ensemble Framework Harnessing Novel Indicators for Sarcasm Detection

Data repository URLs:

https://docs.google.com/spreadsheets/d/1_2XQja9_Vpvzan9YQT3gNePJ4u10nYWchqFtlNpmMfU/edit?usp=sharing

https://github.com/simdeol/Simile-Dataset

Date of Submission:

Tuesday, April 11, 2023

Date of Decision:

Wednesday, May 17, 2023

Decision:

Solicited Reviews:

Review #1 submitted on 20/Apr/2023

By ALESSANDRA TERESA ORCID logo

https://orcid.org/0000-0002-4409-6679

Review Details

Reviewer has chosen not to be Anonymous

Overall Impression: Good
Suggested Decision: Accept
Technical Quality of the paper: Average
Presentation: Good
Reviewer`s confidence: High
Significance: Moderate significance
Background: Reasonable
Novelty: Limited novelty
Data availability: All used and produced data (if any) are FAIR and openly available in established data repositories
Length of the manuscript: The length of this manuscript is about right

Summary of paper in a few sentences (summary of changes and improvements for second round reviews):

The authors have made many changes and the manuscript has improved enormously. The paper is now fit for publication.

Reasons to accept:

The authors have made many changes and the manuscript has improved enormously. The paper is now fit for publication.

Reasons to reject:

The authors have made many changes and the manuscript has improved enormously. The paper is now fit for publication.

Nanopublication comments:

Further comments:

Review #2 submitted on 21/Apr/2023

By Kyle Gorman ORCID logo

https://orcid.org/0000-0002-4233-6595

Review Details

Reviewer has chosen not to be Anonymous

Overall Impression: Good
Suggested Decision: Accept
Technical Quality of the paper: Good
Presentation: Good
Reviewer`s confidence: High
Significance: Moderate significance
Background: Reasonable
Novelty: Clear novelty
Data availability: All used and produced data (if any) are FAIR and openly available in established data repositories
Length of the manuscript: The length of this manuscript is about right

Summary of paper in a few sentences (summary of changes and improvements for second round reviews):

This is a revision of a paper developing a rather detailed model of sarcasm detection.

Reasons to accept:

I feel that nearly all of my concerns about the first draft have been adressed and the paper has become much more reproducible. I congratulate the authors from the substantial improvements made.

Reasons to reject:

The only issues lingering I see is the following:

* Starting on page 2 or so the authors' literature review reports a bunch of accuracy and f-score numbers from unrelated corpora. As I said in my initial review, these are misleading and should be removed since the authors develop their own corpus. These prior results are uninformative---in fact misleading---and there's no obvious way to put them in context of the current paper, short of having the authors run their system on these corpora.

Nanopublication comments:

Further comments:

RESPONSE TO REVIEWERS

We, the co-authors of the paper titled 'DWAEF: a deep weighted average ensemble framework harnessing novel indicators for sarcasm detection', are grateful to the learned reviewers for their efforts in assessing our paper and providing us with insightful reviews. We have tried our best in addressing all points raised in the reviews and hereby submit the revised version of the paper. The revised manuscript '740-1720_DWAEF__A_Deep_Weighted_Average_Ensemble_Framework_Harnessing_Novel_Indicators_for_Sarcasm_Detection__Revised.pdf' is uploaded in the 'Manuscript' section. The URL of the dataset is given in the 'Data repository URLs' section. The responses to reviewers' comments have been documented in a file named '740-1720_ResponseSheet.pdf'. The file is uploaded in the 'Supplementary File' section. I earnestly hope that the modifications done in the manuscript and our responses are up to the expectations of the reviewers and editors. Regards, Richa Sharma Corresponding author

2 Comments

Meta-Review by Editor

Submitted by Tobias Kuhn on Mon, 05/15/2023 - 05:01

The reviewers agree that this manuscript should be accepted, with only one remaining issue still to be addressed. Moreover, for the final publication, the dataset should be made available in a persistent way through one of the established third-party data repositories. I recommend Zenodo.org for that. Apart from that, the paper is ready for publication.

Tobias Kuhn (https://orcid.org/0000-0002-1267-0234)

Dataset files uploaded on Zenodo.org

Submitted by Richa Sharma on Mon, 05/15/2023 - 11:53

As per your kind suggestion we have uploaded dataset files on Zenodo.org. Following is the link to the datasets.

https://zenodo.org/record/7937808#.ZGJUQnZBy3A

Kindly let us know if any other requirement is to be fulfilled.

Regards,

Richa Sharma

Data Science

DWAEF: a deep weighted average ensemble framework harnessing novel indicators for sarcasm detection

Tracking #: 755-1735

Authors:

Responsible editor:

Submission Type:

Abstract:

Manuscript:

Supplementary Files (optional):

Previous Version:

Tags:

Data repository URLs:

Date of Submission:

Date of Decision:

Decision:

2 Comments

Meta-Review by Editor

Dataset files uploaded on Zenodo.org