Abstract
Stance detection is a relatively new concept in data mining that aims to assign a stance label (favor, against, or none) to a social media post towards a specific pre-determined target. These targets may not be referred to in the post, and may not be the target of opinion in the post. In this paper, we propose a novel enhanced method for identifying the writer's stance of a given tweet. This comprises a three-phase process for stance detection: (a) tweets preprocessing; here we clean and normalize tweets (e.g., remove stop-words) to generate words and stems lists, (b) features generation; in this step, we create and fuse two dictionaries for generating features vector, and lastly (c) classification; all the instances of the features are classified based on the list of targets. Our innovative feature selection proposes fusion of two ranked lists (top-k) of term frequency-inverse document frequency (tf-idf) scores and the sentiment information. We evaluate our method using six different classifiers: K nearest neighbor (K-NN), discernibility-based K-NN, weighted K-NN, class-based K-NN, exemplar-based K-NN, and Support Vector Machines. Furthermore, we investigate the use of Principal Component Analysis and study its effect on performance. The model is evaluated on the benchmark dataset (SemEval-2016 task 6), and the results significance is determined using t-test. We achieve our best performance of macro F-score (averaged across all topics) of 76.45% using the weighted K-NN classifier. This tops the current state-of-the-art score of 74.44% on the same dataset.
| Original language | English |
|---|---|
| Pages (from-to) | 29-40 |
| Number of pages | 12 |
| Journal | Information Fusion |
| Volume | 67 |
| DOIs | |
| State | Published - Mar 2021 |
| Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2020
Keywords
- K-NN variants
- Sentiment analysis
- Stance detection
- Support vector machines
- Top-k
ASJC Scopus subject areas
- Software
- Signal Processing
- Information Systems
- Hardware and Architecture
Fingerprint
Dive into the research topics of 'A novel approach to stance detection in social media tweets by fusing ranked lists and sentiments'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver