Call for Papers November 2024 | Email: editor@uijrt.com | ISSN: 2582-6832 | Google Scholar | Impact Factor: 5.794

Paper Details
Subject:
Paper ID: UIJRTV4I50004
Volume: 04
Issue: 05
Pages: 27-39
Date: March 2023
ISSN: 2582-6832
Statistics:

Loading

  Full Text [PDF]

Cite this
Ibrahim N. Awol and Sosina M. Gashaw, 2023. Lexicon-Stance Based Amharic Fake News Detection. United International Journal for Research & Technology (UIJRT), 4(5), pp27-39.
Abstract
Due to the noisy nature of social media content, and the rapid propagation of fake news, the identification and detection of false information become a challenging problem. In recent years, several studies propose to use content based text representation approaches to automatically detect fake news as early as possible before being propagated into social Medias. However, fake news has a different stylistic nature of writing, and attempting to capture its various unique features may help us improve detection rather than focusing solely on text representation. In this study, we propose to hybrid stance-based features (Page score, headline to article similarity and headline to headline similarities) with the previous text representation lexicon based detection. To build the detection model, we used three machine learning algorithms: Logistic regression, Passive Aggressive and Decision tree. The proposed approach is evaluated using a newly collected Amharic fake news dataset from Facebook. Our experiment results show that the hybrid features (lexicon-stance) are capable of improving the previous lexicon based detection results up to 4.1% accuracy, 3% precision, 4% recall and 4% F1-score using passive aggressive algorithm. In addition, the hybrid feature improves the area under curve from 0.982 to 0.995 by reducing the false positive rate by 4% and improved the true positive rate by 4.4%. Furthermore, we found that Page score, out of the proposed stance features included, has contributed the most to the improvement of lexicon-based fake news detection.

Keywords: Content-based detection, Stance-based detection, Lexicon-based detection, text representation approach, Fake news.


Related Papers