The Detection of Indonesian Hoax Content about COVID-19 Vaccine using Naive Bayes Multinomial Method

  • Annisa Puspa Kirana Information Technology Department, State Polytechnic of Malang, Indonesia https://orcid.org/0000-0002-4622-1445
  • Gunawan Budi Prasetyo Information Technology Department, State Polytechnic of Malang, Indonesia
  • Ela Widya Lestari Information Technology Department, State Polytechnic of Malang, Indonesia
Keywords: text mining, information gain, naive Bayes, multinomial, Twitter, covid-19

Abstract

One media currently famously used in all worlds is twitter. The ease of dissemination and the exchange of information is accelerating. Every day, millions of tweets exist using various information, such as politics, technology, sports, academics, and others. The information that is widely found is about COVID-19-19 nowadays. The information on Twitter is not entirely accurate or according to facts and needs to be proven true. Therefore, this study aims to try to detect the information contained in Indonesia using methods of Naive Bayes Multinomial by using the Information Gain feature selection. The classification process is carried out by crawling tweets, preprocessing, then using feature selection, namely Information Gain, and classification using the Multinomial Naive Bayes method. Meanwhile, the validation needs in this study use k-fold cross-validation where the existing dataset is divided into training and testing data that will be tested with a confusion matrix. Researchers have carried out the confusion matrix testing process using 720 datasets divided as train data & the test data received an average accuracy value of 81.39%, precision of 80.36%, and recall of 79.73%. The highest accuracy is using k-fold two. The accuracy value reaches 88.8%, the precision value is 79.1%, and the recall value is 86.3%. The lowest accuracy was obtained on the 8th k-fold with an accuracy value of 73.6%, precision 75.4%, and recall 86.9%.

Downloads

Download data is not yet available.
Published
2023-02-24
How to Cite
[1]
A. P. Kirana, G. B. Prasetyo, and E. W. Lestari, “The Detection of Indonesian Hoax Content about COVID-19 Vaccine using Naive Bayes Multinomial Method”, Indones.J.electronic.electromed.med.inf, vol. 5, no. 1, Feb. 2023.
Section
Research Article