PARTICLE SWARM OPTIMIZATION IN TWIN SUPPORT VECTOR MACHINE TO CLASSIFY FAKE NEWS

Junita Amalia(1), Novita Enjelia Hutapea(2), Merika Manurung(3), Tiara Octavia Situmorang(4),


(1) Institut Teknologi Del
(2) Institut Teknologi Del
(3) Institut Teknologi Del
(4) Institut Teknologi Del
Corresponding Author

Abstract


Slang word is a complex word, difficult and cannot be ignored. Slang is used by certain circles and is limited so that not everyone knows the meaning of the conversations carried out by group members. Based on previous research that has been done, namely making slang using a manual process that requires quite a lot of time to collect slang words, so that our research aims to collect slang words by applying Deep Learning, namely Natural Language Processing using the word embedding FastText method to speed up the collection process. slang words. The author implements the techniques and algorithms that have been designed in the previous stage. This stage will ensure that the processes carried out in the research can be carried out in accordance with the theories that support the research. From the combined data between YouTube comments and the Indonesian dictionary, itwas found that 421 words are slang words. These slang words are obtained by means of the process of lookingfor word similarities (similarity words) between YouTube comments and Indonesian dictionaries. In building a slang dictionary from the youtube comment dataset with a pre-trained FastText model, a preprocessing process and normalization is carried out. After the normalization process was carried out to get normal words from each slang candidate, the results of the slang dictionary were 278 rows consisting of four columns, namely the lexical column, threshold, slang candidate, and normal words using a threshold of 0.05, 0.1 and 0.2 Keywords: Slang, Pre-trained FastText, NLP, Similarity Word

References


C. Juditha, “Interaksi Komunikasi Hoax di Media Sosial serta Antisipasinya,” Jurnal Pekommas, vol. 3, 2018.

R. PEBI, “Analisis Performansi Algoritma Naïve Bayes dan Support Vector Machine untuk Deteksi Berita Hoax Berbahasa Inggris,” Telkom University, 2020. [Online]. Available: https://repository.telkomuniversity.ac.id/pustaka/158821/analisis-performansi-algoritma-na-ve-bayes-dan-support-vector-machine-untuk-deteksi-berita-hoax-berbahasa-inggris.html.

K. Li, G. Zhou, Y. Yang, F. Li dan Z. Jiao, “A novel prediction method for favorable reservoir of oil field based on grey wolf optimizer and Twin Support Vector Machine,” Journal of Petroleum Science and Engineering, vol. 189, 2020.

K. K. Bharti dan S. Pandey, “Fake account detection in twitter using logistic regression with Particle Swarm Optimization,” Application of soft computing, 2021.

N. Surantha, T. F. Lesmana dan S. M. Isa, “Sleep stage classification using extreme learning machine and Particle Swarm Optimization for healthcare big data,” Journal of Big Data, 2021.

B. PRATAMA, “HOAX DAN FAKE NEWS DALAM UU-ITE,” Binus University, 9 August 2018. [Online]. Available: https://business-law.binus.ac.id/2018/08/09/hoax-dan-fake-news-dalam-uu-ite/.

G. Nurvinda, “Pentingnya Preprocessing dalam Pengolahan Data Statistik,” DQLab, 12 April 2021. [Online]. Available: https://www.dqlab.id/pentingnya-preprocessing-dalam-pengolahan-data-statistik#:~:text=Preprocessing%20data%20sangat%20penting%20karena,gunakan.

R. Tineges, “Tahapan Text Preprocessing dalam Teknik Pengolahan Data,” DQLab, 17 June 2021. [Online]. Available: https://www.dqlab.id/tahapan-text-preprocessing-dalam-teknik-pengolahan-data.

Jayadeva, R. Khemchandani dan S. Chandra, Twin Support Vector Machines, 2017.

E. Hazan, Optimization for Machine learning, 2019.

S. Khomsah, “Naive Bayes Classifier Optimization on Sentiment Analysis of Hotel Reviews,” Jurnal Penelitian Pos dan Informatika, 2020.

A. P. Engelbrecht, Computationa Intelligece.

A. P. Piotrowski, J. J. Napiorkowski dan A. E. Piotrowska, “Population size in Particle Swarm Optimization,” Swarm and Evolutionary Computation, 2020.

M. Santos, “Towards Data Science,” 18 May 2020. [Online]. Available: https://towardsdatascience.com/explaining-precision-vs-recall-to-everyone-295d4848edaf.

“Teknologi Big Data,” 29 May 2020. [Online]. Available: https://www.teknologi-bigdata.com/2020/05/validitas-rapid-test-covid-19-akurasi-accuracy-vs-F1-Score.html#:~:text=F1%2Dscore%20digunakan%20ketika%20False,True%20Positive%20dan%20True%20Negative.


Full Text: PDF (Bahasa Indonesia)

Article Metrics

Abstract View : 273 times
PDF (Bahasa Indonesia) Download : 198 times

DOI: 10.58486/jsr.v6i2.176

Refbacks

  • There are currently no refbacks.