ENHANCING SPAM COMMENT DETECTION ON SOCIAL MEDIA WITH EMOJI FEATURE AND POST-COMMENT PAIRS APPROACH USING ENSEMBLE METHODS OF MACHINE LEARNING

Antonius Rachmat Chrismanto and Anny Kartika Sari and Yohanes Suyanto (2023) ENHANCING SPAM COMMENT DETECTION ON SOCIAL MEDIA WITH EMOJI FEATURE AND POST-COMMENT PAIRS APPROACH USING ENSEMBLE METHODS OF MACHINE LEARNING. Journal of IEEE Access, 11. pp. 80246-80265. ISSN 2169-3536

[img] Text (Artikel Jurnal)
Enhancing Spam Comment Detection on Social Media.pdf - Published Version

Download (1MB)

Abstract

Every time a well-known public figure posts something on social media, it encourages many users to comment. Unfortunately, not all comments are relevant to the post. Some are spam comments which can disrupt the overall flow of information. This research employed two strategies to address issues in text spam detection on social media. The first strategy was utilizing emojis that had been frequently discarded in many studies. In fact, many social media users use emojis to convey their intentions. The second strategy was utilizing stacked post-comment pairs, which was different from many spam detection systems that solely focused on comment-only data. The post-comment pairs were required to detect whether a comment was relevant (not spam) or spam based on the post context. This research used the SpamID-Pair dataset derived from social media for Indonesian spam comment detection. After a comprehensive investigation, the emoji-text feature, the stacked post-comment pairs, and ensemble voting could boost detection performance (in terms of accuracy and F1). Adding manual features also improved detection performance. Based on the experiment, the best stand-alone methods for spam comment detection are the SVM (RBF kernel) and the soft voting ensemble method for the best average performance.

Item Type: Article
Uncontrolled Keywords: spam detection, ensemble method, emoji feature, post-comment pair, social media
Subjects: Q Ilmu Pengetahuan > Matematika > Komputer Elektronik. Ilmu Komputer
Depositing User: Admin Repository
Date Deposited: 05 Sep 2024 04:21
Last Modified: 05 Sep 2024 04:21
URI: http://katalog.ukdw.ac.id/id/eprint/9073

Actions (login required)

View Item View Item