Identifying Unconvincing User on Social Media with Limited Features

Yufei Li, Tianhao Chen, Patrick Cheong Iao Pang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Unconvincing users on social media are responsible for many malicious online activities. Therefore, developing classifiers to identify them is crucial. Due to the trade-off between the crawling costs of features and sample size, it remains challenging to distinguish unconvincing users from normal human users using limited features with low API call cost. To address the challenge, we present a case study where we develop machine learning-based classifiers with limited features to specifically identify a predefined category of unconvincing users: fake Twitter followers. Four models, namely Random Forest (RF), XGBoost, LightGBM, and Support Vector Machine (SVM) are employed and trained using a tabular dataset consisting of only 19 low-cost features. Experimental results show that the LightGBM classifier achieves the highest accuracy at 98.7%. Furthermore, feature importance analysis is carried out on LightGBM, and both SHAP analysis and Information Gain (IG) results indicate that the ratio between friends and followers is the most important feature.

Original languageEnglish
Title of host publication2023 9th International Conference on Computer and Communications, ICCC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2217-2221
Number of pages5
ISBN (Electronic)9798350317251
DOIs
Publication statusPublished - 2023
Event9th International Conference on Computer and Communications, ICCC 2023 - Hybrid, Chengdu, China
Duration: 8 Dec 202311 Dec 2023

Publication series

Name2023 9th International Conference on Computer and Communications, ICCC 2023

Conference

Conference9th International Conference on Computer and Communications, ICCC 2023
Country/TerritoryChina
CityHybrid, Chengdu
Period8/12/2311/12/23

Keywords

  • bot detection
  • fake Twitter followers
  • machine learning
  • social media analysis

Fingerprint

Dive into the research topics of 'Identifying Unconvincing User on Social Media with Limited Features'. Together they form a unique fingerprint.

Cite this