Identifying Unconvincing User on Social Media with Limited Features

Yufei Li, Tianhao Chen, Patrick Cheong Iao Pang

研究成果: Conference contribution同行評審

摘要

Unconvincing users on social media are responsible for many malicious online activities. Therefore, developing classifiers to identify them is crucial. Due to the trade-off between the crawling costs of features and sample size, it remains challenging to distinguish unconvincing users from normal human users using limited features with low API call cost. To address the challenge, we present a case study where we develop machine learning-based classifiers with limited features to specifically identify a predefined category of unconvincing users: fake Twitter followers. Four models, namely Random Forest (RF), XGBoost, LightGBM, and Support Vector Machine (SVM) are employed and trained using a tabular dataset consisting of only 19 low-cost features. Experimental results show that the LightGBM classifier achieves the highest accuracy at 98.7%. Furthermore, feature importance analysis is carried out on LightGBM, and both SHAP analysis and Information Gain (IG) results indicate that the ratio between friends and followers is the most important feature.

原文English
主出版物標題2023 9th International Conference on Computer and Communications, ICCC 2023
發行者Institute of Electrical and Electronics Engineers Inc.
頁面2217-2221
頁數5
ISBN(電子)9798350317251
DOIs
出版狀態Published - 2023
事件9th International Conference on Computer and Communications, ICCC 2023 - Hybrid, Chengdu, China
持續時間: 8 12月 202311 12月 2023

出版系列

名字2023 9th International Conference on Computer and Communications, ICCC 2023

Conference

Conference9th International Conference on Computer and Communications, ICCC 2023
國家/地區China
城市Hybrid, Chengdu
期間8/12/2311/12/23

指紋

深入研究「Identifying Unconvincing User on Social Media with Limited Features」主題。共同形成了獨特的指紋。

引用此