Table of Contents
Fetching ...

Exploring social bots: A feature-based approach to improve bot detection in social networks

Salvador Lopez-Joya, Jose A. Diaz-Garcia, M. Dolores Ruiz, Maria J. Martin-Bautista

TL;DR

Through an exhaustive process of research, inference and feature selection, this paper is able to surpass the state of the art on several metrics using classical machine learning algorithms and identify the types of features that are most important in detecting automated accounts.

Abstract

The importance of social media in our daily lives has unfortunately led to an increase in the spread of misinformation, political messages and malicious links. One of the most popular ways of carrying out those activities is using automated accounts, also known as bots, which makes the detection of such accounts a necessity. This paper addresses that problem by investigating features based on the user account profile and its content, aiming to understand the relevance of each feature as a basis for improving future bot detectors. Through an exhaustive process of research, inference and feature selection, we are able to surpass the state of the art on several metrics using classical machine learning algorithms and identify the types of features that are most important in detecting automated accounts.

Exploring social bots: A feature-based approach to improve bot detection in social networks

TL;DR

Through an exhaustive process of research, inference and feature selection, this paper is able to surpass the state of the art on several metrics using classical machine learning algorithms and identify the types of features that are most important in detecting automated accounts.

Abstract

The importance of social media in our daily lives has unfortunately led to an increase in the spread of misinformation, political messages and malicious links. One of the most popular ways of carrying out those activities is using automated accounts, also known as bots, which makes the detection of such accounts a necessity. This paper addresses that problem by investigating features based on the user account profile and its content, aiming to understand the relevance of each feature as a basis for improving future bot detectors. Through an exhaustive process of research, inference and feature selection, we are able to surpass the state of the art on several metrics using classical machine learning algorithms and identify the types of features that are most important in detecting automated accounts.

Paper Structure

This paper contains 21 sections, 4 equations, 8 figures, 13 tables.

Figures (8)

  • Figure 1: Feature engineering diagram
  • Figure 2: Profile sidebar fill colour ranking
  • Figure 3: Experimentation flowchart
  • Figure 4: Feature ranking for all datasets
  • Figure 5: Results of classification depending on the number of features selected and the computation time using RF.
  • ...and 3 more figures