Table of Contents
Fetching ...

PhishGuard: A Multi-Layered Ensemble Model for Optimal Phishing Website Detection

Md Sultanul Islam Ovi, Md. Hasibur Rahman, Mohammad Arif Hossain

TL;DR

PhishGuard, an optimal custom ensemble model designed to improve phishing site detection, is introduced, demonstrating that optimization methods in conjunction with ensemble learning greatly improve phishing detection performance.

Abstract

Phishing attacks are a growing cybersecurity threat, leveraging deceptive techniques to steal sensitive information through malicious websites. To combat these attacks, this paper introduces PhishGuard, an optimal custom ensemble model designed to improve phishing site detection. The model combines multiple machine learning classifiers, including Random Forest, Gradient Boosting, CatBoost, and XGBoost, to enhance detection accuracy. Through advanced feature selection methods such as SelectKBest and RFECV, and optimizations like hyperparameter tuning and data balancing, the model was trained and evaluated on four publicly available datasets. PhishGuard outperformed state-of-the-art models, achieving a detection accuracy of 99.05% on one of the datasets, with similarly high results across other datasets. This research demonstrates that optimization methods in conjunction with ensemble learning greatly improve phishing detection performance.

PhishGuard: A Multi-Layered Ensemble Model for Optimal Phishing Website Detection

TL;DR

PhishGuard, an optimal custom ensemble model designed to improve phishing site detection, is introduced, demonstrating that optimization methods in conjunction with ensemble learning greatly improve phishing detection performance.

Abstract

Phishing attacks are a growing cybersecurity threat, leveraging deceptive techniques to steal sensitive information through malicious websites. To combat these attacks, this paper introduces PhishGuard, an optimal custom ensemble model designed to improve phishing site detection. The model combines multiple machine learning classifiers, including Random Forest, Gradient Boosting, CatBoost, and XGBoost, to enhance detection accuracy. Through advanced feature selection methods such as SelectKBest and RFECV, and optimizations like hyperparameter tuning and data balancing, the model was trained and evaluated on four publicly available datasets. PhishGuard outperformed state-of-the-art models, achieving a detection accuracy of 99.05% on one of the datasets, with similarly high results across other datasets. This research demonstrates that optimization methods in conjunction with ensemble learning greatly improve phishing detection performance.
Paper Structure (18 sections, 3 figures, 3 tables, 1 algorithm)

This paper contains 18 sections, 3 figures, 3 tables, 1 algorithm.

Figures (3)

  • Figure 1: Overview of the PhishGuard development process
  • Figure 2: Comparative analysis of PhishGuard against advanced algorithms al2021optimizedsarasjati2022comparativeabdul2023analysis demonstrates that PhishGuard surpasses all existing approaches on Dataset 01.
  • Figure 3: Comparative analysis of PhishGuard against advanced algorithms khan2020phishingal2021optimizedsarasjati2022comparative demonstrates that PhishGuard surpasses all existing approaches on Dataset 02.