Phishing website detection: How effective are deep learning‐based models and hyperparameter optimization?

Phishing websites are fraudulent websites that appear legitimate and trick unsuspecting users into interacting with them, stealing their valuable information. Because phishing attacks are a leading cause of data breach, different anti‐phishing solutions have been explored for cybersecurity managemen...

Full description

Saved in:
Bibliographic Details
Published in:Security and privacy Vol. 5; no. 6
Main Authors: Almousa, May, Zhang, Tianyang, Sarrafzadeh, Abdolhossein, Anwar, Mohd
Format: Journal Article
Language:English
Published: Boston, USA Wiley Periodicals, Inc 01.11.2022
Subjects:
ISSN:2475-6725, 2475-6725
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Phishing websites are fraudulent websites that appear legitimate and trick unsuspecting users into interacting with them, stealing their valuable information. Because phishing attacks are a leading cause of data breach, different anti‐phishing solutions have been explored for cybersecurity management including machine learning‐based technical approaches. However, there is a gap in understanding how robust deep learning‐based models together with hyperparameter optimization are for phishing website detection. In this vein, this study pursues the tasks of developing parsimonious deep learning models and hyperparameter optimization to achieve high accuracy and reproducible results for phishing website detection. This paper demonstrates a systematic process of building detection models based on three deep learning algorithm architectures (Long Short‐Term Memory‐based detection models, Fully Connected Deep Neural Network‐based detection models, and convolutional neural network‐based detection models) that are built and evaluated using four publicly available phishing website datasets, achieving the best accuracy of 97.37%. We also compared two different optimization algorithms for hyperparameter optimization: Grid Search and Genetic Algorithm, which contributed to 0.1%–1% increase in accuracy.
ISSN:2475-6725
2475-6725
DOI:10.1002/spy2.256