Diagnosing the performance of machine learning models for phishing website detection: A literature review.
DOI:
https://doi.org/10.18687/LACCEI2025.1.1.274Keywords:
Phishing detection, Machine Learning, Random Forest, Cybersecurity, Precision metrics.Abstract
Detecting phishing websites using Machine Learning (ML) techniques is a key approach in modern cybersecurity, with models such as Random Forest reaching accuracy levels close to 99%, followed by Support Vector Machine, Decision Tree and Logistic Regression. However, what is the level of accuracy of ML techniques in this task and what are the key factors affecting their accuracy and effectiveness? The results highlight that the quality and diversity of the training data, together with metrics such as Accuracy, Precision and Recall, are determinants in the performance of the models. In addition, the ability of algorithms to adapt to dynamic attack patterns is crucial. This study, based on a systematic review with the PRISMA statement, analyzed 43 articles selected from more than 4,600 initials, revealing the importance of developing computationally efficient methods that maintain high levels of accuracy to address growing digital threats.Downloads
Published
2025-04-09
Issue
Section
Articles
License
Copyright (c) 2025 LACCEI

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
How to Cite
Santa Cruz-Rufasto, F. L., & Dios-Castillo, C. A. (2025). Diagnosing the performance of machine learning models for phishing website detection: A literature review. LACCEI, 1(12). https://doi.org/10.18687/LACCEI2025.1.1.274