Diagnosing the performance of machine learning models for phishing website detection: A literature review.

Authors

  • Frank Luis Santa Cruz-Rufasto Universidad Tecnológica Del Perú Utp - (Pe), Perú
  • Christian Abraham Dios-Castillo Universidad Tecnológica Del Perú Utp - (Pe), Perú

DOI:

https://doi.org/10.18687/LACCEI2025.1.1.274

Keywords:

Phishing detection, Machine Learning, Random Forest, Cybersecurity, Precision metrics.

Abstract

Detecting phishing websites using Machine Learning (ML) techniques is a key approach in modern cybersecurity, with models such as Random Forest reaching accuracy levels close to 99%, followed by Support Vector Machine, Decision Tree and Logistic Regression. However, what is the level of accuracy of ML techniques in this task and what are the key factors affecting their accuracy and effectiveness? The results highlight that the quality and diversity of the training data, together with metrics such as Accuracy, Precision and Recall, are determinants in the performance of the models. In addition, the ability of algorithms to adapt to dynamic attack patterns is crucial. This study, based on a systematic review with the PRISMA statement, analyzed 43 articles selected from more than 4,600 initials, revealing the importance of developing computationally efficient methods that maintain high levels of accuracy to address growing digital threats.

Downloads

Published

2025-04-09

How to Cite

Santa Cruz-Rufasto, F. L., & Dios-Castillo, C. A. (2025). Diagnosing the performance of machine learning models for phishing website detection: A literature review. LACCEI, 1(12). https://doi.org/10.18687/LACCEI2025.1.1.274