Explainable Machine Learning for Credit Card Default Prediction Using Web-Scraped Financial Data: A Case Study in the Peruvian Banking Sector

Authors

  • Hilario Aradiel Castañeda Universidad Nacional del Callao
  • Guillermo Antonio Mas Azahuanche Universidad Nacional del Callao
  • Ruben Dario Mendoza Arenas Universidad Nacional del Callao
  • Omar Tupac Amaru Castillo Paredes Universidad Nacional del Callao
  • Artemio Ruben Reinoso Palacios Universidad Nacional del Callao
  • Marisol Paola Delgado Baltazar Universidad Nacional del Callao
  • Raphael Santiago Mendoza Delgado Universidad Nacional del Callao

DOI:

https://doi.org/10.18687/LEIRD2025.1.1.451

Keywords:

Default, credit cards, web scraping, financial prediction, credit risk.

Abstract

Credit card default represents a critical challenge for Peruvian banking due to its direct impact on the profitability and sustainability of financial institutions. In this context, this study aimed to develop an explainable machine learning-based predictive model to anticipate credit default risk using financial data obtained through web scraping from official portals of institutions such as BBVA, BCP, Interbank, and Scotiabank. The methodology involved the automated collection of monthly interest rate data by credit type and the processing of key credit variables, including credit line utilization, payment history, monthly income, and card usage frequency. Several machine learning models were trained and evaluated, with LightGBM outperforming the others by achieving an accuracy of 89.4%, a recall of 86.7%, and an area under the ROC curve of 0.94. To ensure model interpretability, SHAP (SHapley Additive exPlanations) was applied, identifying high credit usage and accumulated delinquency as the most impactful predictors. The findings suggest that the integration of explainable models can significantly enhance decision-making in credit risk management. Their adoption is recommended as a strategic support tool for real-time financial profile evaluation

Downloads

Published

2025-12-09

Issue

Section

Articles

How to Cite

Aradiel Castañeda, H., Mas Azahuanche, G. A., Mendoza Arenas, R. D., Castillo Paredes, O. T. A., Reinoso Palacios, A. R., Delgado Baltazar, M. P., & Mendoza Delgado, R. S. (2025). Explainable Machine Learning for Credit Card Default Prediction Using Web-Scraped Financial Data: A Case Study in the Peruvian Banking Sector. LACCEI, 2(13). https://doi.org/10.18687/LEIRD2025.1.1.451

Most read articles by the same author(s)