Exploring Scientific Discourses on Women in Engineering and Sustainability through Web Scraping and LDA Analysis with R (2020–2025)
DOI:
https://doi.org/10.18687/LEIRD2025.1.1.844
Keywords:
Women in engineering, sustainability, text mining, web scraping, LDA analysis, gender equality.
Abstract
This study analyzes recent scientific discourses on women’s participation in engineering and sustainability using a text mining approach applied to open-access publications. A total of 766 scientific articles published between 2020 and 2025 were collected from the PLOS ONE database using web scraping techniques in R. Latent Dirichlet Allocation (LDA) topic modeling was then applied to their abstracts, identifying five dominant discursive axes: reproductive health, gender-based violence, STEM education, maternal health, and community participation. The results reveal narrative patterns that link gender challenges to sustainability in scientific, educational, and social contexts. This work provides evidence to inform the design of inclusive policies and encourages debate on gender equality in disciplines that are strategic for sustainable development.
Published
2025-12-12
Issue
Section
Articles
License
Copyright (c) 2025 LEIRD

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
How to Cite
Murcia Zorrilla, C. P., & Sinisterra Diaz, M. M. (2025). Exploring Scientific Discourses on Women in Engineering and Sustainability through Web Scraping and LDA Analysis with R (2020–2025). LACCEI, 2(13). https://doi.org/10.18687/LEIRD2025.1.1.844