Development of a neural machine translation model optimized with BERT for translation from Quechua to Spanish
DOI:
https://doi.org/10.18687/LACCEI2024.1.1.1636
Keywords:
Quechua, Neural Machine Translation, Low-resource Language, BERT, Transformer
Abstract
Quechua, a Native American language spoken by over 3 million people in Peru, plays a significant cultural role but is at risk of decline due to limited resources and the dominance of Spanish. This paper proposes a Quechua-to-Spanish neural machine translation (NMT) model using a Transformer-based architecture and a semi-supervised approach known as LMfusion. The model is trained on parallel datasets, and PRPE morphological segmentation is employed during preprocessing. Initial results show promise, and integrating the QuBERT language model is expected to enhance translation quality. Additionally, a user-friendly web interface has been developed to facilitate Quechua-Spanish translation. This research aims to address the challenges of translating a low-resource language like Quechua and contribute to improved communication between Quechua and Spanish speakers, preserving cultural heritage and facilitating equitable access to information and services.
Published
2024-07-27
Issue
Section
Articles
License
Copyright 2024 LACCEI

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
How to cite
Sulla Torres, J. A., Cueva Medina, B., & Tuco Casquino, G. F. (2024). Development of a neural machine translation model optimized with BERT for translation from Quechua to Spanish. LACCEI, 1(10). https://doi.org/10.18687/LACCEI2024.1.1.1636