CoTranslate: A web-based tool for crowdsourcing high-quality sentence pair corpora

Carvallo, Andres; Jorquera, Ignacio; Aspillaga, Carlos

Abstract

CoTranslate is a web-based platform designed to efficiently label and review translations from language experts, with the aim of creating high-quality sentence-pair corpuses for training neural machine translation models. Utilizing Django backend and ReactJS frontend, the platform fosters collaboration among experts in translating and validating sentences. Focused on developing quality corpora, particularly for low-resource languages, CoTranslate addresses linguistic barriers and enhances translation quality. By streamlining the creation of robust training datasets, CoTranslate holds significant potential to impact the field of machine translation.

Más información

Título de la Revista: SOFTWAREX
Volumen: 23
Editorial: Elsevier
Fecha de publicación: 2023
URL: https://doi.org/10.1016/j.softx.2023.101508
DOI:

https://doi.org/10.1016/j.softx.2023.101508

Notas: ISI