A Named Entity Recognition framework using Transformers to identify relevant clinical findings from mammographic radiological reports

Godoy, Eduardo; Chabert, Steren; Querales, Marvin; Sotelo, Julio; Parra, Denis; Fernandez, Carlos; Mellado, Diego; Veloz, Alejandro; Lever, Scarlett; Pardo, Fabian; Bertini, Ayleen; Molina, Yomar; Diaz, Claudia; Ferreira, Rodrigo; Salas, Rodrigo; et. al.

Abstract

Detecting and extracting findings in a radiological report is crucial for text mining tasks in several applications. In this case, a labeled process for the image associated with the radiological report in mammography and Spanish context for a computer vision model is required. This paper shows the methodology and process generated for this goal. This paper presents a Named Entity Recognition (NER) approach based on a transformer deep learning model, using a labeled corpus and fine-tuning process to find three concepts that compose a typical finding in a mammographic radiological report: laterality, location, and the finding. We add another concept in the labeled process, the negation, necessary to identify falses positive inside the text that writes the radiologist. Our model achieves an F1 score of 88.24% classifying the three principal concepts for a finding, product of the labeled and fine-tuning process. The results presented here will be used as input for future training work on a computer vision model.

Más información

Título según WOS: ID WOS:001002734500032 Not found in local WOS DB
Título de la Revista: 18TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS
Volumen: 12567
Editorial: SPIE-INT SOC OPTICAL ENGINEERING
Fecha de publicación: 2023
DOI:

10.1117/12.2670228

Notas: ISI