A sequence labelling approach for automatic analysis of ello: tagging pronouns, antecedents, and connective phrases

Parodi, Giovanni; Evans, Richard; Le An Ha; Mitkov, Ruslan; Julio Vergara, Cristobal Jesus; Ignacio Olivares-Lopez, Raul

Abstract

Encapsulators are linguistic units which establish coherent referential connections to the preceding discourse in a text. In this paper, we address the challenge of automatically analysing the pronominal encapsulator ello in Spanish text. Our method identifies, for each occurrence, the antecedent of the pronoun (including its grammatical type), the connective phrase which combines with the pronoun to express a discourse relation linking the antecedent text segment to the following text segment, and the type of semantic relation expressed by the complex discourse marker formed by the connective phrase and pronoun. We describe our annotation of a corpus to inform the development of our method and to finetune an automatic analyser based on bidirectional encoder representation transformers. On testing our method, we find that it performs with greater accuracy than three baselines (0.76 for the resolution task), and sets a promising benchmark for the automatic annotation of occurrences of the pronoun ello, their antecedents, and the semantic relations between the two text segments linked by the connective in combination with the pronoun.

Más información

Título según WOS: ID WOS:000693369900001 Not found in local WOS DB
Título de la Revista: LANGUAGE RESOURCES AND EVALUATION
Volumen: 56
Número: 1
Editorial: Springer
Fecha de publicación: 2022
Página de inicio: 139
Página final: 164
DOI:

10.1007/s10579-021-09559-z

Notas: ISI