Tarantula -> spider -> animal: second level hypernymy discovery based on distributional similarity methods
Abstract
Automatic hypernymy discovery continues to present challenges for natural language processing. Polysemous nouns are linked to more than one hypernym and can therefore cause structural damage on a lexical taxonomy. For instance, the Spanish noun tarantula ('tarantula') is a hyponym of arana ('spider'), but this is also a polysemous noun, as it means 'chandelier' as well. It is thus necessary to determine the next hypernym in the chain, that is animal ('animal') or artefacto ('artifact'). In this paper we explore methods to solve this problem using a similarity measure that uses verb-noun co-occurrence as a predictor variable. Best results (84% success) are obtained with a simple method that only measures co-occurrence, irrespective of any syntactic information.
Más información
Título según WOS: | Tarantula -> spider -> animal: second level hypernymy discovery based on distributional similarity methods |
Título de la Revista: | PROCESAMIENTO DEL LENGUAJE NATURAL |
Número: | 64 |
Editorial: | SOC ESPANOLA PROCESAMIENTO LENGUAJE NATURAL-SEPLN |
Fecha de publicación: | 2020 |
Página de inicio: | 29 |
Página final: | 36 |
DOI: |
10.26342/2020-64-3 |
Notas: | ISI |