Tarantula -> spider -> animal: second level hypernymy discovery based on distributional similarity methods
Abstract
Automatic hypernymy discovery continues to present challenges for natural language processing. Polysemous nouns are linked to more than one hypernym and can therefore cause structural damage on a lexical taxonomy. For instance, the Spanish noun tarántula ('tarantula') is a hyponym of araña ('spider'), but this is also a polysemous noun, as it means 'chandelier' as well. It is thus necessary to determine the next hypernym in the chain, that is animal ('animal') or artefacto ('artifact'). In this paper we explore methods to solve this problem using a similarity measure that uses verb-noun co-occurrence as a predictor variable. Best results (84 % success) are obtained with a simple method that only measures co-occurrence, irrespective of any syntactic information.
Más información
| Título según WOS: | Tarantula -> spider -> animal: second level hypernymy discovery based on distributional similarity methods |
| Título según SCOPUS: | Tarantula -> spider -> animal: Second level hypernymy discovery based on distributional similarity methods |
| Título de la Revista: | Procesamiento del Lenguaje Natural |
| Volumen: | 64 |
| Editorial: | Sociedad Espanola para el Procesamiento del Lenguaje Natural |
| Fecha de publicación: | 2020 |
| Página final: | 36 |
| Idioma: | Spanish |
| DOI: |
10.26342/2020-64-3 |
| Notas: | ISI, SCOPUS |