The community structure of word co-occurrence networks: Experiments with languages from the Americas

Vera, Javier; Palma, Wenceslao

Abstract

We study a set of algorithms to discover the community structure of networks for languages from the Americas. Our experiments are based on a parallel corpus which allows us to represent each language as a co-occurrence network. Four methods to calculate network modularity, as a measure of the quality of community structure, were used. We studied several aspects of the community structure of co-occurrence networks. First, we were able to construct the map of modularity variations across languages from the Americas. With this, we separated large groups of languages into low- and high-modularity families. We suggested also a strong influence of functional words on low-modularity languages. Finally, we found a strong relationship between word entropy values and modularity. Our approach is thus a simple network-based contribution to face data scarcity of languages which are in danger of disappearing. Copyright (C) 2021 EPLA

Más información

Título según WOS: The community structure of word co-occurrence networks: Experiments with languages from the Americas
Título de la Revista: EPL
Volumen: 134
Número: 5
Editorial: IOP PUBLISHING LTD
Fecha de publicación: 2021
DOI:

10.1209/0295-5075/134/58002

Notas: ISI