Linguistic Laws in Speech: The Case of Catalan and Spanish

Hernandez-Fernandez, Antoni; Torre, Ivan G.; Garrido, Juan-Maria; Lacasa, Lucas

Abstract

In this work we consider Glissando Corpus-an oral corpus of Catalan and Spanish-and empirically analyze the presence of the four classical linguistic laws (Zipf's law, Herdan's law, Brevity law, and Menzerath-Altmann's law) in oral communication, and further complement this with the analysis of two recently formulated laws: lognormality law and size-rank law. By aligning the acoustic signal of speech production with the speech transcriptions, we are able to measure and compare the agreement of each of these laws when measured in both physical and symbolic units. Our results show that these six laws are recovered in both languages but considerably more emphatically so when these are examined in physical units, hence reinforcing the so-called 'physical hypothesis' according to which linguistic laws might indeed have a physical origin and the patterns recovered in written texts would, therefore, be just a byproduct of the regularities already present in the acoustic signals of oral communication.

Más información

Título según WOS: ID WOS:000507375900021 Not found in local WOS DB
Título de la Revista: ENTROPY
Volumen: 21
Número: 12
Editorial: Basel
Fecha de publicación: 2019
DOI:

10.3390/e21121153

Notas: ISI