On Stricter Reachable Repetitiveness Measures

Lecroq, T; Touzet, H

Abstract

The size b of the smallest bidirectional macro scheme, which is arguably the most general copy-paste scheme to generate a given sequence, is considered to be the strictest reachable measure of repetitiveness. It is strictly lower-bounded by measures like γ and δ, which are known or believed to be unreachable and to capture the entropy of repetitiveness. In this paper we study another sequence generation mechanism, namely compositions of a morphism. We show that these form another plausible mechanism to characterize repetitive sequences and define NU-systems, which combine such a mechanism with macro schemes. We show that the size ν≤ b of the smallest NU-system is reachable and can be o(δ) for some string families, thereby implying that the limit of compressibility of repetitive sequences can be even smaller than previously thought. We also derive several other results characterizing ν.

Más información

Título según WOS: ID WOS:001344662100016 Not found in local WOS DB
Título según SCOPUS: On Stricter Reachable Repetitiveness Measures
Título de la Revista: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen: 12944
Editorial: Springer Science and Business Media Deutschland GmbH
Fecha de publicación: 2021
Página final: 206
Idioma: English
DOI:

10.1007/978-3-030-86692-1_16

Notas: ISI, SCOPUS