On Stricter Reachable Repetitiveness Measures
Abstract
The size b of the smallest bidirectional macro scheme, which is arguably the most general copy-paste scheme to generate a given sequence, is considered to be the strictest reachable measure of repetitiveness. It is strictly lower-bounded by measures like gamma and delta, which are known or believed to be unreachable and to capture the entropy of repetitiveness. In this paper we study another sequence generation mechanism, namely compositions of a morphism. We show that these form another plausible mechanism to characterize repetitive sequences and define NU-systems, which combine such a mechanism with macro schemes. We show that the size nu <= b of the smallest NU-system is reachable and can be o(delta) for some string families, thereby implying that the limit of compressibility of repetitive sequences can be even smaller than previously thought. We also derive several other results characterizing nu.
Más información
Título según WOS: | ID WOS:001344662100016 Not found in local WOS DB |
Título de la Revista: | STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2020 |
Volumen: | 12944 |
Editorial: | SPRINGER INTERNATIONAL PUBLISHING AG |
Fecha de publicación: | 2021 |
Página de inicio: | 193 |
Página final: | 206 |
DOI: |
10.1007/978-3-030-86692-1_16 |
Notas: | ISI |