On Stricter Reachable Repetitiveness Measures

Lecroq, T; Touzet, H

Abstract

The size b of the smallest bidirectional macro scheme, which is arguably the most general copy-paste scheme to generate a given sequence, is considered to be the strictest reachable measure of repetitiveness. It is strictly lower-bounded by measures like gamma and delta, which are known or believed to be unreachable and to capture the entropy of repetitiveness. In this paper we study another sequence generation mechanism, namely compositions of a morphism. We show that these form another plausible mechanism to characterize repetitive sequences and define NU-systems, which combine such a mechanism with macro schemes. We show that the size nu <= b of the smallest NU-system is reachable and can be o(delta) for some string families, thereby implying that the limit of compressibility of repetitive sequences can be even smaller than previously thought. We also derive several other results characterizing nu.

Más información

Título según WOS: ID WOS:001344662100016 Not found in local WOS DB
Título de la Revista: STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2020
Volumen: 12944
Editorial: SPRINGER INTERNATIONAL PUBLISHING AG
Fecha de publicación: 2021
Página de inicio: 193
Página final: 206
DOI:

10.1007/978-3-030-86692-1_16

Notas: ISI