HOLZ: High-Order Entropy Encoding of Lempel-Ziv Factor Distances

Abstract

We propose a new representation of the offsets of the Lempel-Ziv (LZ) factorization based on the co-lexicographic order of the text's prefixes. The selected offsets tend to approach the k-th order empirical entropy. Our evaluations show that this choice is superior to the rightmost and bit-optimal LZ parsings on datasets with small high-order entropy.

Más información

Título según WOS: HOLZ: High-Order Entropy Encoding of Lempel-Ziv Factor Distances
Título según SCOPUS: HOLZ: High-Order Entropy Encoding of Lempel-Ziv Factor Distances
Título de la Revista: Data Compression Conference Proceedings
Volumen: 2022-
Editorial: Institute of Electrical and Electronics Engineers Inc.
Fecha de publicación: 2022
Página final: 92
Idioma: English
DOI:

10.1109/DCC52660.2022.00016

Notas: ISI, SCOPUS