Rank/select on dynamic compressed sequences and applications

González R.; Navarro G.

Abstract

Operations r a n k and s e l e c t over a sequence of symbols have many applications to the design of succinct and compressed data structures managing text collections, structured text, binary relations, trees, graphs, and so on. We are interested in the case where the collections can be updated via insertions and deletions of symbols. Two current solutions stand out as the best in the tradeoff of space versus time (when considering all the operations). One solution, by Mäkinen and Navarro, achieves compressed space (i.e., n H 0 + o (n log s) bits) and O (log n log s) worst-case time for all the operations, where n is the sequence length, s is the alphabet size, and H 0 is the zero-order entropy of the sequence. The other solution, by Lee and Park, achieves O (log n (1 + frac(log s, log log n))) amortized time and uncompressed space, i.e. n log 2 s + O (n) + o (n log s) bits. In this paper we show that the best of both worlds can be achieved. We combine the solutions to obtain n H 0 + o (n log s) bits of space and O (log n (1 + frac(log s, log log n))) worst-case time for all the operations. Apart from the best current solution to the problem, we obtain several byproducts of independent interest applicable to partial sums, text indexes, suffix arrays, the Burrows-Wheeler transform, and others. © 2009 Elsevier B.V. All rights reserved.

Más información

Título según WOS: Rank/select on dynamic compressed sequences and applications
Título según SCOPUS: Rank/select on dynamic compressed sequences and applications
Título de la Revista: THEORETICAL COMPUTER SCIENCE
Volumen: 410
Número: 43
Editorial: ELSEVIER SCIENCE BV
Fecha de publicación: 2009
Página de inicio: 4414
Página final: 4422
Idioma: English
URL: http://linkinghub.elsevier.com/retrieve/pii/S0304397509004836
DOI:

10.1016/j.tcs.2009.07.022

Notas: ISI, SCOPUS