Faster entropy-bounded compressed suffix trees
Abstract
Suffix trees are among the most important data structures in stringology, with a number of applications in flourishing areas like bioinformatics. Their main problem is space usage, which has triggered much research striving for compressed representations that are still functional. A smaller suffix tree representation could fit in a faster memory, outweighing by far the theoretical slowdown brought by the space reduction. We present a novel compressed suffix tree, which is the first achieving at the same time sublogarithmic complexity for the operations, and space usage that asymptotically goes to zero as the entropy of the text does. The main ideas in our development are compressing the longest common prefix information, totally getting rid of the suffix tree topology, and expressing all the suffix tree operations using range minimum queries and a novel primitive called next/previous smaller value in a sequence. Our solutions to those operations are of independent interest. © 2009 Elsevier B.V. All rights reserved.
Más información
Título según WOS: | Faster entropy-bounded compressed suffix trees |
Título según SCOPUS: | Faster entropy-bounded compressed suffix trees |
Título de la Revista: | THEORETICAL COMPUTER SCIENCE |
Volumen: | 410 |
Número: | 51 |
Editorial: | ELSEVIER SCIENCE BV |
Fecha de publicación: | 2009 |
Página de inicio: | 5354 |
Página final: | 5364 |
Idioma: | English |
URL: | http://linkinghub.elsevier.com/retrieve/pii/S0304397509006392 |
DOI: |
10.1016/j.tcs.2009.09.012 |
Notas: | ISI, SCOPUS |