Compressed Suffix Trees for Repetitive Texts

Abstract

We design a new compressed suffix tree specifically tailored to highly repetitive text collections. This is particularly useful for sequence analysis on large collections of genomes of the close species. We build on an existing compressed suffix tree that applies statistical compression, and modify it so that it works on the grammar-compressed version of the longest common prefix array, whose differential version inherits much of the repetitiveness of the text.

Más información

Título según WOS: Compressed Suffix Trees for Repetitive Texts
Título de la Revista: SOCIAL COMPUTING AND SOCIAL MEDIA, SCSM 2025, PT II
Volumen: 7608
Editorial: SPRINGER INTERNATIONAL PUBLISHING AG
Fecha de publicación: 2012
Página de inicio: 30
Página final: 41
Notas: ISI