Self-indexed text compression using straight-line programs

Claude F.; Navarro G.

Keywords: binary, program, line, science, text, computer, representation, relation, grammar, Straight, compressions, Sub-strings

Abstract

Straight-line programs (SLPs) offer powerful text compression by representing a text T[1,u] in terms of a restricted context-free grammar of n rules, so that T can be recovered in O(u) time. However, the problem of operating the grammar in compressed form has not been studied much. We present a grammar representation whose size is of the same order of that of a plain SLP representation, and can answer other queries apart from expanding nonterminals. This can be of independent interest. We then extend it to achieve the first grammar representation able of extracting text substrings, and of searching the text for patterns, in time o(n). We also give byproducts on representing binary relations. © 2009 Springer Berlin Heidelberg.

Más información

Título de la Revista: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen: 5734
Editorial: Society of Laparoendoscopic Surgeons
Fecha de publicación: 2009
Página de inicio: 235
Página final: 246
URL: http://www.scopus.com/inward/record.url?eid=2-s2.0-70349311543&partnerID=q2rCbXpz