A compressed self-indexed representation of XML documents

Brisaboa, N.R.; Cerdeira-Pena, A.; Navarro, G.

Keywords: tree, libraries, queries, languages, wavelet, XML, XPath, digital, inverted, indices, markup

Abstract

This paper presents a structure, the XML Wavelet Tree (XWT), that represents any XML document in a compressed and self-indexed form. Any query or procedure that could be performed over the original document can therefore be performed more efficiently over the XWT representation, because it is shorter and has some indexing properties. In particular, XWT answers XPath queries more efficiently than the uncompressed version of the documents. XWT is also competitive with inverted indexes built over the XML document when both structures use the same space. © 2009 Springer.
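
To illustrate what "self-indexed" means here, the following is a minimal sketch of a generic textbook wavelet tree over a sequence of XML tokens. It is not the XWT itself: the abstract does not describe the XWT layout, so the class name `WaveletTree`, the balanced binary alphabet split, and the sample tokens are all assumptions made for illustration. Plain Python lists are also used in place of compact bitmaps, so nothing here is actually compressed. The sketch only shows the two properties the abstract relies on: the tree replaces the token sequence (`access` recovers any token) and it answers counting queries without scanning (`rank`).

```python
class WaveletTree:
    """Generic balanced wavelet tree over sortable symbols (illustrative, not the XWT)."""

    def __init__(self, seq, alphabet=None):
        # Sorted alphabet of this node; children receive one half each.
        self.alphabet = sorted(set(seq)) if alphabet is None else alphabet
        self.bits = None
        self.left = self.right = None
        if len(self.alphabet) <= 1:
            return  # leaf: every position holds the same symbol
        mid = len(self.alphabet) // 2
        left_syms = set(self.alphabet[:mid])
        # Bit 0 routes a symbol to the left child, bit 1 to the right child.
        self.bits = [0 if s in left_syms else 1 for s in seq]
        # prefix[i] = number of 1-bits among the first i positions.
        self.prefix = [0]
        for b in self.bits:
            self.prefix.append(self.prefix[-1] + b)
        self.left = WaveletTree(
            [s for s in seq if s in left_syms], self.alphabet[:mid])
        self.right = WaveletTree(
            [s for s in seq if s not in left_syms], self.alphabet[mid:])

    def access(self, i):
        """Return the symbol at position i of the original sequence."""
        if self.bits is None:
            return self.alphabet[0]
        ones = self.prefix[i]
        if self.bits[i]:
            return self.right.access(ones)
        return self.left.access(i - ones)

    def rank(self, sym, i):
        """Count occurrences of sym among the first i positions."""
        if self.bits is None:
            return i if self.alphabet and self.alphabet[0] == sym else 0
        mid = len(self.alphabet) // 2
        ones = self.prefix[i]
        if sym >= self.alphabet[mid]:
            return self.right.rank(sym, ones)
        return self.left.rank(sym, i - ones)


# Hypothetical token stream from a tiny XML document.
tokens = ["<book>", "<title>", "XML", "</title>",
          "<title>", "trees", "</title>", "</book>"]
wt = WaveletTree(tokens)
recovered = [wt.access(i) for i in range(len(tokens))]
title_count = wt.rank("<title>", len(tokens))
```

In this sketch, `recovered` reproduces the original token list and `title_count` gives the number of `<title>` start tags, which is the kind of counting primitive an XPath evaluator could build on.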

More information

Journal title: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 5714
Publisher: Springer
Publication date: 2009
Start page: 273
End page: 284
URL: http://www.scopus.com/inward/record.url?eid=2-s2.0-77952062182&partnerID=q2rCbXpz