A Prototype for Querying over LZCS Transformed Documents

Adiego J.; Navarro G.; De La Fuente P.

Abstract

We present novel query algorithms that efficiently support some popular XPath operations over LZCS-transformed documents. The LZCS transformation compresses a redundant XML collection without loss. The main idea of LZCS, inspired by Lempel-Ziv compression, is to replace whole substructures by previous occurrences thereof, and our algorithms try to reuse the work done over those repeating substructures. The algorithms are implemented in a prototype called lzcs-grep. The main advantage of lzcs-grep is that it processes the documents in transformed form, obtaining very fast response times in combination with low memory requirements. Our experimental results show that lzcs-grep is competitive with other XPath processors even over untransformed documents, and by far unbeaten when it can operate over their LZCS-transformed version. © Copyright 2010 IEEE - All Rights Reserved.

Más información

Título según WOS: A Prototype for Querying over LZCS Transformed Documents
Título según SCOPUS: A prototype for querying over LZCS transformed documents
Título de la Revista: IEEE Latin America Transactions
Volumen: 7
Número: 3
Editorial: IEEE Computer Society
Fecha de publicación: 2009
Página de inicio: 353
Página final: 360
Idioma: eng
URL: http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5336634
DOI:

10.1109/TLA.2009.5336634

Notas: ISI, SCOPUS