On the compression of search trees

Claude F.; Nicholson P.K.; Seco, D.

Abstract

Let X = x(1),x(2),...,x(n) be a sequence of non-decreasing integer values. Storing a compressed representation of X that supports access and search is a problem that occurs in many domains. The most common solution to this problem uses a linear list and encodes the differences between consecutive values with encodings that favor small numbers. This solution includes additional information (i.e. samples) to support efficient searching on the encoded values. We introduce a completely different alternative that achieves compression by encoding the differences in a search tree. Our proposal has many applications, such as the representation of posting lists, geographic data, sparse bitmaps, and compressed suffix arrays, to name just a few. The structure is practical and we provide an experimental evaluation to show that it is competitive with the existing techniques. (C) 2013 Elsevier Ltd. All rights reserved.

Más información

Título según WOS: On the compression of search trees
Título de la Revista: INFORMATION PROCESSING MANAGEMENT
Volumen: 50
Número: 2
Editorial: ELSEVIER SCI LTD
Fecha de publicación: 2014
Página de inicio: 272
Página final: 283
Idioma: English
URL: http://linkinghub.elsevier.com/retrieve/pii/S030645731300112X
DOI:

10.1016/j.ipm.2013.11.002

Notas: ISI