Parallel query processing on distributed clustering indexes
Keywords: search, information, structures, space, topology, algorithms, world, media, set, data, similarity, theory, parallel, distributed, databases, searches, query, content, clustering, transforms, retrieval, web, processing, fourier, wide, computing, and, Indexing, (of, information), (materials, engines, metric, working)
Abstract
Similarity search has been proved suitable for searching in large collections of unstructured data objects. A number of practical index data structures for this purpose have been proposed. All of them have been devised to process single queries sequentially. However, in large-scale systems such as Web Search Engines indexing multi-media content, it is critical to deal efficiently with streams of queries rather than with single queries. In this paper we show how to achieve efficient and scalable performance in this context. To this end we transform a sequential index based on clustering into a distributed one and devise algorithms and optimizations specially tailored to support high-performance parallel query processing. © 2008 Elsevier Inc. All rights reserved.
Más información
Título según SCOPUS: | Parallel query processing on distributed clustering indexes |
Título de la Revista: | JOURNAL OF DISCRETE ALGORITHMS |
Volumen: | 7 |
Número: | 1 |
Editorial: | ELSEVIER SCIENCE BV |
Fecha de publicación: | 2009 |
Página de inicio: | 3 |
Página final: | 17 |
Idioma: | eng |
URL: | http://www.scopus.com/inward/record.url?eid=2-s2.0-58649090461&partnerID=q2rCbXpz |
DOI: |
10.1016/j.jda.2008.09.010 |
Notas: | SCOPUS |