Probabilistic proximity searching algorithms based on compact partitions

Bustos, B; Navarro G.

Keywords: systems, system, compression, information, learning, database, range, image, measurement, algorithms, queries, spaces, similarity, languages, theory, probability, query, probabilistic, methods, retrieval, approximate, searching, Functions, Multimedia, Computational, metric

Abstract

The main bottleneck of the research in metric space searching is the so-called curse of dimensionality, which makes the task of searching some metric spaces intrinsically difficult, whatever algorithm is used. A recent trend to break this bottleneck resorts to probabilistic algorithms, where it has been shown that one can find 99% of the relevant objects at a fraction of the cost of the exact algorithm. These algorithms are welcome in most applications because resorting to metric space searching already involves a fuzziness in the retrieval requirements. In this paper, we push further in this direction by developing probabilistic algorithms on data structures whose exact versions are the best for high dimensions. As a result, we obtain probabilistic algorithms that are better than the previous ones. We give new insights on the problem and propose a novel view based on time-bounded searching. We also propose an experimental framework for probabilistic algorithms that permits comparing them in offline mode. © 2003 Elsevier B.V. All rights reserved.

Más información

Título según SCOPUS: Probabilistic proximity searching algorithms based on compact partitions
Título de la Revista: JOURNAL OF DISCRETE ALGORITHMS
Volumen: 2
Número: 1 SPEC. ISS.
Editorial: ELSEVIER SCIENCE BV
Fecha de publicación: 2004
Página de inicio: 115
Página final: 134
Idioma: English
URL: http://www.scopus.com/inward/record.url?eid=2-s2.0-10644227003&partnerID=q2rCbXpz
DOI:

10.1016/S1570-8667(03)00067-4

Notas: SCOPUS