A guided tour to approximate string matching

Navarro G.

Abstract

We survey the current techniques to cope with the problem of string matching that allows errors. This is becoming a more AND more relevant issue for many fast growing areas such as information retrieval AND computational biology. We focus on online searching AND mostly on edit distance, explaining the problem AND its relevance, its statistical behavior, its history AND current developments, AND the central ideas of the algorithms AND their complexities. We present a number of experiments to compare the performance of the different algorithms AND show which are the best choices. We conclude with some directions for future work AND open problems. Categories AND Subject Descriptors: F.2.2 [Analysis of algorithms AND problem complexity]: Nonnumerical algorithms AND problems-Pattern matching, Computations on discrete structures; H.3.3 [Information storage AND retrieval]: Information search AND retrieval-Search process General Terms: Algorithms. © 2001 ACM.

Más información

Título según WOS: A guided tour to approximate string matching
Título según SCOPUS: A guided tour to approximate string matching
Título de la Revista: ACM COMPUTING SURVEYS
Volumen: 33
Número: 1
Editorial: ASSOC COMPUTING MACHINERY
Fecha de publicación: 2001
Página de inicio: 31
Página final: 88
Idioma: English
URL: http://portal.acm.org/citation.cfm?doid=375360.375365
DOI:

10.1145/375360.375365

Notas: ISI, SCOPUS