Matching dependencies: semantics and query answering
Abstract
Matching dependencies (MDs) are used to declaratively specify the identification (or matching) of certain attribute values in pairs of database tuples when some similarity conditions on other values are satisfied. Their enforcement can be seen as a natural generalization of entity resolution. In what we call the pure case of MD enforcement, an arbitrary value from the underlying data domain can be used for the value in common that is used for a matching. However, the overall number of changes of attribute values is expected to be kept to a minimum. We investigate this case in terms of semantics and the properties of data cleaning through the enforcement of MDs. We characterize the intended clean instances, and also the clean answers to queries, as those that are invariant under the cleaning process. The complexity of computing clean instances and clean query answering is investigated. Tractable and intractable cases depending on the MDs are identified and characterized.
Más información
Título según WOS: | ID WOS:000304876500003 Not found in local WOS DB |
Título de la Revista: | FRONTIERS OF COMPUTER SCIENCE |
Volumen: | 6 |
Número: | 3 |
Editorial: | HIGHER EDUCATION PRESS |
Fecha de publicación: | 2012 |
Página de inicio: | 278 |
Página final: | 292 |
DOI: |
10.1007/s11704-012-2007-0 |
Notas: | ISI |