Computing How-Provenance for SPARQL Queries via Query Rewriting

Hernandez, Daniel; Galarraga, Luis; Hose, Katja

Abstract

Over the past few years, we have witnessed the emergence of large knowledge graphs built by extracting and combining information from multiple sources. This has propelled many advances in query processing over knowledge graphs, however the aspect of providing provenance explanations for query results has so far been mostly neglected. We therefore propose a novel method, SPARQLprov, based on query rewriting, to compute how-provenance polynomials for SPARQL queries over knowledge graphs. Contrary to existing works, SPARQLprov is system-agnostic and can be applied to standard and already deployed SPARQL engines without the need of customized extensions. We rely on spm-semirings to compute polynomial annotations that respect the property of commutation with homomorphisms on monotonic and non-monotonic SPARQL queries without aggregate functions. Our evaluation on real and synthetic data shows that SPARQLprov over standard engines incurs an acceptable runtime overhead w.r.t. the original query, competing with state-of-the-art solutions for how-provenance computation.

Más información

Título según WOS: ID WOS:000742944600012 Not found in local WOS DB
Título de la Revista: PROCEEDINGS OF THE VLDB ENDOWMENT
Volumen: 14
Número: 13
Editorial: ASSOC COMPUTING MACHINERY
Fecha de publicación: 2021
Página de inicio: 3389
Página final: 3401
DOI:

10.14778/3484224.3484235

Notas: ISI