Performance evaluation of single vs. batch of queries on GPUs

Gaioso, Roussian; Gil-Costa, Veronica; Guardia, Helio; Senger, Hermes

Abstract

The WAND processing strategy is a dynamic pruning algorithm designed for large scale Web search engines where fast response to queries is a critical service. The WAND is used to reduce the amount of computation by scoring only documents that may become part of the top-k document results. In this paper, we present two parallel strategies for the WAND algorithm and compare their performance on GPUs. In our first strategy (named size-based), the posting lists are evenly partitioned among thread blocks. Our second strategy (named range-based) partitions the posting lists according to document identifier intervals; thus, partitions may have different sizes. We also propose three threshold sharing policies, named Local, Safe-R, and Safe-WR, which emulate the WAND algorithm global pruning technique. We evaluated our proposals with different amounts of work, from short to extra-large queries, using single query processing and batch of queries. Results show that the size-based strategy reports the highest speedups but at the cost of low quality of results. The range-based algorithm retrievals the exact top-k documents and maintains a good speedup. Moreover, both strategies are capable of scaling as the amount of work is increased. In addition, there is no significant difference in the performance of the three threshold sharing policies.

Más información

Título según WOS: ID WOS:000481183600001 Not found in local WOS DB
Título de la Revista: CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE
Volumen: 32
Número: 20
Editorial: Wiley
Fecha de publicación: 2020
DOI:

10.1002/cpe.5474

Notas: ISI