A scalable and energy efficient GPU thread map for m-simplex domains
Abstract
This work proposes a new GPU thread map for m-simplex domains that improves its speedup along with the m-dimension and is energy efficient compared to other state of the art approaches. The main contributions of this work are (i) the formulation of an improved new block-space map H:Zm?Zm for regular orthogonal simplex domains, which is analyzed in terms of resource usage, and (ii) the experimental evaluation in terms of speedup and energy efficiency with respect to a bounding box approach. Results from the analysis show that H has a potential speedup of up to 2× and 6× for 2 and 3-simplices, respectively. Experimental evaluation shows that H is competitive for 2-simplices, reaching 1.2×?2.0× of speedup for different tests, which is on par with the fastest state of the art approaches. For 3-simplices H reaches up to 1.3×?6.0× of speedup making it the fastest. The extension of H to higher dimensional m-simplices is feasible and has a potential speedup that scales as m! given a proper selection of parameters r,? which are the scaling and replication factors of the geometry, respectively. In terms of energy consumption, although H is among the highest in power consumption, it compensates by its short duration, making it one of the most energy efficient approaches. The results of this work show that H is a scalable and energy efficient map that improves the efficiency of GPU applications that need to process m-simplex domains, such as Cellular Automata or PDE simulations, among others. © 2022 Elsevier B.V.
Más información
| Título según WOS: | A scalable and energy efficient GPU thread map for m-simplex domains |
| Título según SCOPUS: | A scalable and energy efficient GPU thread map for m-simplex domains |
| Título de la Revista: | Future Generation Computer Systems |
| Volumen: | 141 |
| Editorial: | Elsevier B.V. |
| Fecha de publicación: | 2023 |
| Página de inicio: | 651 |
| Página final: | 662 |
| Idioma: | English |
| DOI: |
10.1016/j.future.2022.12.020 |
| Notas: | ISI, SCOPUS |