PDI - Resultado de Búsqueda

Abstract

This work presents Squeeze, an efficient compact fractal processing scheme for tensor core GPUs. By combining discrete-space transformations between compact and expanded forms, one can do data-parallel computation on a fractal with neighborhood access without needing to expand the fractal in memory. The space transformations are formulated as two GPU tensor-core accelerated thread maps, Î»(Ï) and Î½(Ï), which act as compact-to-expanded and expanded-to-compact space functions, respectively. The cost of the maps is O(log2logs(n)) time, with n being the side of a nÃn embedding for the fractal in its expanded form, and s the linear scaling factor. The proposed approach works for any fractal that belongs to the Non-overlapping-Bounding-Boxes (NBB) class of discrete fractals, and can be extended to three dimensions as well. Experimental results using a discrete Sierpinski Triangle as a case study shows up to â¼12Ã of speedup and a memory reduction factor of up to â¼315Ã with respect to a GPU-based expanded-space bounding box approach. These results show that the proposed compact approach will allow the scientific community to efficiently tackle problems that up to now could not fit into GPU memory.

Más información

Título según WOS:	Squeeze: Efficient compact fractals for tensor core GPUs
Título según SCOPUS:	Squeeze: Efficient compact fractals for tensor core GPUs
Título de la Revista:	Future Generation Computer Systems
Volumen:	135
Editorial:	Elsevier B.V.
Fecha de publicación:	2022
Página de inicio:	10
Página final:	19
Idioma:	English
DOI:	10.1016/j.future.2022.04.023
Notas:	ISI, SCOPUS

Squeeze: Efficient compact fractals for tensor core GPUs

Abstract

Más información