Performance of OpenMP Loop Transformations for the Acoustic Wave Stencil on GPUs
SessionResearch Posters Display
DescriptionIn this work, we evaluate the performance of unroll and tiling, two loop transformations introduced in OpenMP 5.1 and early implemented in Clang 13 for GPUs. Experiments on a common seismic computational kernel demonstrate performance gains on three GPU architectures.