Workshop: HiPar22: 3rd Workshop on Hierarchical Parallelism for Exascale Computing
Authors: Vivek Kale (Sandia National Laboratories, Brookhaven National Laboratory) and Mathialakan Thavappiragasam (Argonne National Laboratory (ANL), Oak Ridge National Laboratory (ORNL))
Abstract: OpenMP has become the de facto standard for shared memory parallel programming. OpenMP provides a directive, nowait, to enable asynchronous target offload from host to device. In this presentation, we identify best practices in using the asynchronous offload in OpenMP correctly and performantly. Through experimental evaluation on Summit and Crusher, we show how we use the nowait clause of OpenMP to improve performance of a graph algorithm, Floyd-Warshall, by up to 58.24% on Summit and 30.38% on Crusher. Such opportunities suggest the need for programmers to use the nowait features of OpenMP with care in order to achieve performance.
Back to HiPar22: 3rd Workshop on Hierarchical Parallelism for Exascale Computing Archive Listing