SC22 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Workshops Archive

OpenMP's Asynchronous Offloading for Combinatorial Scientific Computations


Workshop: HiPar22: 3rd Workshop on Hierarchical Parallelism for Exascale Computing

Authors: Vivek Kale (Sandia National Laboratories, Brookhaven National Laboratory) and Mathialakan Thavappiragasam (Argonne National Laboratory (ANL), Oak Ridge National Laboratory (ORNL))


Abstract: OpenMP has become the de facto standard for shared memory parallel programming. OpenMP provides a clause, nowait, that enables asynchronous target offload from host to device. In this presentation, we identify best practices for using asynchronous offload in OpenMP correctly and efficiently. Through experimental evaluation on Summit and Crusher, we show how we use the nowait clause of OpenMP to improve the performance of a graph algorithm, Floyd-Warshall, by up to 58.24% on Summit and 30.38% on Crusher. These results suggest that programmers need to apply the nowait clause with care in order to realize such performance gains.
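As an illustration of the mechanism the abstract describes, the following is a minimal sketch of asynchronous target offload with the nowait clause. It is not the authors' implementation: the function name floyd_warshall_batch_async, the idea of batching several independent graphs, and the row-major matrix layout are all assumptions made for this example. Each graph is offloaded as a deferred target task, and a single taskwait makes the host block until all of them have finished. The sketch assumes zero diagonals and no negative cycles, so the entries of row k and column k do not change during step k.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not taken from the paper: all-pairs shortest paths for
 * a batch of independent graphs. Graph g is the n x n row-major distance
 * matrix starting at batch[g * n * n]; missing edges hold a large finite value. */
void floyd_warshall_batch_async(float *batch, int num_graphs, int n)
{
    assert(batch != NULL && num_graphs > 0 && n > 0);

    for (int g = 0; g < num_graphs; ++g) {
        int off = g * n * n;  /* start of graph g inside the batch */
        int len = n * n;      /* number of entries in one matrix */

        /* nowait turns the target region into a deferred task, so the host
         * can move on and launch the next graph without waiting for this one. */
        #pragma omp target nowait map(tofrom: batch[off:len])
        {
            float *d = batch + off;
            for (int k = 0; k < n; ++k) {
                /* The k steps must run in order; within one step the (i, j)
                 * updates are independent of each other. */
                #pragma omp parallel for collapse(2)
                for (int i = 0; i < n; ++i) {
                    for (int j = 0; j < n; ++j) {
                        float via = d[i * n + k] + d[k * n + j];
                        if (via < d[i * n + j]) {
                            d[i * n + j] = via;
                        }
                    }
                }
            }
        }
    }

    /* The results are only safe to read on the host after all deferred
     * target tasks have completed. */
    #pragma omp taskwait
}
```

Forgetting the taskwait (or an equivalent depend clause) before reading the results on the host is the kind of correctness pitfall that makes nowait require care. If the code is built without OpenMP support, the pragmas are ignored and the function runs serially on the host with the same results.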




