OpenMP's Asynchronous Offloading for Combinatorial Scientific Computations
DescriptionOpenMP has become the de facto standard for shared memory parallel programming. OpenMP provides a directive, nowait, to enable asynchronous target offload from host to device. In this presentation, we identify best practices in using the asynchronous offload in OpenMP correctly and performantly. Through experimental evaluation on Summit and Crusher, we show how we use the nowait clause of OpenMP to improve performance of a graph algorithm, Floyd-Warshall, by up to 58.24% on Summit and 30.38% on Crusher. Such opportunities suggest the need for programmers to use the nowait features of OpenMP with care in order to achieve performance.
Event Type
Workshop
TimeFriday, 18 November 20229:40am - 10am CST
LocationC140-142
Registration Categories
W
Tags
Algorithms
Architectures
Compilers
Computational Science
Exascale Computing
Heterogeneous Systems
Hierarchical Parallelism
Memory Systems
Parallel Programming Languages and Models
Parallel Programming Systems
Resource Management and Scheduling
Session Formats
Recorded
Back To Top Button