OpenMP's Asynchronous Offloading for Combinatorial Scientific Computations
Description: OpenMP has become the de facto standard for shared-memory parallel programming. Its nowait clause, applied to target constructs, enables asynchronous offload from host to device. In this presentation, we identify best practices for using asynchronous offload in OpenMP correctly and efficiently. Through experimental evaluation on Summit and Crusher, we show how the nowait clause improves the performance of a graph algorithm, Floyd-Warshall, by up to 58.24% on Summit and 30.38% on Crusher. These results suggest that programmers must use OpenMP's nowait features with care to achieve good performance.
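As a rough illustration of the asynchronous offload pattern described above, the following C sketch queues one Floyd-Warshall kernel per pivot k with nowait, orders the kernels with a depend clause, and synchronizes once with taskwait. It is not the presenters' implementation: the function name floyd_warshall_async, the FW_INF sentinel, and the row-major int matrix layout are assumptions made for this example, and it assumes non-negative edge weights with a zero diagonal.

```c
#include <assert.h>

/* Sentinel for "no edge"; a hypothetical choice for this sketch. */
#define FW_INF 1000000000

/*
 * In-place all-pairs shortest paths on a row-major n x n matrix.
 * Assumes non-negative weights and dist[i][i] == 0, so row k and
 * column k do not change while pivot k is being processed.
 */
void floyd_warshall_async(int *dist, int n)
{
    assert(dist != 0 && n > 0);

    /* Keep the matrix resident on the device across all pivots. */
    #pragma omp target data map(tofrom: dist[0:n*n])
    {
        for (int k = 0; k < n; ++k) {
            /*
             * nowait turns the offload into a deferred target task, so
             * the host can queue the next pivot without blocking.
             * depend(inout: ...) keeps the pivots in order, because
             * pivot k + 1 reads what pivot k wrote.
             */
            #pragma omp target teams distribute parallel for collapse(2) \
                nowait depend(inout: dist[0:n*n])
            for (int i = 0; i < n; ++i) {
                for (int j = 0; j < n; ++j) {
                    int ik = dist[i * n + k];
                    int kj = dist[k * n + j];
                    if (ik < FW_INF && kj < FW_INF &&
                        ik + kj < dist[i * n + j]) {
                        dist[i * n + j] = ik + kj;
                    }
                }
            }
        }
        /* Wait for every queued kernel before the data is copied back. */
        #pragma omp taskwait
    }
}
```

Calling floyd_warshall_async(d, 3) on a 3 x 3 matrix d overwrites it with shortest-path lengths. Without OpenMP support the pragmas are ignored and the loops run sequentially on the host with the same result. The taskwait sits inside the target data region on purpose: leaving the region before the deferred kernels finish would copy back incomplete results, which is the kind of correctness pitfall that makes nowait require care.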
Parallel Programming Languages and Models
Parallel Programming Systems
Resource Management and Scheduling
Next Presentation: HiPar22 – Morning Break