On-the-Fly Data Race Detection for MPI RMA Programs with MUST
DescriptionMPI Remote Memory Access (RMA) provides a one-sided communication model for MPI applications. Ensuring consistency between RMA operations with synchronization calls is a key requirement when writing correct RMA codes. Wrong API usage may lead to concurrent modifications of the same memory location without proper synchronization resulting in data races across processes. Due to their non-deterministic nature, such data races are hard to detect. This paper presents MUST-RMA, an on-the-fly data race detector for RMA applications. MUST-RMA uses a race detection model based on happened-before and consistency analysis. It combines the MUST correctness tool with the race detector ThreadSanitizer to detect races across processes in RMA applications. A classification quality study on MUST-RMA with different test cases shows a precision and recall of 0.93. An overhead study on a stencil and a matrix transpose kernel shows runtime slowdowns of 3x to 20x for up to 192 processes.
Event Type
TimeFriday, 18 November 20229:40am - 10am CST
Registration Categories
Software Engineering
Session Formats
Back To Top Button