12th Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS 2022)
Session Chairs
Event TypeWorkshop
W
Recorded
TimeMonday, 14 November 20221:30pm - 5pm CST
LocationC143-149
DescriptionIncreases in the number, variety, and complexity of components required to compose next-generation extreme-scale systems mean that systems will experience significant increases in aggregate fault rates, fault diversity, and fault complexity. Additionally, the widespread availability of new storage devices (NVMM, NVMe, SSD), increasing system heterogeneity, and the emergence of novel computing paradigms (neuromorphic, quantum) introduce fault tolerance issues that the research community has just begun to address.
Workshop Website
Workshop Website
Archive
view
Presentations
1:30pm - 1:35pm CST | FTXS 2022 – Opening Remarks Presenter | |
1:35pm - 2:35pm CST | FTXS 2022 – Featured Speaker: Harish Dixit (Facebook) Presenter | |
2:35pm - 2:59pm CST | ClusterLog: Clustering Logs for Effective Log-Based Anomaly Detection | |
2:59pm - 3:29pm CST | FTXS 2022 – Afternoon Break | |
3:29pm - 3:54pm CST | Recovery of Distributed Iterative Solvers for Linear Systems Using Non-Volatile RAM | |
3:54pm - 4:09pm CST | Toward Precision-Aware Fault Tolerance Approaches for Mixed-Precision Applications | |
4:09pm - 4:34pm CST | ReStore: In-Memory REplicated STORagE for Rapid Recovery in Fault-Tolerant Algorithms | |
4:34pm - 4:59pm CST | Implicit Actions and Non-blocking Failure Recovery with MPI | |
4:59pm - 5:00pm CST | FTXS 2022 – Closing Remarks Presenter |