Learning to Unlearn (some) Conventional Wisdom in HPC System Design and Operation
DescriptionTraditionally, we have assumed that HPC users are fairly boring and that their workloads often do similar things repetitively. Their "boring" nature has served us well so far -- we could design "boring" systems and get away with it. But, now things are changing and changing fast. Our HPC workloads and users are becoming interesting and, often, are surprising us with new trends and behavior. That means it is springing excitement into our lives. We need to design interesting solutions, and come out of our boredom. My talk will discuss specific examples from job resource consumption, system reliability, and performance tuning -- and their impact on system design and operations. I'll discuss some "speculative" use cases which could really disrupt our boredom and calmness, and assess if we are ready for that?
Event Type
TimeSunday, 13 November 20229:40am - 10am CST
Registration Categories
Data Analytics
Extreme Scale Computing
HPC Community Collaboration
Machine Learning and Artificial Intelligence
Resource Management and Scheduling
System Software
Session Formats
Back To Top Button