Learning to Unlearn (some) Conventional Wisdom in HPC System Design and Operation
Description
Traditionally, we have assumed that HPC users are fairly boring and that their workloads often do similar things repetitively. This "boring" nature has served us well so far: we could design "boring" systems and get away with it. But things are now changing, and changing fast. Our HPC workloads and users are becoming interesting and often surprise us with new trends and behavior. That brings excitement into our lives, and it means we need to come out of our boredom and design interesting solutions. My talk will discuss specific examples from job resource consumption, system reliability, and performance tuning, and their impact on system design and operations. I'll also discuss some "speculative" use cases that could really disrupt our boredom and calmness, and assess whether we are ready for that.
Event Type
Workshop
Time
Sunday, 13 November 2022, 9:40am - 10am CST
Location
D221
Registration Categories
W
Tags
Architectures
Data Analytics
Datacenter
Extreme Scale Computing
HPC Community Collaboration
Machine Learning and Artificial Intelligence
Performance
Resource Management and Scheduling
System Software
Session Formats
Recorded