Workshop: SC22 SuperCompCloud: 6th International Workshop on Interoperability of Supercomputing and Cloud Technologies
Authors: Marcin Copik, Marcin Chrapek, Alexandru Calotoiu, and Torsten Hoefler (ETH Zürich)
Abstract: The aggregated HPC resources with rigid allocation systems and programming models struggle to adapt to diverse and changing workloads. Thus, HPC systems fail to efficiently use the large pools of unused memory and increase the utilization of idle computing resources. Prior work attempted to increase the throughput and efficiency of supercomputing systems through workload co-location and resource disaggregation. However, these methods fail to provide a solution that can be applied to existing systems without major hardware modifications and performance losses. In this presentation, we propose to use the new cloud paradigm of serverless computing to improve the utilization of supercomputers. We show that the FaaS programming model can satisfy the requirements of high-performance applications. We demonstrate a software resource disaggregation approach where the co-location of functions allows to utilize idle cores and accelerators while retaining near-native performance.