SC22 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Research Posters Archive

A Light-Weight and Unsupervised Method for Near Real-time Anomaly Detection Using Operational Data Measurement


Authors: Tom Richard Vargis (Technical University Dresden) and Siavash Ghiasvand (Technical University Dresden, ZIH)

Abstract: Monitoring the status of large computing systems is essential to identify unexpected behavior and improve their performance and up-time. However, due to the large-scale and distributed design of such computing systems as well as a large number of monitoring parameters, automated monitoring methods should be applied. Such automatic monitoring methods should also have the ability to adapt themselves to the continuous changes in the computing system. In addition, they should be able to identify behavioral anomalies in useful time, in order to perform appropriate reactions. This work proposes a general light-weight and unsupervised method for near real-time anomaly detection using operational data measurement on large computing systems. The proposed model requires as low as 4 hours of data and 50 epochs for each training process to accurately resemble the behavioral pattern of computing systems.

Best Poster Finalist (BP): no

Poster: PDF
Poster summary: PDF


Back to Poster Archive Listing