Monitoring

Considerations for Monitoring the HPC Platform


Part of the HPC Concepts series
Types of Monitoring

There are two types of monitoring that can be implemented on a network: passive and active.

Both forms of monitoring are usually necessary to ensure that your HPC cluster is running properly.
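To make the distinction concrete, here is a minimal Python sketch. It assumes the common usage in which passive monitoring records metrics for later inspection, while active monitoring checks them against thresholds and raises notifications. The two metrics sampled (1-minute load average and disk usage), the threshold values, and all function names are illustrative assumptions, not part of any particular monitoring tool.

```python
import os
import shutil
import time
from dataclasses import dataclass


@dataclass
class Sample:
    timestamp: float
    load_1min: float
    disk_used_fraction: float


def collect_sample(path="/"):
    """Read two example metrics from the local node (Unix only)."""
    load_1min, _, _ = os.getloadavg()
    usage = shutil.disk_usage(path)
    return Sample(time.time(), load_1min, usage.used / usage.total)


def record(sample, history):
    """Passive style: keep the sample so it can be graphed or queried later."""
    history.append(sample)


def check(sample, max_load, max_disk_fraction):
    """Active style: compare the sample to thresholds and return alert messages."""
    alerts = []
    if sample.load_1min > max_load:
        alerts.append(f"load average {sample.load_1min:.2f} exceeds {max_load}")
    if sample.disk_used_fraction > max_disk_fraction:
        alerts.append(
            f"disk usage {sample.disk_used_fraction:.0%} exceeds {max_disk_fraction:.0%}"
        )
    return alerts


if __name__ == "__main__":
    history = []
    sample = collect_sample()
    record(sample, history)  # passive: store for later
    # Hypothetical thresholds; a real deployment would send an email or
    # page someone instead of printing.
    for message in check(sample, max_load=8.0, max_disk_fraction=0.9):
        print("ALERT:", message)
```

In practice the same sample usually feeds both paths, which is one reason the two forms of monitoring tend to be deployed together.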

Metrics

It is worth considering which system metrics will be monitored; a few common ones are listed here:

Cloud service providers usually have both passive and active monitoring services available through their cloud management front-end.

Additional Considerations and Questions