.. _Monitoring the Compute Cluster: Monitoring the Compute Cluster ------------------------------ After you create the compute cluster, you can monitor it on the **COMPUTE** > **OVERVIEW** screen. .. not present in UI The compute cluster status is displayed on top of the screen and can be one of the following: .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{CRITICAL}} **HEALTHY** All compute cluster components and nodes operate normally. **WARNING** The compute cluster is experiencing issues that may affect its performance. The corresponding alert is generated and shown on the **Alerts** chart. **CRITICAL** The compute cluster has encountered a critical problem. The corresponding alert is generated and shown on the **Alerts** chart. .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{ }} The charts show the information on CPU, RAM, and storage usage; the number of virtual machines grouped by status and resource consumption; and compute-related alerts. .. _Used CPUs Chart: Used CPUs Chart ~~~~~~~~~~~~~~~ This chart displays CPU utilization of the compute cluster. The following statistics are available: .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{Provisioned vCPUs}} **System** The number of logical cores used by system and storage services on all nodes in the compute cluster. **VMs** The number of logical cores used by virtual machines on all nodes in the compute cluster. **Free** The number of unused logical cores on all nodes in the compute cluster. **Total** The total number of logical cores on all nodes in the compute cluster. **Provisioned vCPUs** The number of vCPUs provisioned for all VMs in the compute cluster. .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{ }} .. only:: ac .. image:: /images/stor_image138_ac.png :align: center :class: align-center .. only:: vz .. image:: /images/stor_image138_vz.png :align: center :class: align-center .. include:: /includes/monitoring-compute-cluster-part1.inc .. _Reserved RAM Chart: Reserved RAM Chart ~~~~~~~~~~~~~~~~~~ This chart displays RAM utilization of the compute cluster. The following statistics are available: .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{Used by VMs}} **System** The amount of RAM reserved for system and storage services on all nodes in the compute cluster. **VMs** The amount of RAM provisioned for all VMs in the compute cluster. **Free** The amount of free RAM on all nodes in the compute cluster. **Total** The total amount of RAM on all nodes in the compute cluster. **Used by VMs** The amount of RAM actually used by all VMs in the compute cluster. .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{ }} .. only:: ac .. image:: /images/stor_image139_ac.png :align: center :class: align-center .. only:: vz .. image:: /images/stor_image139_vz.png :align: center :class: align-center .. include:: /includes/monitoring-compute-cluster-part1.inc .. _Provisioned Storage Chart: Provisioned Storage Chart ~~~~~~~~~~~~~~~~~~~~~~~~~ This chart shows usage of storage space by the compute cluster. The following statistics are available: .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{Free physical space}} **Used** The amount of storage space actually occupied by data in all volumes provisioned in the compute cluster. **Free** The amount of unused space in all volumes provisioned in the compute cluster. **Total** The total size of volumes provisioned in the compute cluster. **Free physical space** The amount of physical space available in the storage cluster. .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{ }} .. only:: ac .. image:: /images/stor_image140_ac.png :align: center :class: align-center .. only:: vz .. image:: /images/stor_image140_vz.png :align: center :class: align-center .. _VM Status Chart: VM Status Chart ~~~~~~~~~~~~~~~ The **VMs status** chart shows the total number of virtual machines in the compute cluster and groups them by status, which can be the following: .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{In progress}} **Running** The number of virtual machines that are up and running. **In progress** The number of virtual machines that are in a transitional state: building, restarting, migrating, etc. **Stopped** The number of virtual machines that are suspended or powered off. **Error** The number of virtual machines that have failed. Try troubleshooting such VMs via the console or rebuild them. .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{ }} .. only:: ac .. image:: /images/stor_image141_ac.png :align: center :class: align-center .. only:: vz .. image:: /images/stor_image141_vz.png :align: center :class: align-center To see a full list of virtual machines filtered by the chosen status, click the number next to the status icon. .. _Top VMs Chart: Top VMs Chart ~~~~~~~~~~~~~ The **Top VMs** chart lists virtual machines with the highest resource consumption sorted by **CPU**, **RAM**, or **Storage** in descending order. To switch between lists, click the desired resource. .. only:: ac .. image:: /images/stor_image142_ac.png :align: center :class: align-center .. only:: vz .. image:: /images/stor_image142_vz.png :align: center :class: align-center To see a full list of virtual machines in the compute cluster, click **Show all**. .. _Alerts Chart: Alerts Chart ~~~~~~~~~~~~ The **Alerts** chart lists all the alerts related to the compute cluster sorted by severity. Alerts include the following: .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{Critical}} **Critical** The compute cluster has encountered a critical problem. For example, one or more of its components have been unavailable for more than 10 seconds or some resource has exceeded its soft limit. **Warning** The compute cluster is experiencing issues that may affect its performance. For example, one or more of its components operate slowly or some resource is approaching its soft limit. **Other** Some other issue has happened with the compute cluster. For example, its license is about to expire or has expired. .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{ }} To see a full list of compute-related alerts, click **Show all**. .. _Per-VM Charts: Per-VM Charts ~~~~~~~~~~~~~ You can monitor performance individual VMs on the **COMPUTE** > **VIRTUAL MACHINES** > VM > **Monitoring** tab. .. include:: /includes/monitoring-storage-cluster-part1.inc The following performance charts are available: .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{Storage read/write}} **CPU / RAM** CPU and RAM usage by the VM. **Network** Incoming and outgoing network traffic. **Storage read/write** Amount of data read and written by the VM. **Read/write latency** Read and write latency. Hovering the mouse cursor over a point on the chart, you can also see the average and maximum latency for that moment as well as the 95 and 99 percentiles. .. raw:: latex \setlist[description]{leftmargin=!,labelindent=0pt,labelwidth=1em+\widthof{ }}