eG Monitoring
 

Measures reported by NutAHVDiskTest

Monitoring each physical disk grouped in a storage pool provides administrators with insights into the status, configuration, and usage of each disk. This in turn enables administrators to isolate disks that are running out of free space and those that are experiencing serious processing bottlenecks, so that such disks can be marked for resizing or tuning. Using the NutAHVDiskTest test, administrators can receive such useful physical disk-level performance insights!

For each physical disk, this test reports the current status, type, and mode of the test. In addition, the test measures the space usage and I/O processing ability of each disk, and warns administrators of a probable space contention, a potential overload, or a possible processing snag on a disk. This way, the test enables administrators to ensure high disk performance and availability.

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
DISK_STATUS Indicates the current status of this disk.   The values that this measure can report, their description, and their corresponding numeric values are detailed in the table below:

Measure Value Description Numeric Value
Detachable Disk is not being used and can be removed. 0
Normal Disk is operating normally. 1
Data migration initiated Data is being migrated to other disks. 2
Marked for removal Data migration is in progress. 3

Note:

By default, this measure reports the Measure Values listed in the table above to indicate disk status. In the graph of this measure however, status is represented using the numeric equivalents only.

Use the detailed diagnosis of the Status measure to view the UUID of the disk and the cluster to which it is attached.

DISK_TIER Indicates the disk type.   The values that this measure can report and their corresponding numeric values are detailed in the table below:

Measure Value Numeric Value
SSD 0
HDD 1

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the disk type. In the graph of this measure however, the disk type is represented using the numeric equivalents only.

Use the detailed diagnosis of the Status measure to view the UUID of the disk and the cluster to which it is attached.

IS_ONLINE Indicates whether the disk is currently online or offline.   The values that this measure can report and their corresponding numeric values are detailed in the table below:

Measure Value Numeric Value
Offline 0
Online 1

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the disk mode. In the graph of this measure however, the mode is indicated using the numeric equivalents only.

IS_MARKREMOVE Indicates whether/not the disk is ready to be removed.   The values that this measure can report and their corresponding numeric values are detailed in the table below:

Measure Value Numeric Value
Off 0
On 1

Note:

By default, this measure reports the Measure Values listed in the table above to indicate whether/not the disk is ready to be removed. In the graph of this measure however, the same is indicated using the numeric equivalents only.

IS_DATA_MIGRATE Indicates whether/not the data in this disk is migrated.   The values that this measure can report and their corresponding numeric values are detailed in the table below:

Measure Value Numeric Value
No 0
Yes 1

Note:

By default, this measure reports the Measure Values listed in the table above to indicate whether/not the data is migrated. In the graph of this measure however, the same is indicated using the numeric equivalents only.

IS_SELF_ENCRYPT Indicates whether/not this is a self-encrypting drive.   The values that this measure can report and their corresponding numeric values are detailed in the table below:

Measure Value Numeric Value
No 0
Yes 1

Note:

By default, this measure reports the Measure Values listed in the table above to indicate whether/not the disk is a self-encrypting drive. In the graph of this measure however, the same is indicated using the numeric equivalents only.

STR_CAPACITY Indicates the total capacity of this disk. GB  
STR_USAGE Indicates the amount of space in this disk that is currently in use. GB A low value is desired for this measure.
STR_FREE Indicates the amount of space in this disk that is still unused. GB A high value is desired for this measure.
STR_USE_PERC Indicates the percentage of disk capacity that is being utilized. Percent A value close to 100% indicates that the disk is rapidly running out of space.
STR_FREE_PERC Indicates of percentage of disk capacity that is free and is available for use. Percent A value less than 50% indicates that the disk is rapidly running out of space.
STR_LOGIC_USE Indicates the amount of logical storage space in this disk that is in use currently. GB  
NUM_IO_LATENCY Indicates the average time taken by this disk to process I/O requests. Seconds Ideally, the value of this measure should be very low. A high value or a steady increase in this value could indicate an I/O processing bottleneck on this disk. In such a case, compare the value of the NUM_READ_LATENCY and NUM_WRITE_LATENCY measures to figure out when the slowness is worst - when processing read requests? or write requests?
NUM_READ_LATENCY Indicates the average time taken by this disk to process read I/O requests. Seconds If the NUM_IO_LATENCY measure reports an abnormally high value, then compare the value of these measures to figure out where the slowness is maximum - when processing read requests? or write requests?
NUM_WRITE_LATENCY Indicates the average time taken by this disk to process write I/O requests. Seconds
NUM_IO_BWIDTH Indicates the bandwidth per second used by this disk when processing I/O requests. KB/Sec A high value for this measure denotes that this disk is processing bandwidth-intensive I/O. In such situations, you may want to compare the value of the NUM_READ_BWIDTH and NUM_WRITE_BWIDTH measures to know what type of I/O requests are truly contributing to the excessive bandwidth consumptions - read requests? or write requests?
NUM_READ_BWIDTH Indicates the bandwidth per second used by this disk when processing read I/O requests. KB/Sec If the value of the NUM_IO_BWIDTH measure is high, then you may want to compare the value of the NUM_READ_BWIDTH and NUM_WRITE_BWIDTH measures to know what type of I/O requests are truly contributing to the excessive bandwidth consumption - read requests? or write requests?
NUM_WRITE_BWIDTH Indicates the bandwidth per second used by this disk when processing write I/O requests. KB/Sec
NUMBER_IOPS Indicates the number of I/O operations performed currently on this disk. Number This measure is a good indicator of the level of I/O activity on the Nutanix storage. A steady and significant increase in the value of this measure could indicate a potential I/O overload. In such situations, you may want to compare the value of the NUM_READ_IOPS and NUM_WRITE_IOPS measures to know what type of IO operations are contributing to the overload.
NUM_READ_IOPS Indicates the number of read I/O operations performed currently on this disk. Number If the value of the NUMBER_IOPS measure is unusually high, then compare the value of these measures to know what is contributing to the unusual I/O activity levels - read requests? or write requests?
NUM_WRITE_IOPS Indicates the number of write I/O operations performed currently on the container. Number