eG Monitoring
 

Measures reported by KVMGuestTest

This test monitors how much of the physical server's resources each virtual machine on a KVM server is consuming. Using the metrics reported by this test, administrators can determine which virtual machine is consuming the most CPU, which is generating the most network traffic, which is over-utilizing memory, which has the highest disk activity, and so on.

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
VM_state Indicates the current status of this VM.   The numeric values that correspond to each of the Measure Values that this measure can report are listed in the table below:

Measure Value Numeric Value
Blocked 0
Running 1
Crashed 2
Nostate 3
Paused 4
Shutdown 5
Shutoff 6

Note:

By default, this measure reports one of the Measure Values listed in the table above. The graph of this measure, however, represents the status of each VM using the numeric equivalents, ‘0’ to ‘6’.
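
The mapping between Measure Values and their numeric equivalents can be captured in a small lookup table, which may help when post-processing exported graph data. The sketch below is illustrative only: the names `VM_STATE_VALUES`, `VM_STATE_NAMES` and `state_from_numeric` are hypothetical and are not part of the eG product.

```python
# Numeric equivalents of the VM_state Measure Values, copied from the table above.
VM_STATE_VALUES = {
    "Blocked": 0,
    "Running": 1,
    "Crashed": 2,
    "Nostate": 3,
    "Paused": 4,
    "Shutdown": 5,
    "Shutoff": 6,
}

# Reverse lookup: numeric value plotted in the graph -> Measure Value.
VM_STATE_NAMES = {number: name for name, number in VM_STATE_VALUES.items()}


def state_from_numeric(value):
    """Return the Measure Value for a numeric graph value, or 'Unknown'."""
    return VM_STATE_NAMES.get(value, "Unknown")


if __name__ == "__main__":
    for value in (1, 6, 9):
        print(value, state_from_numeric(value))
```

Values outside the documented range fall back to 'Unknown' rather than raising an error; that choice is an assumption made for the sketch.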

Is_vm_persist Indicates whether or not the configuration of this VM is persistent.   The numeric values that correspond to each of the Measure Values that this measure can report are listed in the table below:

Measure Value Numeric Value
Transient 0
Yes 1
Error 2

Note:

By default, this measure reports one of the Measure Values listed in the table above. The graph of this measure, however, represents whether or not the configuration of this VM is persistent using the numeric equivalents, ‘0’ to ‘2’.

CPU_use Indicates the percentage of CPU utilized by this VM. Percent A very high value of this measure indicates that the VM is currently consuming a large share of CPU resources.

Vir_CPU Indicates the number of virtual CPUs that are allocated to this VM. Number  
Mem_alloc Indicates the amount of memory that is currently allocated to this VM. MB  
Used_mem Indicates the amount of memory that is used by this VM. MB A low value is desired for this measure.
Free_mem Indicates the amount of memory that is available for use by this VM. MB A high value is desired for this measure.

The memory that is used for reclaimable cache is not considered as free memory.

Mem_util Indicates the percentage of memory that is currently utilized by this VM. Percent A high value for this measure indicates that the VM is currently running short of memory resources.

Comparing the value of this measure across the VMs will help you identify the VM that is using the maximum memory resources.
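
The cross-VM comparison described above can be sketched as follows. This is a minimal illustration, assuming that Mem_util is derived as Used_mem divided by Mem_alloc (the source does not state the formula); the function names, VM names and figures are hypothetical.

```python
def mem_util_percent(used_mb, alloc_mb):
    """Percentage of allocated memory in use (assumed formula, see lead-in)."""
    if alloc_mb <= 0:
        return 0.0
    return 100.0 * used_mb / alloc_mb


def highest_mem_util(vms):
    """Return (vm_name, percent) for the VM with the highest utilization.

    `vms` maps a VM name to a (Used_mem, Mem_alloc) pair in MB.
    """
    utilization = {
        name: mem_util_percent(used, alloc) for name, (used, alloc) in vms.items()
    }
    top = max(utilization, key=utilization.get)
    return top, utilization[top]


if __name__ == "__main__":
    # Hypothetical sample values, not real measurements.
    sample = {"vm-a": (1024, 4096), "vm-b": (3072, 4096), "vm-c": (512, 2048)}
    print(highest_mem_util(sample))
```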

Swapin_mem Indicates the amount of memory that is being swapped in by the server from the disk for this VM. MB  
Swapout_mem Indicates the amount of memory that is being swapped to the disk by the server for this VM. MB  
Page_faults Indicates the number of page faults that occurred for the threads matching all processes. Number A page fault occurs when a thread refers to a virtual memory page that is not in its working set in main memory. A page fault does not cause the page to be fetched from disk if the page is on the standby list, and hence already in main memory, or if it is in use by another process with which the page is shared.
Unused_mem Indicates the amount of memory that is completely left unused in this VM. MB The value of this measure is the sum total of the Free_mem and the memory that is used for reclaimable caches.
Avail_mem Indicates the amount of memory that is currently available in this VM. MB  
Balloon_mem Indicates the amount of balloon memory that is currently available for use in this VM. MB Memory ballooning is a virtual memory management technique used to free unused memory.

Having multiple virtual machines (VMs) on a single physical server requires virtual memory management techniques to control resource sharing and to prevent shortages. Some processor chipsets use hardware to offload a portion of the virtual memory management work by creating two layers of page tables, the data structure that provides the mapping between virtual addresses and physical addresses. The layers, however, make it difficult for the hypervisor to see a VM's memory contents, how much memory that VM requires or whether the VM is consuming too much memory.

Balloon drivers, which are installed in each VM, transfer the memory shortage from the host (where the shortage exists) to the VM. The hypervisor alerts the balloon driver of low memory instances and instructs it to inflate, which locks a set of unused memory in the VM. The hypervisor can then reassign the physical memory to another VM. This swap activity can potentially impact performance depending upon the amount of memory to recoup and/or the quality of the storage IOPS delivered to the VM. In a VMware environment, the balloon driver only activates when memory becomes scarce, so it’s best to have no ballooning activity at all. In a Windows Server environment, the balloon driver allocates RAM to the VM on-demand.

Rss_mem Indicates the amount of resident memory that is allocated to the process of this VM. MB The resident set size is the portion of a process's memory that is held in RAM. The rest of the memory exists in swap or the filesystem (never loaded or previously unloaded parts of the executable).
Errors Indicates the number of errors that occurred during the disk reads/disk writes of this VM. Number Ideally, the value of this measure should be zero.

Use the detailed diagnosis of this measure to determine the nature of the errors and the disk on which the errors occurred.

Data_read Indicates the rate at which data is read from the disk of this VM. MB/sec A high value of this measure indicates that the disk is experiencing high I/O activity.

The detailed diagnosis of this measure, if enabled, lists the name of the disk and the rate at which data is read from this disk.

Read_req Indicates the number of read requests handled by the disk of this VM. MB/sec The detailed diagnosis of this measure, if enabled, lists the name of the disk and the number of read requests handled by the disk.
Data_write Indicates the rate at which data is written to the disk of this VM. MB/sec The detailed diagnosis of this measure, if enabled, lists the name of the disk and the rate at which data is written to the disk.
Write_req Indicates the number of write requests handled by the disk of this VM. MB/sec The detailed diagnosis of this measure, if enabled, lists the name of the disk and the number of write requests handled by the disk.
Trans_bytes Indicates the rate at which data is transmitted from this VM. Mbps A high value for this measure indicates that the data transmission is high for this VM. The detailed diagnosis of this measure, if enabled, lists the name of the network interface through which data is transmitted and the rate at which data is transmitted.
Trans_pkts Indicates the rate at which packets are transmitted from this VM. Packets/sec A high value for this measure indicates that the data transmission is high for this VM. The detailed diagnosis of this measure, if enabled, lists the name of the network interface through which the packets are transmitted and the rate at which the packets are transmitted.
Data_drop Indicates the number of data packets that were dropped during transmission. Number The detailed diagnosis of this measure, if enabled, lists the name of the network interface that dropped the data and the number of data packets dropped.
Error_data Indicates the number of errors encountered by this VM during transmission. Number Ideally, the value of this measure should be zero.

The detailed diagnosis of this measure, if enabled, lists the name of the network interface and the number of errors that were encountered.

Rec_bytes Indicates the rate at which data is received on this VM. Mbps The detailed diagnosis of this measure, if enabled, lists the name of the network interface and the rate at which data was received.
Rec_pkts Indicates the rate at which data packets were received by this VM. Packets/sec The detailed diagnosis of this measure, if enabled, lists the name of the network interface and the rate at which the data packets were received.
Data_drop_rec Indicates the number of data packets that were dropped during reception by this VM. Packets/sec Ideally, the value of this measure should be zero.

The detailed diagnosis of this measure, if enabled, lists the name of the network interface and the number of data packets that were dropped during reception.

Error_data_rec Indicates the number of errors encountered during data reception by this VM. Number Ideally, the value of this measure should be zero.

The detailed diagnosis of this measure, if enabled, lists the name of the network interface and the number of errors encountered during data reception.
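
Several of the measures above (Errors, Error_data, Data_drop_rec and Error_data_rec) are described as ideally being zero. A minimal sketch of checking a set of readings against that expectation is shown below. The function name, the `readings` dictionary and its layout are assumptions made for illustration; they do not reflect how eG stores or exposes measures.

```python
# Measures whose Interpretation column says the ideal value is zero.
ZERO_IDEAL_MEASURES = ("Errors", "Error_data", "Data_drop_rec", "Error_data_rec")


def nonzero_measures(readings):
    """Return the zero-ideal measures that currently report a non-zero value.

    `readings` maps measure names to their latest values; measures that are
    missing from `readings` are skipped rather than treated as zero.
    """
    return {
        name: readings[name]
        for name in ZERO_IDEAL_MEASURES
        if name in readings and readings[name] != 0
    }


if __name__ == "__main__":
    # Hypothetical readings for one VM.
    print(nonzero_measures({"Errors": 0, "Error_data": 3, "Rec_pkts": 100}))
```

Any measure returned by this check is a candidate for a closer look through its detailed diagnosis.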

High_alloc Indicates the cumulative allocated size of the disks of this VM. MB The detailed diagnosis of this measure, if enabled, lists the name of each disk and the size allocated to each disk.
Physical_size Indicates the physical size of this VM. MB The detailed diagnosis of this measure, if enabled, lists the name of each disk and the physical size that is available in each disk.
Capacity Indicates the current logical size of this VM. MB The detailed diagnosis of this measure, if enabled, lists the name of each disk and the logical size of each disk.
Free_size Indicates the physical size of this VM that is currently free. MB A high value is desired for this measure.

The detailed diagnosis of this measure, if enabled, lists the name of each disk and the physical size that is currently free.

Space_util Indicates the percentage of space that is already utilized by this VM. Percent A value close to 100% indicates that the VM is currently running out of physical space. The detailed diagnosis of this measure, if enabled, lists the name of each disk and the percentage of space utilized by each disk of the VM.
Current_sessions Indicates the number of user sessions that are currently active in this VM. Number This is a good indicator of the session load on the VMs.
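
For the Space_util measure, a value close to 100% signals that a VM is running out of physical space. The sketch below shows one way such VMs could be picked out. It assumes that Space_util is derived from Physical_size and Free_size as (Physical_size - Free_size) / Physical_size, which the source does not confirm; the 90% threshold, function names and sample figures are likewise hypothetical.

```python
def space_util_percent(physical_mb, free_mb):
    """Percentage of physical space in use (assumed formula, see lead-in)."""
    if physical_mb <= 0:
        return 0.0
    return 100.0 * (physical_mb - free_mb) / physical_mb


def vms_low_on_space(vms, threshold=90.0):
    """Return the sorted names of VMs at or above `threshold` percent.

    `vms` maps a VM name to a (Physical_size, Free_size) pair in MB.
    """
    return sorted(
        name
        for name, (physical, free) in vms.items()
        if space_util_percent(physical, free) >= threshold
    )


if __name__ == "__main__":
    # Hypothetical sample values, not real measurements.
    print(vms_low_on_space({"vm-a": (1000, 50), "vm-b": (1000, 500)}))
```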