eG Monitoring
 

Measures reported by VmgUptimeTest

In most virtualized environments, it is essential to monitor the uptime of VMs/physical machines hosting critical server applications in the infrastructure. By tracking the uptime of each of the VMs/physical machines, administrators can determine what percentage of time a VM/physical machines has been up. Comparing this value with service level targets, administrators can determine the most trouble-prone areas of the virtualized infrastructure.

In some environments, administrators may schedule periodic reboots of their VM/physical machines. By knowing that a specific VM/physical machines has been up for an unusually long time, an administrator may come to know that the scheduled reboot task is not working on a VM/physical machines.

The VmgUptimeTest included in the eG agent monitors the uptime of each VM on a XenServer host/physical machines.

Outputs of the test:One set of results for every user_on_poweredonphysicaldesktop/virtualdesktop

Measurement Description Measurement Unit Interpretation
Rebooted Indicates whether the VM has been rebooted during the last measurement period or not   If the value of this measure is Yes, it means that the server was rebooted during the last measurement period. By checking the time periods when this metric changes from No to Yes, an administrator can determine the times when this server was rebooted. The Detailed Diagnosis of this measure, if enabled, lists the TIME, SHUTDOWN DATE, REBOOT DATE, SHUTDOWN DURATION, and IS MAINTENANCE.
Uptime Indicates the time period that the VM/physical machines has been up since the last time this test ran. Secs If the guest has not been rebooted during the last measurement period and the agent has been running continuously, this value will be equal to the measurement period. If the VM was rebooted during the last measurement period, this value will be less than the measurement period of the test. For example, if the measurement period is 300 secs, and if the VM was rebooted 120 secs back, this metric will report a value of 120 seconds. The accuracy of this metric is dependent on the measurement period - the smaller the measurement period, greater the accuracy.
Total_uptime Indicates the total time that the VM/physical machines has been up since its last reboot.   This measure displays the number of years, months, days, hours, minutes and seconds since the last reboot. Administrators may wish to be alerted if a VM has been running without a reboot for a very long period. Setting a threshold for this metric allows administrators to determine such conditions.

Note:

If a value less than a minute is configured as the TEST PERIOD of this test, then, the Uptime measure will report the value 0 for Unix VMs (only) until the minute boundary is crossed. For instance, if you configure the Uptime - Guest test to run every 10 seconds, then, for the first 5 test execution cycles (i.e., 10 x 5 = 50 seconds), the Uptime measure will report the value 0 for Unix VMs; however, the sixth time the test executes (i.e, when test execution touches the 1 minute boundary), this measure will report the value 60 seconds for the same VMs. Thereafter, every sixth measurement period will report 60 seconds as the uptime of the Unix VMs. This is because, Unix-based operating systems report uptime only in minutes and not in seconds.