eG Monitoring
 

Measures reported by  UptimeTest 

In most production environments, it is essential to monitor the uptime of critical servers in the infrastructure. By tracking the uptime of each of the servers, administrators can determine what percentage of time a server has been up. Comparing this value with service level targets, administrators can determine the most trouble-prone areas of the infrastructure.

In some environments, administrators may schedule periodic reboots of their servers. By knowing that a specific server has been up for an unusually long time, an administrator may come to know that the scheduled reboot task is not working on a server.

The  UptimeTest  included in the eG agent monitors the uptime of critical Windows and Unix servers.

Measurement Description Measurement Unit Interpretation
Rebooted Indicates whether the server has been rebooted during the last measurement period or not   If the value of this measure is Yes, it means that the server was rebooted during the last measurement period. By checking the time periods when this metric changes from No to Yes, an administrator can determine the times when this server was rebooted. The Detailed Diagnosis of this measure, if enabled, lists the TIME, SHUTDOWN DATE, REBOOT DATE, SHUTDOWN DURATION, and IS MAINTENANCE.
Uptime Indicates the time period that the system has been up since the last time this test ran. Secs If the server has not been rebooted during the last measurement period and the agent has been running continuously, this value will be equal to the measurement period. If the server was rebooted during the last measurement period, this value will be less than the measurement period of the test. For example, if the measurement period is 300 secs, and if the server was rebooted 120 secs back, this metric will report a value of 120 seconds. The accuracy of this metric is dependent on the measurement period - the smaller the measurement period, greater the accuracy.  
Total_uptime Indicates the total time that the server has been up since its last reboot.   This measure displays the number of years, months, days, hours, minutes and seconds since the last reboot. Administrators may wish to be alerted if a server has been running without a reboot for a very long period. Setting a threshold for this metric allows administrators to determine such conditions.  

Note:

For a Unix host, if a value less than a minute is configured as the TEST PERIOD of the Uptime test, then, the Uptime measure will report the value 0 until the minute boundary is crossed. For instance, if you configure the Uptime test for a Unix host to run every 10 seconds, then, for the first 5 test execution cycles (i.e., 10 x 5 = 50 seconds), the Uptime measure will report the value 0 only; however, the sixth time the test executes (i.e, when test execution touches the 1 minute boundary), this measure will report the value 60 seconds. This way, every sixth measurement period will report 60 seconds as the uptime of the host. This is because, Unix hosts report uptime only in minutes and not in seconds.