Measures reported by DockContUptimeTest
In environments where Docker server is used extensively, it is essential to monitor the uptime of critical containers within the Docker server. By tracking the uptime of each of the containers, administrators can determine what percentage of time a container has been up. Comparing this value with service level targets, administrators can determine the most trouble-prone areas of the infrastructure.
In some environments, administrators may schedule periodic reboots of their containers. By knowing that a specific container has been up for an unusually long time, an administrator may come to know that the scheduled reboot task is not working on a container. This is where the DockContUptimeTest helps!
This test included in the eG agent monitors the uptime of critical containers in the target Docker server.
Outputs of the test : One set of results for each container available in the Docker server being monitored
The measures made by this test are as follows:
| Measurement |
Description |
Measurement Unit |
Interpretation |
| cont_reboot |
Indicates whether/not this container was rebooted. |
|
The values reported by this measure and their numeric equivalents are available in the table below:
| Measure Value |
Numeric Value |
| No |
0 |
| Yes |
1 |
Note:
This measure reports the Measure Values listed in the table above while indicating whether /not this container was rebooted. However, in the graph of this measure, the measure is indicated using only the Numeric Values listed in the above table.
|
| time_diff |
Indicates the time duration for which this container has been up since the last time this test ran. |
Secs |
If this container has not been rebooted during the last measurement period and the agent has been running continuously, this value will be equal to the measurement period. If this container was rebooted during the last measurement period, this value will be less than the measurement period of the test. The accuracy of this metric is dependent on the measurement period - the smaller the measurement period, greater the accuracy. |
| total_uptime |
Indicates the total time that this container has been up since its last reboot. |
|
This measure displays the number of years, months, days, hours, minutes and seconds since the last reboot. Administrators may wish to be alerted if a container has been running without a reboot for a very long period. Setting a threshold for this metric allows administrators to determine such conditions. |
|