eG Monitoring
 

Measures reported by ODBSrvcHlhTest

A service incident is an event that affects the delivery of the OneDrive for Business service. Service incidents may be caused by hardware or software failure in the Microsoft data center, a faulty network connection between the customer and Microsoft, or a major data center challenge such as fire, flood, or regional catastrophe. Most service incidents can be addressed using Microsoft technology and process solutions and are resolved within a short time. However, some service incidents are more serious and can lead to longer term outages. To avoid such unpleasant events and ensure the high uptime and peak service performance, administrators should be able to detect issues in the OneDrive for Business service much before users complain. The ODBSrvcHlhTest helps administrators in this regard!

This test reports the status of the OneDrive for Business service in real-time, thus proactively alerting administrators to service disruptions and outages. The test additionally reveals if any service incidents are occurring, and elaborately describes such incidents vide detailed diagnostics. If OneDrive for Business has been stopped as part of a planned maintenance activity, then this test indicates the same by reporting the count of maintenance events associated with OneDrive for Business.

Note:

This test uses the Microsoft Graph API. Tests that use Microsoft Graph API may not start reporting metrics right away. Sometimes, they may go without reporting metrics for over 48 hours. This is normal behavior, and it occurs because, Microsoft does not collect/refresh the metrics as frequently as the test executes.

Outputs of the test: One set of results for the monitored Microsoft OneDrive for Business.

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
Service_Status Indicates the current health status of the service.  

If the service is not experiencing any service incidents currently. , then this measure will report the value Healthy. On the other hand, if even one service incident is occurring on the service, then this measure will report the value Service Degraded.

The numeric values that correspond to these measure values are discussed in the table below:

Measure Value Numeric Value
Healthy 1
Service degraded 0

Note:

By default, this measure reports the Measure Values listed in the table above to indicate current health status of a service. In the graph of this measure however, the same is indicated using the numeric equivalents only.

Service_incidents Indicates the number of service incidents that are currently occurring on the service. Number

Unplanned service incidents occur when OneDrive for Business is unavailable or unresponsive.

Use the detailed diagnosis of this measure to know the complete details of the service incidents.

Maintenance_events Indicates the number of maintenance events currently occurring on the service. Number

Planned maintenance is regular Microsoft-initiated service updates to the infrastructure and software applications. Microsoft typically plans maintenance for times when service usage is historically at its lowest based on regional time zones.