eG Monitoring
 

Measures reported by JexFanFailureTest

EX4200 switches have a single fan tray on the rear panel. The fan tray is a hot-removable and hot-insertable field-replaceable unit (FRU): You can remove and replace it without powering off the switch or disrupting switch functions.

The fan tray used in the switch comes with load-sharing redundancy that can tolerate a single fan failure at room temperature (below 113° F/45° C) to still provide sufficient cooling.

Under normal operating conditions, the fans in the fan tray run at less than full speed. If a fan fails or the ambient temperature rises above the threshold 113° F (45° C), the speed of the remaining fans is automatically adjusted to keep the temperature within the acceptable range, 32° F (0° C) through 113° F (45° C).

The system raises an alarm if the fan fails or if the ambient temperature inside the chassis rises above the acceptable range. If the temperature inside the chassis rises above the threshold temperature, the system shuts down automatically.

This test intercepts the fan failure traps sent by the switch, extracts relevant information related to the failure from the traps, and reports the count of these trap messages to the eG manager. This information enables administrators to detect the fan failures if any, understand the nature of these failures, and accordingly decide on the remedial measures.

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
fan_failure_count Indicates the number of events of this type that were triggered during the last measurement period. Percent The failure events may be generated due to the failure of the fans of the Juniper EX Switch. If the failure events are not rectified within a certain pre-defined timeperiod, the storage system will be shutdown automatically.

Ideally, the value of this measure should be zero. A high value is an indication of performance degradation of the Juniper EX Switch.