eG Monitoring
 

Measures reported by UCSFIConnsFanModuleTest

A Cisco UCS fabric interconnect is a top-of-rack fabric interconnect that provides Ethernet and Fibre Channel to all servers in the UCS system. Servers connect to the fabric interconnect, and it connects to the LAN or SAN. Depending on the model of the Cisco UCS, the fabric interconnects may contain upto four fan modules. Each fan module contains four fans. The fans in the fan module help ensure adequate air flow to the internals of the fabric interconnect. If the fans in the fan modules are inoperable, then, the temperature of the fabric interconnect may increase which would eventually lead to the malfunctioning of the fabric interconnects. Therefore, it is necessary to constantly monitor the overall status, operational state and the temperature of each fan module housed in the fabric interconnects at regular intervals. The UCSFIConnsFanModuleTest test helps administrators in this regard!

This test auto-discovers the fan modules of the fabric interconnects and for each fan module, this test reports the overall status, power state, operational state and the temperature of the fans. Using this test, administrators can figure out the faulty fan module and replace it before abnormalities are detected in the functioning of the fan modules.

Outputs of the test : One set of results for each fan module in each fabric interconnect managed by the Cisco UCS Manager being monitored.

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
OperState Indicates the overall status of this fan module in this fabric interconnect.   The States reported by this measure and their corresponding numeric equivalents are described in the table below:

State Numeric Value
Unknown 0
Operable 1
Inoperable 2
Degraded 3
Powered-off 4
Power-problem 5
Removed 6
Voltage-problem 7
Thermal-problem 8
Performance-problem 9
Accessibility-problem 10
Identity-unestablishable 11
Bios-post-timeout 12
Disabled 13
Fabric-conn-problem 51
Fabric-unsupported-conn 52
Config 81
Equipment-problem 82
Decommissioning 83
Chassis-limit-exceeded 84
Discovery 101
Discovery-failed 102
Identify 103
Post-failure 104
Upgrade-problem 105
Peer-comm-problem 106
Auto-upgrade 107
Not Available -5

Note:

By default, this measure reports the above-mentioned States while indicating the overall status of a fan module. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.

The detailed diagnosis of this measure provides the Time, ID, PID, Module, Revision, Serial Number, Tray and Vendor attributes for each fan module in the Fabric Interconnect.

Operability Indicates the current operating state of this fan module in this fabric interconnect.   The States reported by this measure and their corresponding numeric equivalents are described in the table below:

State Numeric Value
Unknown 0
Operable 1
Inoperable 2
Degraded 3
Powered-off 4
Power-problem 5
Removed 6
Voltage-problem 7
Thermal-problem 8
Performance-problem 9
Accessibility-problem 10
Identity-unestablishable 11
Bios-post-timeout 12
Disabled 13
Fabric-conn-problem 51
Fabric-unsupported-conn 52
Config 81
Equipment-problem 82
Decommissioning 83
Chassis-limit-exceeded 84
Discovery 101
Discovery-failed 102
Identify 103
Post-failure 104
Upgrade-problem 105
Peer-comm-problem 106
Auto-upgrade 107
Not Available -5

Note:

By default, this measure reports the above-mentioned States while indicating the operational state of a fan. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.

Performance Indicates the current performance state of this fan module in this fabric interconnect.   The States reported by this measure and their corresponding numeric equivalents are described in the table below:

State Numeric Value
Ok 1
Upper-non-recoverable 2
Upper-critical 3
Upper-non-critical 4
Lower-non-critical 5
Lower-critical 6
Lower non-recoverable 7
Not Available -5

Note:

By default, this measure reports the above-mentioned States while indicating the performance state of a fan. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.

Power Indicates the current power state of this fan module in this fabric interconnect.   The States reported by this measure and their corresponding numeric equivalents are described in the table below:

State Numeric Value
Unknown 0
On 1
Test 2
Off 3
Online 4
Offline 5
Offduty 6
Degraded 7
Power-save 8
Error 9
Not Available -5

Note:

By default, this measure reports the above-mentioned States while indicating the power state of a fan. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.

Presence Indicates the current state of this fan module in this fabric interconnect.   The States reported by this measure and their corresponding numeric equivalents are described in the table below:

State Numeric Value
Unknown 0
Empty 1
Equipped 10
Missing 11
Mismatch 12
Equipped-not-primary 13
Equipped-identity-unestablishable 20
Mismatch-identity-unestablishable 21
Inaccessible 30
Unauthorized 40
Not Available -5

Note:

By default, this measure reports the above-mentioned States while indicating the current state of a fan. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.

Thermal Indicates the current thermal state of this fan module in this fabric interconnect.   The States reported by this measure and their corresponding numeric equivalents are described in the table below:

State Numeric Value
Unknown 0
Ok 1
Upper-non-recoverable 2
Upper-critical 3
Upper-non-critical 4
Lower-non-critical 5
Lower-critical 6
Lower non-recoverable 7
Not Available -5

Note:

By default, this measure reports the above-mentioned States while indicating the current thermal state of a fan. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.

AmbientTemp Indicates the current temperature of the fans present in this fan module in this fabric interconnect. Celsius Ideally, the value of this measure should be low, as an abnormal temperature can cause damage to the fans in the fan module.