eG Monitoring
 

Measures reported by UCSCsFanModuleTest

The Cisco UCS Blade server chassis contains eight hot-swappable fan modules. These fan modules ensure that the internals of the chassis always receive adequate air flow and the temperature within the chassis is maintained at acceptable levels at all times. Snags in the functioning of the fan module can hence hamper air flow, which in turn may have disastrous effects on the health of the other chassis components.

By periodically monitoring the availability, overall health, operational state, and exhaust temperature of each fan module, you can promptly detect abnormalities in the operations of the module and initiate speedy remedial measures. This test does just that.

The measures made by this test are as follows:

Measurement | Description | Measurement Unit | Interpretation
OperState | Indicates the overall status of the fan modules present in this chassis. | | This measure reports the status of the fan modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Operable
2 Inoperable
3 Degraded
4 Powered-off
5 Power-problem
6 Removed
7 Voltage-problem
8 Thermal-problem
9 Performance-problem
10 Accessibility-problem
11 Identity-unestablishable
12 Bios-post-timeout
13 Disabled
51 Fabric-conn-problem
52 Fabric-unsupported-conn
81 Config
82 Equipment-problem
83 Decommissioning
84 Chassis-limit-exceeded
101 Discovery
102 Discovery-failed
103 Identify
104 Post-failure
105 Upgrade-problem
106 Peer-comm-problem
107 Auto-upgrade

Note:

By default, this measure reports the above-mentioned States while indicating the status of the fan modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.

The detailed diagnosis of this measure provides the Time, ID, PID, Module, Revision, Serial Number, Tray and Vendor attributes for each fan module in this chassis.
Operability | Indicates the operating state of the fan modules present in this chassis. | | This measure reports the operating state of the fan modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Operable
2 Inoperable
3 Degraded
4 Powered-off
5 Power-problem
6 Removed
7 Voltage-problem
8 Thermal-problem
9 Performance-problem
10 Accessibility-problem
11 Identity-unestablishable
12 Bios-post-timeout
13 Disabled
51 Fabric-conn-problem
52 Fabric-unsupported-conn
81 Config
82 Equipment-problem
83 Decommissioning
84 Chassis-limit-exceeded
101 Discovery
102 Discovery-failed
103 Identify
104 Post-failure
105 Upgrade-problem
106 Peer-comm-problem
107 Auto-upgrade

Note:

By default, this measure reports the above-mentioned States while indicating the operating state of the fan modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Performance | Indicates the current performance status of the fan modules present in this chassis. | | This measure reports the current performance status of the fan modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Ok
2 Upper-non-recoverable
3 Upper-critical
4 Upper-non-critical
5 Lower-non-critical
6 Lower-critical
7 Lower-non-recoverable

Note:

By default, this measure reports the above-mentioned States while indicating the performance status of the fan modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Power | Indicates the power status of the fan modules present in this chassis. | | This measure reports the power status of the fan modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 On
2 Test
3 Off
4 Online
5 Offline
6 Offduty
7 Degraded
8 Power-save
9 Error

Note:

By default, this measure reports the above-mentioned States while indicating the power status of the fan modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states, i.e., 0 to 9.
Presence | Indicates the presence state of the fan modules in this chassis, i.e., whether or not each fan module exists in the chassis. | | This measure reports the presence state of the fan modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Empty
10 Equipped
11 Missing
12 Mismatch
13 Equipped-not-primary
20 Equipped-identity-unestablishable
30 Inaccessible
40 Unauthorized

Note:

By default, this measure reports the above-mentioned States while indicating the current state of the fan modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Thermal | Indicates the thermal state of the fan modules present in this chassis. | | This measure reports the thermal state of the fan modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Ok
2 Upper-non-recoverable
3 Upper-critical
4 Upper-non-critical
5 Lower-non-critical
6 Lower-critical
7 Lower-non-recoverable

Note:

By default, this measure reports the above-mentioned States while indicating the thermal state of the fan modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
AmbientTemp | Indicates the current exhaust temperature of the fans in this fan module. | Celsius | An abnormal temperature may damage the fans, so the temperature of the fans must stay well within normal limits.
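
Because the graphs of the state measures plot numeric equivalents rather than state names, it can help to map those numbers back to names when reading exported graph data. The following is a minimal Python sketch of such a lookup, built only from the tables above. The names `STATE_TABLES` and `decode_state` are hypothetical and are not part of eG Enterprise or any Cisco API; the sketch also assumes that a value missing from a table should be reported as unrecognized rather than guessed.

```python
# Illustrative only: these dictionaries restate the state tables documented
# above. None of these names come from the eG product or from Cisco UCS.

# Shared by the OperState and Operability measures.
OPER_STATES = {
    0: "Unknown", 1: "Operable", 2: "Inoperable", 3: "Degraded",
    4: "Powered-off", 5: "Power-problem", 6: "Removed",
    7: "Voltage-problem", 8: "Thermal-problem", 9: "Performance-problem",
    10: "Accessibility-problem", 11: "Identity-unestablishable",
    12: "Bios-post-timeout", 13: "Disabled",
    51: "Fabric-conn-problem", 52: "Fabric-unsupported-conn",
    81: "Config", 82: "Equipment-problem", 83: "Decommissioning",
    84: "Chassis-limit-exceeded",
    101: "Discovery", 102: "Discovery-failed", 103: "Identify",
    104: "Post-failure", 105: "Upgrade-problem",
    106: "Peer-comm-problem", 107: "Auto-upgrade",
}

# Shared by the Performance and Thermal measures.
THRESHOLD_STATES = {
    0: "Unknown", 1: "Ok", 2: "Upper-non-recoverable", 3: "Upper-critical",
    4: "Upper-non-critical", 5: "Lower-non-critical", 6: "Lower-critical",
    7: "Lower-non-recoverable",
}

POWER_STATES = {
    0: "Unknown", 1: "On", 2: "Test", 3: "Off", 4: "Online",
    5: "Offline", 6: "Offduty", 7: "Degraded", 8: "Power-save", 9: "Error",
}

# Presence codes are sparse (1, 10, 11, ...), so a dict is safer than a list.
PRESENCE_STATES = {
    0: "Unknown", 1: "Empty", 10: "Equipped", 11: "Missing",
    12: "Mismatch", 13: "Equipped-not-primary",
    20: "Equipped-identity-unestablishable", 30: "Inaccessible",
    40: "Unauthorized",
}

STATE_TABLES = {
    "OperState": OPER_STATES,
    "Operability": OPER_STATES,
    "Performance": THRESHOLD_STATES,
    "Thermal": THRESHOLD_STATES,
    "Power": POWER_STATES,
    "Presence": PRESENCE_STATES,
}


def decode_state(measure: str, value: int) -> str:
    """Return the state name for a numeric graph value of the given measure.

    Raises KeyError for a measure that has no state table (for example
    AmbientTemp, which is a temperature reading, not a state).
    """
    table = STATE_TABLES[measure]
    # Assumption: values absent from the documented table are flagged,
    # not mapped to the nearest known state.
    return table.get(value, f"Unrecognized ({value})")


if __name__ == "__main__":
    print(decode_state("Presence", 10))
    print(decode_state("Thermal", 3))
    print(decode_state("Power", 10))
```

Dictionaries are used instead of lists because several tables skip values (for example, Presence jumps from 1 to 10), so positional indexing would give wrong names.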