eG Monitoring
 

Measures reported by UCSChassisFanTest

A Cisco Blade Server Chassis contains the following components:

  • Cisco UCS Fabric Extenders—Up to two fabric extenders (FEX), each FEX provides four ports of 10-Gigabit Ethernet, Cisco Data Center Ethernet, and Fibre Channel over Ethernet (FCoE)

  • SFP+ transceiver choices that include copper and fiber optic

  • Power supply units—Up to four 2500 W hot-swappable power supply units

  • Fan modules—Eight hot-swappable fan modules

  • Cisco UCS Blade Servers —Up to eight half-wide blade servers or four full-width blade servers, each holding RAID capable hard drives

This test monitors the overall health of the fans present in this chassis of the Cisco UCS Manager, and proactively alerts users to the following:

  • Fans that are in an abnormal operational state;

  • Fans that are in a critical performance/thermal/voltage state;

  • Fans in a degraded/errored power state;

  • Fans operating at abnormal speeds.

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
OperStat Indicates the overall status of this fan present in this chassis.   This measure reports the status of the fans and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Operable
2 Inoperable
3 Degraded
4 Powered-off
5 Power-problem
6 Removed
7 Voltage-problem
8 Thermal-problem
9 Performance-problem
10 Accessibility-problem
11 Identity-unestablishable
12 Bios-post-timeout
13 Disabled
51 Fabric-conn-problem
52 Fabric-unsupported-conn
81 Config
82 Equipment-problem
83 Decommissioning
84 Chassis-limit-exceeded
101 Discovery
102 Discovery-failed
103 Identify
104 Post-failure
105 Upgrade-problem
106 Peer-comm-problem
107 Auto-upgrade

Note:

By default, this measure reports the above-mentioned States while indicating the status of the fans in each chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.

The detailed diagnosis of this measure provides the Time, ID, PID, Module, Revision, Serial Number, Tray and Vendor attributes for each fan in the chassis.
Operability Indicates the operating state of this fan present in each chassis.   This measure reports the operating state of the fans and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Operable
2 Inoperable
3 Degraded
4 Powered-off
5 Power-problem
6 Removed
7 Voltage-problem
8 Thermal-problem
9 Performance-problem
10 Accessibility-problem
11 Identity-unestablishable
12 Bios-post-timeout
13 Disabled
51 Fabric-conn-problem
52 Fabric-unsupported-conn
81 Config
82 Equipment-problem
83 Decommissioning
84 Chassis-limit-exceeded
101 Discovery
102 Discovery-failed
103 Identify
104 Post-failure
105 Upgrade-problem
106 Peer-comm-problem
107 Auto-upgrade

Note:

By default, this measure reports the above-mentioned States while indicating the operating state of the fans in each chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Performance Indicates the current performance status of this fan present in each chassis.   This measure reports the current performance status of the fans and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Ok
2 Upper-non-recoverable
3 Upper-critical
4 Upper-non-critical
5 Lower-non-critical
6 Lower-critical
7 Lower non-recoverable

Note:

By default, this measure reports the above-mentioned States while indicating the performance status of the fans in each chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Power Indicates the power status of this fan present in each chassis.   This measure reports the power status of the fans and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 On
2 Test
3 Off
4 Online
5 Offline
6 Offduty
7 Degraded
8 Power-save
9 Error

Note:

By default, this measure reports the above-mentioned States while indicating the power status of the fans in each chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states i.e., 0 to 10.
Presence Indicates the current state of this fan present in each chassis i.e., whether the fans exist or not in the chassis.   This measure reports the current state of the fans and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Empty
10 Equipped
11 Missing
12 Mismatch
13 Equipped-not-primary
20 Equipped-identity-unestablishable
30 Inaccessible
40 Unauthorized

Note:

By default, this measure reports the above-mentioned States while indicating the current state of the fans in each chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
ThermalState Indicates the thermal state of this fan present in each chassis.   This measure reports the thermal state of the fans and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Ok
2 Upper-non-recoverable
3 Upper-critical
4 Upper-non-critical
5 Lower-non-critical
6 Lower-critical
7 Lower non-recoverable

Note:

By default, this measure reports the above-mentioned States while indicating the thermal state of the fans in each chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Voltage Indicates the voltage state of this fan present in each chassis.   This measure reports the voltage state of the fans and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Ok
2 Upper-non-recoverable
3 Upper-critical
4 Upper-non-critical
5 Lower-non-critical
6 Lower-critical
7 Lower non-recoverable

Note:

By default, this measure reports the above-mentioned States while indicating the voltage state of the fans in each chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Speed Indicates the speed at which this fan operates in each chassis. Rpm Ideally, the speed of the fans must be within normal limits.