eG Monitoring
 

Measures reported by UCSCsIOModuleTest

The Cisco UCS chassis contains I/O Modules or Fabric Extenders that allow the blade servers in the chassis to communicate with Cisco UCS Fabric Interconnects. The chassis supports up to two I/O Modules, each with four I/O ports.

The Cisco UCS Fabric Extenders bring the unified fabric into the blade server enclosure, providing 10 Gigabit Ethernet connections between blade servers and the fabric interconnect, simplifying diagnostics, cabling, and management.

The Cisco UCS Fabric Extenders extend the I/O fabric between the Cisco UCS Fabric Interconnects and the Cisco Blade Server Chassis, enabling a lossless and deterministic Fibre Channel over Ethernet (FCoE) fabric to connect all blades and chassis together. Because the fabric extender is similar to a distributed line card, it does not do any switching and is managed as an extension of the fabric interconnects. This approach removes switching from the chassis, which reduces overall infrastructure complexity and enables the Cisco Unified Computing System to scale to many chassis without multiplying the number of switches needed. It also reduces TCO and allows all chassis to be managed as a single, highly available management domain.

The Cisco UCS Fabric Extenders also manage the chassis environment (the power supply and fans as well as the blades) in conjunction with the Fabric Interconnects. Therefore, separate chassis management modules are not required.

Cisco UCS Fabric Extenders fit into the back of the Cisco UCS Chassis. Each Cisco UCS Chassis can support up to two Fabric Extenders, enabling increased capacity as well as redundancy.

This test monitors the overall health of each of the I/O Modules present in every chassis managed by the Cisco UCS Manager and, in the process, promptly alerts you to abnormalities in the power, thermal, and voltage states of the modules, as well as to sudden spikes in the ambient/ASIC temperature of the modules. This way, defective I/O Modules come to light.
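As an illustration of what flagging a sudden temperature spike could look like, here is a minimal Python sketch. It is not how the test itself is implemented: the function name `is_spike`, the 10-degree margin, and the sample readings are all assumptions made for the example.

```python
from statistics import mean

def is_spike(history, latest, margin_c=10.0):
    """Return True if `latest` exceeds the mean of `history` by more than `margin_c`.

    `history` holds earlier temperature readings in Celsius. The margin is an
    assumed value for illustration, not a threshold taken from the product.
    """
    if not history:
        # With no earlier readings there is no baseline to compare against.
        return False
    return latest - mean(history) > margin_c

# Hypothetical ambient temperature readings for one I/O Module, in Celsius.
ambient_history = [24.0, 25.0, 24.5, 25.5]

print(is_spike(ambient_history, 26.0))  # False: close to the recent average
print(is_spike(ambient_history, 40.0))  # True: well above the recent average
```

Comparing against a recent average rather than a fixed limit is one simple way to catch sudden changes; a fixed upper limit could be checked alongside it.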

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
ConfigState Indicates the configuration status of the I/O Modules present in this chassis.   This measure reports the configuration status of the I/O Modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Un-initialized
1 Un-acknowledged
2 Unsupported-connectivity
3 Ok
4 Removing

Note:

By default, this measure reports the above-mentioned States while indicating the configuration status of the I/O Modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents i.e., 0 to 4.

The detailed diagnosis of this measure provides the Time, ID, PID, Side, Chassis ID, Fabric ID, Revision, Serial Number and Vendor attributes for each I/O Module.
OperState Indicates the overall status of the I/O Modules present in this chassis.   This measure reports the status of the I/O Modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Operable
2 Inoperable
3 Degraded
4 Powered-off
5 Power-problem
6 Removed
7 Voltage-problem
8 Thermal-problem
9 Performance-problem
10 Accessibility-problem
11 Identity-unestablishable
12 Bios-post-timeout
13 Disabled
51 Fabric-conn-problem
52 Fabric-unsupported-conn
81 Config
82 Equipment-problem
83 Decommissioning
84 Chassis-limit-exceeded
101 Discovery
102 Discovery-failed
103 Identify
104 Post-failure
105 Upgrade-problem
106 Peer-comm-problem
107 Auto-upgrade

Note:

By default, this measure reports the above-mentioned States while indicating the status of the I/O Modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Operability Indicates the operating state of the I/O Modules present in this chassis.   This measure reports the operating state of the I/O Modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Operable
2 Inoperable
3 Degraded
4 Powered-off
5 Power-problem
6 Removed
7 Voltage-problem
8 Thermal-problem
9 Performance-problem
10 Accessibility-problem
11 Identity-unestablishable
12 Bios-post-timeout
13 Disabled
51 Fabric-conn-problem
52 Fabric-unsupported-conn
81 Config
82 Equipment-problem
83 Decommissioning
84 Chassis-limit-exceeded
101 Discovery
102 Discovery-failed
103 Identify
104 Post-failure
105 Upgrade-problem
106 Peer-comm-problem
107 Auto-upgrade

Note:

By default, this measure reports the above-mentioned States while indicating the operating state of the I/O Modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Performance Indicates the current performance status of the I/O Modules present in this chassis.   This measure reports the current performance status of the I/O Modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Ok
2 Upper-non-recoverable
3 Upper-critical
4 Upper-non-critical
5 Lower-non-critical
6 Lower-critical
7 Lower-non-recoverable

Note:

By default, this measure reports the above-mentioned States while indicating the performance status of the I/O Modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Power Indicates the power status of the I/O Modules present in this chassis.   This measure reports the power status of the I/O Modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 On
2 Test
3 Off
4 Online
5 Offline
6 Offduty
7 Degraded
8 Power-save
9 Error

Note:

By default, this measure reports the above-mentioned States while indicating the power status of the I/O Modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states i.e., 0 to 9.
Presence Indicates the current state of the I/O Modules present in this chassis.   This measure reports the current state of the I/O Modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Empty
10 Equipped
11 Missing
12 Mismatch
13 Equipped-not-primary
20 Equipped-identity-unestablishable
30 Inaccessible
40 Unauthorized

Note:

By default, this measure reports the above-mentioned States while indicating the current state of the I/O Modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
ThermalState Indicates the thermal state of the I/O Modules present in this chassis.   This measure reports the thermal state of the I/O Modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Ok
2 Upper-non-recoverable
3 Upper-critical
4 Upper-non-critical
5 Lower-non-critical
6 Lower-critical
7 Lower-non-recoverable

Note:

By default, this measure reports the above-mentioned States while indicating the thermal state of the I/O Modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Voltage Indicates the voltage state of the I/O Modules present in this chassis.   This measure reports the voltage state of the I/O Modules and their numeric equivalents as shown in the table:

Numeric Value State
0 Unknown
1 Ok
2 Upper-non-recoverable
3 Upper-critical
4 Upper-non-critical
5 Lower-non-critical
6 Lower-critical
7 Lower-non-recoverable

Note:

By default, this measure reports the above-mentioned States while indicating the voltage state of the I/O Modules in this chassis. However, the graph of this measure will be represented using the corresponding numeric equivalents of the states as mentioned in the table above.
Ambient Indicates the ambient temperature of this I/O Module. Celsius An abnormal temperature may cause severe damage to the I/O Module.
Asic Indicates the ASIC (Application-Specific Integrated Circuit) temperature of this I/O Module. Celsius An application-specific integrated circuit (ASIC) is an integrated circuit (IC) customized for a particular use, rather than intended for general-purpose use.

If an ASIC registers an abnormal temperature, it may severely affect the operations of the I/O module in which that ASIC operates.
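
Because the graphs of the state measures use numeric equivalents, it can help to translate a graphed value back into its state name. The following Python sketch does this for some of the measures, using the numeric values listed in the tables above. The helper name `decode_state` and the `STATE_TABLES` dictionary are hypothetical and are not part of the product.

```python
# Numeric-to-state tables copied from this section (a subset of the measures).
# Performance, ThermalState and Voltage share the same set of states.
SENSOR_STATES = {
    0: "Unknown",
    1: "Ok",
    2: "Upper-non-recoverable",
    3: "Upper-critical",
    4: "Upper-non-critical",
    5: "Lower-non-critical",
    6: "Lower-critical",
    7: "Lower-non-recoverable",
}

STATE_TABLES = {
    "ConfigState": {
        0: "Un-initialized",
        1: "Un-acknowledged",
        2: "Unsupported-connectivity",
        3: "Ok",
        4: "Removing",
    },
    "Power": {
        0: "Unknown", 1: "On", 2: "Test", 3: "Off", 4: "Online",
        5: "Offline", 6: "Offduty", 7: "Degraded", 8: "Power-save", 9: "Error",
    },
    "Presence": {
        0: "Unknown", 1: "Empty", 10: "Equipped", 11: "Missing",
        12: "Mismatch", 13: "Equipped-not-primary",
        20: "Equipped-identity-unestablishable",
        30: "Inaccessible", 40: "Unauthorized",
    },
    "Performance": SENSOR_STATES,
    "ThermalState": SENSOR_STATES,
    "Voltage": SENSOR_STATES,
}

def decode_state(measure, value):
    """Hypothetical helper: map a graphed numeric value to its state name."""
    table = STATE_TABLES[measure]
    # Values not listed in the table are reported as unrecognized, not guessed.
    return table.get(value, f"Unrecognized ({value})")

print(decode_state("ConfigState", 3))   # Ok
print(decode_state("Presence", 10))     # Equipped
print(decode_state("ThermalState", 3))  # Upper-critical
```

The OperState and Operability tables can be added to `STATE_TABLES` in the same way.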