eG Monitoring
 

Measures reported by UCSCsSrvNicsTest

This test auto-discovers the NICs (Network Interface Cards) supported by the UCS Blade servers, monitors the overall health, operational state, and load on each NIC, and promptly notifies administrators when an NIC suddenly switches to an abnormal state, becomes overloaded, or encounters errors while sending/receiving data over the network. This way, you can easily isolate problematic, over-used, and error-prone NICs.

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
Overall_status Indicates the current state of this NIC.   The values reported by this measure and their corresponding numeric values are described in the table below:

Measure Value Numeric Value
Unknown 0
Operable 1
Inoperable 2
Degraded 3
Powered off 4
Power-problem 5
Removed 6
Voltage-problem 7
Thermal-problem 8
Performance-problem 9
Accessibility-problem 10
Identity-unestablishable 11
Bios-post-timeout 12
Disabled 13
Fabric-conn-problem 51
Fabric-unsupported-conn 52
Config 81
Equipment-problem 82
Decommissioning 83
Chassis-limit-exceeded 84
Not-supported 100
Discovery 101
Discovery-failed 102
Identify 103
Post-failure 104
Upgrade-problem 105
Peer-comm-problem 106
Auto-upgrade 107

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the overall state of an NIC. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.

The detailed diagnosis of this measure provides the complete details of an NIC such as its ID, Vendor, vNIC, PCIE Address, MAC, Original MAC, Purpose, Name, and Type.

Operability Indicates the current operational state of this NIC.   The values reported by this measure and their corresponding numeric values are described in the table below:

Measure Value Numeric Value
Unknown 0
Operable 1
Inoperable 2
Degraded 3
Powered off 4
Power-problem 5
Removed 6
Voltage-problem 7
Thermal-problem 8
Performance-problem 9
Accessibility-problem 10
Identity-unestablishable 11
Bios-post-timeout 12
Disabled 13
Fabric-conn-problem 51
Fabric-unsupported-conn 52
Config 81
Equipment-problem 82
Decommissioning 83
Chassis-limit-exceeded 84
Not-supported 100
Discovery 101
Discovery-failed 102
Identify 103
Post-failure 104
Upgrade-problem 105
Peer-comm-problem 106
Auto-upgrade 107

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the operational state of an NIC. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.
Admin_state Indicates the current administrative state of this NIC.   The values reported by this measure and their corresponding numeric values are described in the table below:

Measure Value Numeric Value
Enabled 0
Reset-connectivity-active 1
Reset-connectivity-passive 2
Reset-connectivity 3

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the administrative state of an NIC. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.
Discovery Indicates the current discovery state of this NIC.   The values reported by this measure and their corresponding numeric values are described in the table below:

Measure Value Numeric Value
Absent 0
Present 1
Mis-connect 2
Missing 3
New 4

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the discovery state of an NIC. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.
Presence Indicates the current presence state of this NIC.   The values reported by this measure and their corresponding numeric values are described in the table below:

Measure Value Numeric Value
Unknown 0
Empty 1
Equipped 10
Missing 11
Mismatch 12
Equipped-not-primary 13
Equipped-identity-unestablishable 20
Mismatch-identity-unestablishable 21
Inaccessible 30
Unauthorized 40
Not-supported 100

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the presence state of an NIC. However, in the graph of this measure, states will be represented using their corresponding numeric equivalents only.
Data_received Indicates the amount of data received by this NIC during the last measurement period. MB These measures are good indicators of the load handled by an NIC. By comparing the value of each measure across NICs, you can quickly identify which NIC is experiencing heavy data traffic and when - while receiving data? or while transmitting data?
Data_transmitted Indicates the amount of data transmitted by this NIC during the last measurement period. MB
Packets_received Indicates the number of packets received by this NIC during the last measurement period. Packets These measures are good indicators of the load handled by an NIC. By comparing the value of each measure across NICs, you can quickly identify which NIC is experiencing heavy data traffic and when - while receiving data? or while transmitting data?
Packets_transmitted Indicates the number of packets sent by this NIC during the last measurement period. MB
Dropped_pkts_received Indicates the number of dropped packets received by this NIC during the last measurement period. Packets  
Dropped_pkts_transmitted Indicates the number of dropped packets transmitted by this NIC during the last measurement period. Packets  
Errors_received Indicates the errors encountered by this NIC while receiving data during the last measurement period. Errors Ideally, the value of both these measures should be 0. A non-zero value indicates that one/more errors have occurred on an NIC. If these measure values increase with time, you may want to compare the value of each of these measures across NICs to quickly zero-in on the error-prone NICs and understand when the maximum number of errors occurred on those NICs - while transmitting data? or while receiving it?
Errors_transmitted Indicates the errors encountered by this NIC while transmitting data during the last measurement period. Errors