eG Monitoring
 

Measures reported by IBMChassBladLEDTest

The BladeCenter chassis contains multiple blade slots to accommodate blade servers, also called blades or server blades. The blades are independent servers containing one or more processors, memory, disk storage, and network controllers. Each blade slot is designed with a set of LEDs to indicate the various states:

  • Power-This green LED indicates the power status of the blade server.

  • Error or Fault-When this amber LED is lit, it indicates that a system error has occurred in the blade server. The blade-error LED turns off only after the error is corrected.

  • Information-When this amber LED is lit, it indicates that information about a system event in the blade server has been placed in the Advanced-Management-Module event log.

Using these LEDs, administrators can find out the health, power state and errors (if any) of the blade slots at a single glance. Critical or fatal errors, power failures or connectivity failures of the blade slots can render the blades unavailable/inoperable. This in turn affects performance of the blades as well as the target chassis. To prevent such eventualities, it is imperative that administrators should closely monitor the blades and take immediate measures before the users complaint.

This test auto-discovers the blades on the target chassis and reports the availability and current health of each blade server. In addition, this test also reports the power supply status and the status of error LED of each blade.

Outputs of the test : One set of results for each blade in the BladeCenter chassis being monitored.

Descriptor of the test : Blade

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
bladeExistense Indicates whether/not this blade slot exists.   The values that this measure can report and the numeric values they indicate have been listed in the table below:

Measure Value Numeric Value
Yes 1
No 0

Note:

By default, this measure can report the Measure Values mentioned above indicating whether each blade slot exists or not. However, the graph of this measure is indicated using the numeric equivalents.
bladePwrState Indicates the current power state of this blade slot.   The values that this measure can report and the numeric values they indicate have been listed in the table below:

Measure Value Numeric Value
Off 0
On 1
Standby 3
Hibernate 4

Note:

By default, this measure can report the Measure Values mentioned above while indicating the current status of each blade slot. However, the graph of this measure is indicated using the numeric equivalents.
bladeHealthState Indicates the current health of this blade slot.   The values that this measure can report and the numeric values they indicate have been listed in the table below:

Measure Value Numeric Value
Unknown 0
Good 1
Warning 2
Critical 3
Kernal mode 4
Discovering 5
Common Error 6
No Power 7
Flashing 8
Initialization Failure 9
Insufficient Power 10
Power Denied 11

Note:

By default, this measure can report the Measure Values mentioned above while indicating the current health of each blade slot. However, the graph of this measure is indicated using the numeric equivalents.
bladeErrLedState Indicates the current error LED state of this blade slot.   The values that this measure can report and the numeric values they indicate have been listed in the table below:

Measure Value Numeric Value
Off 0
On 1

Note:

By default, this measure can report the Measure Values mentioned above while indicating the current status of the error LED of each blade slot. However, the graph of this measure is indicated using the numeric equivalents.