eG Monitoring
 

Measures reported by XIOTargetsTest

A target is a physical port located on the storage controller of the XtremIO Storage Array.

The XtremIO Storage Array supports the following target types:

  • iSCSI - a 10GbE NIC port for connecting to iSCSI networks. There are two iSCSI targets per Storage Controller.

  • FC - an FC HBA port for connecting to fiber optic cable networks. There are two FC targets per Storage Controller.

The cluster targets form the XtremIO Storage Array’s front-end to which application servers connect for receiving storage services. The I/O requests from the host/server are primarily received through these targets, and the responses from the storage array are communicated to the host/server through these targets. The targets are the primary handlers of the I/O request processing in the storage array. By periodically checking the target port status and measuring the I/O load on the ports, you can identify overloaded ports, and thus proactively detect potential/existing load-balancing irregularities and/or processing bottlenecks with the storage controller. The XIOTargetsTest test facilitates this port check.

For every target port configured on the storage controller, this test reports the port state, the I/O load on the ports, the processing ability of the ports, and the errors encountered by each target port. In the process, the test not only points administrators to overloaded ports, but also puts a finger on ports that are slow when processing I/O requests and the ports that are erroneous.

Outputs of the test : One set of results for each X-Brick:Storage Controller:target port of the target EMC XtremIO being monitored

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
portState Indicates the current state of this target port.   The values reported by this measure and their numeric equivalents are available in the table below:

Measure Value Numeric Value
Up 0
Down 1

Note:

This measure reports the Measure Values listed in the table above to indicate the current state of this target. However, in the graph, this measure is indicated using the Numeric Values listed in the above table.

targetHealth Indicates the current health of this target port.   The values reported by this measure and their numeric equivalents are available in the table below:

Measure Value Numeric Value
Level 1 clear 0
Unknown 1

Note:

This measure reports the Measure Values listed in the table above to indicate the health of this target. However, in the graph, this measure is indicated using the Numeric Values listed in the above table.

bandwidth Indicates the amount of data utilized for performing I/O operations per second through this target port. MB/Sec  
readBandwidth Indicates the amount of data utilized for performing read I/O operations per second through this target port. MB/Sec By comparing the values of these measures across the target ports, you can isolate those target ports that consume bandwidth excessively, and also understand where exactly was too much of bandwidth consumed - while reading? or writing?
writeBandwidth Indicates the amount of data utilized for performing write I/O operations were performed per second through this target port. MB/Sec
iops Indicates the total number of reads and writes through this target port per second. IOPS Compare the value of this measure across the target ports to know the target port that handled the maximum number of I/O operations and the port that handled the least. By constantly analyzing these values, administrators can keep a check on the load balancing issues across the target ports.
readIOPS Indicates the number of read I/O operations performed through this target port per second. IOPS  
writeIOPS Indicates the number of write I/O operations performed through this target port per second. IOPS  
reads Indicates the number of reads per second through this target port during the last measurement period. Reads/sec Compare the value of this measure across target ports to identify the port that is handling the maximum number of read requests and the port that is handling the least.
writes Indicates the number of writes handled by this target port per second during the last measurement period. Writes/Sec Compare the value of this measure across target ports to identify the port that is handling the maximum number of write requests and the port that is handling the least.
dataReads Indicates the rate at which data was read through this target port during the last measurement period. MB/Sec Compare the value of these measures across the target ports to identify the slowest target port in terms of servicing read and write requests (respectively).
dataWritten Indicates the rate at which data was written through this target port during the last measurement period. MB/Sec
avgReadSize Indicates the average amount of data read through this target port per I/O operation during the last measurement period. MB/Op Compare the values of these measures across the targets to identify the slowest target in terms of servicing read and write requests (respectively).
avgWriteSize Indicates the average amount of data written through this target port per I/O operation during the last measurement period. MB/Op
lossOfSigCount Indicates the number of times the signal was lost on this target port during the last measurement period. Number Ideally, the value of this measure should be zero. A non-zero value for this measure indicates that the port detected a loss of the electrical or optical signal used to transfer data on the port.

This is likely an indicator for a faulty connector or cable. These are also caused when the device connected to the port is restarted, replaced or being serviced when the Fiber Channel cable connected to the port is temporarily disconnected.

If the port is in the “loss of signal” state for longer than a specific period, the port will get into the link failure state which could degrade the performance of the Fiber Channel link.

This measure is applicable only when the target port is a FC port.
lossOfSigRate Indicates number of signals lost on this target port per second during the last measurement period. Signals/sec A low value is desired for this measure.

This measure is applicable only when the target port is a FC port.
lossOfSyncCount Indicates the number of times this target port failed to synchronize during the last measurement period. Number Ideally, the value of this measure should be zero. A non-zero value for this measure indicates that port went into the “loss of synchronization” state, where it encountered continuous Disparity errors.

This is likely an indicator for a faulty connector or cable. These are also caused when the device connected to the port is restarted, replaced or being serviced when the Fiber Channel cable connected to the port is temporarily disconnected.

If the port is in the “loss of synchronization” state for longer than a specific period, the port will get into the link failure state which could degrade the performance of the Fiber Channel link.

This measure is applicable only when the target port is a FC port.
lossOfSyncRate Indicates the number of times this target port failed to synchronize per second during the last measurement period. Sync/sec Ideally, the value of this measure should be zero.

This measure is applicable only when the target port is a FC port.
invalidCRCCount Indicates the number of invalid CRCs that occurred on this target port during the last measurement period. Number This refers to the number of frames handled by this target that contains checksum errors. Ideally, the value of this measure should be zero.

These are usually recoverable errors and will not degrade system performance unless their occurrence is sustained when the data cannot be relayed after retransmissions.

This measure is applicable only when the target port is a FC port.
invalidCRCRate Indicates the rate at which invalid CRCs occurred on this target port during the last measurement period. CRC/sec Ideally, the value of this measure should be low. A high value for this measure indicates poor health of the target port.

This measure is applicable only when the target port is a FC port.
primSeqProtErrCnt Indicates the number of Primitive Sequence protocol errors that occurred on this target during the last measurement period. Number Ideally, the value of this measure should be zero.

This measure is applicable only when the target port is a FC port.
primSeqProtErrRate Indicates the number of Primitive Sequence protocol errors occurred per second on this target port during the last measurement period. Errors/sec Ideally, the value of this measure should be Zero.

This measure is applicable only when the target port is a FC port.
linkFailureCnt Indicates the number of link failures experienced by this target during the last measurement period. Number Ideally, the value of this measure should be zero. A non-zero value indicates that Fiber Channel connectivity with this target was “broken” that many times. This is likely an indicator for a faulty connector or cable. These are also caused when the device connected to this target is restarted, replaced or being serviced when the Fiber Channel cable connected to this target is temporarily disconnected.

These measures are applicable only when the target port is a FC port.
linkFailureRate Indicates the number of link failures experienced per second by this target during the last measurement period. Failures/sec
dumpedFrames Indicates the number of frames dumped by this target port due to lack of buffer credit during the last measurement period. Number Buffer credits, also called buffer-to-buffer credits (BBC) are used as a flow control method by Fiber Channel technology and represent the number of frames a port can store.

Each time a port transmits a frame that port's BB Credit is decremented by one; for each R_RDY received, that port's BB Credit is incremented by one. Transmission of an R_RDY indicates that the port has processed a frame, freed a receive buffer, and is ready for one more. If the BB Credit is zero, the corresponding node cannot transmit until an R_RDY is received back. A high value for this measure therefore indicates that an R_RDY was not received by the FC port for a long time. This is a cause for concern, as until the R_RDY is received, the FC port will not resume communication.

The solution for this problem is to allocate optimal buffer credits to the FC port. The optimal number of buffer credits is determined by the distance (frame delivery time), the processing time at the receiving port, the link signaling rate, and the size of the frames being transmitted. As the link speed increases, the frame delivery time is reduced and the number of buffer credits must be increased to obtain full link utilization, even in a short-distance environment. Smaller frame sizes need more buffer credits.

This measure is applicable only when the target port is a FC port.