eG Monitoring
 

Measures reported by OHMGpuSensorTest

This test monitors each GPU available in the hardware unit and reports the voltage, temperature, and load of each GPU. In addition, it reports the clock speed of each GPU and the average speed of its fans. This way, administrators can be alerted to a potential overload condition on a GPU and can identify issues that may affect its functioning.

The measures made by this test are as follows:

Measurement | Description | Measurement Unit | Interpretation
Volt_utilized | Indicates the current voltage of this GPU. | Volts |
Clock_speed | Indicates the current clock speed of this GPU. | MHz |
Temp_utilized | Indicates the current temperature of this GPU. | Celsius | The value of this measure should be within permissible limits. A sudden or gradual increase in the value of this measure may affect the functioning of the server and should be attended to immediately.
Load_utilized | Indicates the percentage of load handled by this GPU. | Percent | Comparing the value of this measure across GPUs will help you identify the GPU that is handling the maximum load.
No_of_turns | Indicates the average speed of the fans in this GPU. | RPM |
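
The interpretation guidance above (compare Load_utilized across GPUs, keep Temp_utilized within permissible limits) can be illustrated with a short Python sketch. This is not part of the eG product: the GpuSample class, the helper functions, the sample readings, and the 85-degree limit are all hypothetical and chosen only for illustration. The permissible temperature for real hardware should come from the GPU vendor's specifications.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GpuSample:
    """One set of readings for a GPU, using the measure names from the table."""
    name: str
    volt_utilized: float   # Volts
    clock_speed: float     # MHz
    temp_utilized: float   # Celsius
    load_utilized: float   # Percent
    no_of_turns: float     # RPM (average fan speed)


def busiest_gpu(samples: List[GpuSample]) -> GpuSample:
    """Return the GPU with the highest Load_utilized value."""
    return max(samples, key=lambda s: s.load_utilized)


def over_temperature(samples: List[GpuSample], limit_celsius: float) -> List[str]:
    """Return the names of GPUs whose Temp_utilized exceeds the given limit."""
    return [s.name for s in samples if s.temp_utilized > limit_celsius]


# Hypothetical readings, not real measurements.
samples = [
    GpuSample("GPU0", 0.95, 1350.0, 62.0, 35.0, 1400.0),
    GpuSample("GPU1", 1.05, 1700.0, 88.0, 92.0, 2600.0),
]

# Assumed limit for illustration only.
TEMP_LIMIT_CELSIUS = 85.0

print("Highest load:", busiest_gpu(samples).name)
print("Over temperature limit:", over_temperature(samples, TEMP_LIMIT_CELSIUS))
```

With these sample readings, GPU1 carries the highest load and is also the only GPU above the assumed temperature limit, which is the kind of correlation an administrator would look for when investigating an overloaded GPU.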