eG Monitoring
 

Measures reported by MsSqlCluProcsTest

This test reports the current state and resource usage of the cluster process, and promptly alerts administrators if the cluster goes down or is up and consuming more resources than it should.

Note:

This test will report metrics only if the eG agent is installed on a remote Windows host. The eG agent should be installed using the credentials of a domain user possessing valid permission to log into the target Microsoft SQL server.

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
Availability Indicates the availability of the cluster service. Percent The availability is 100% when the cluster process is running.

This measure will report the value 0, if the cluster process is not running.

Num_procs_running Indicates the number of instances of the cluster process that are currently running. Number The value 0 for this measure is indicative of the non-availability of the cluster service.

The detailed diagnosis of this measure reveals the process ID of each running instance of the cluster process and the resource and I/O usage of each instance, so that you can quickly identify which instance is resource-intensive.

Cpu_util Indicates the percentage of CPU resources used by the cluster process. Percent A value close to 100% indicates that the cluster process is hogging the CPU resources of the ‘active’ node in the cluster.
Handle_count Indicates the number of handles opened by the process. Number An increasing trend in this measure is indicative of a memory leak in the process.
No_of_threads Indicates the number of threads that are used by the process. Number  
Virtual_memory_used Indicates the amount of virtual memory that is being used by the process. MB  
Private_memory_used Indicates the amount of memory that this process has been allotted, that cannot be shared with other processes. MB  
Pool_paged_bytes Indicates the amount of memory currently in the pool paged area of system memory. MB  
Pool_Nonpaged_bytes Indicates the amount of memory currently in the pool non-paged area of system memory. MB  
IO_data_rate Indicates the rate at which processes are reading and writing bytes in I/O operations. Kbytes/Sec This value counts all I/O activity generated by each process and includes file, network and device I/Os.
IO_data_oper_rate Indicates the rate at which the process is issuing read and write data to file, network and device I/O operations. Operations/Sec  
IO_read_data_rate Indicates the rate at which the process is reading data from file, network and device I/O operations. Kbytes/Sec  
IO_write_data_rate Indicates the rate at which the process is writing data to file, network and device I/O operations. Kbytes/Sec  
Page_fault_rate Indicates the total rate at which page faults are occurring for the threads of all matching processes. Faults/Sec A page fault occurs when a thread refers to a virtual memory page that is not in its working set in main memory. This may not cause the page to be fetched from disk if it is on the standby list and hence already in main memory, or if it is in use by another process with whom the page is shared.
Working_set Indicates the current size of the working set of a process. MB The Working Set is the set of memory pages touched recently by the threads in the process. If free memory in the computer is above a threshold, pages are left in the Working Set of a process even if they are not in use. When free memory falls below a threshold, pages are trimmed from Working Sets. If they are needed they will then be soft-faulted back into the Working Set before leaving main memory.

The detailed diagnosis for this test provides details of the individual process instances and their individual working sets.

Comparing the working set across process instances indicates which instances taking up excessive memory.