| eG Monitoring |
|---|
|
Measures reported by HdpAppStatsTest Apache YARN (Yet Another Resource Negotiator) is Hadoop’s cluster resource management system. YARN provides its core services via two types of long-running daemon: a resource manager (one per cluster) to manage the use of resources across the cluster, and node managers running on all the nodes in the cluster to launch and monitor containers. A container executes an applicationspecific process with a constrained set of resources (memory, CPU, and so on). To run an application on YARN, a client contacts the resource manager and asks it to run an application master process. The resource manager is the master daemon of YARN and is responsible for resource assignment and management among all the applications. Whenever it receives a processing request, it forwards it to the corresponding node manager and allocates resources for the completion of the request accordingly. The node manager launches the application master in a container. Precisely what the application master does once it is running depends on the application. It could simply run a computation in the container it is running in and return the result to the client. Or it could request more containers from the resource managers, and use them to run a distributed computation. In an ideal world, the requests that a YARN application makes would be granted immediately. In the real world, however, resources are limited, and on a busy cluster, an application will often need to wait to have some of its requests fulfilled. The Scheduler component of the resource manager manages resource allocations to applications. Typically, the scheduler places applications in queues and allocates resources to them based on pre-defined policies. The true test of the efficiency of the scheduler lies in how quickly it makes resources available to the applications/jobs in queues. Delay in resource allocation is one of the common causes for application jobs to fail or remain pending in the queues for a long time. By tracking the status of jobs in queues, an administrator can proactively detect potential application slowness. This is where the HdpAppStatsTest test helps! This test auto-discovers the queues spawned by the YARN scheduler, and for each queue, reports the count of jobs that are in different states of activity. In the process, the test points administrators to those queues that have one/more failed jobs, long-running jobs, and those with jobs that have not been assigned a container yet. This way, administrators can be proactively alerted to potential issues in resource allocations by the scheduler. Outputs of the test : One set of the results for each queue spawned by the scheduler component of the resource manager of the target Hadoop cluster. The measures made by this test are as follows:
|