eG Monitoring
 

Measures reported by NutClusFileServTest

Nutanix Files is a software-defined scale-out file storage solution that lets you share files in a centralized and protected location to eliminate the requirement of a third-party file server.

Files uses a scale-out architecture that provides file services to clients through Server message Block (SMB) and Network File System (NSP) protocols. Files combine one or more file server VMs (FSVMs) into a logical file server instance sometimes referrred to as a File Cluster. You can create multiple file servers within a single Nutanix cluster.

Files creates a volume group (VG) for every FSVM to provide stable storage for persistent states and audit events. During a service outage, the states, storage, and events of a VG fail-over to another FSVM. Files also creates a dedicated container for every file server instance. In order to protect Nutanix files users from malware and viruses, you need to address both the client and the file server. Nutanix currently supports third party vendors that use Internet Content Adaptation Protocol ( ICAP ) servers. If too many files from a file server are quarantined by the ICAP server or if too many files are disconnected from the file server during scans, then those files may not be available to the users. Similarly, if the file server runs out of space, the files in the file server will not be updated and hence the files may be outdated. This may lead to frustrated users and a poor user expereince. To ensure that the file servers are upto date and are scanned periodically, administrators can use the NutClusFileServTest.

This test auto-discovers the file servers in the target Nutanix Acropolis Prism Element and for each file server, reports the number of file shares, SMB connections initiated and the file server VMs. The count of files that were scanned and quarantined by the ICAP server from the file server throws light on the file server that is frequently prone to malicious attacks. The I/O operations performed on the file servers are periodically monitored and the file server that is taking too long to perform I/O operations is identified. The space utilization of the file servers are also periodically monitoredand the file sserver that is running out of space is identified. This way, administrators can isolate those file servers that are problematic and initiate troubleshooting at the earliest.

Outputs of the test: One set of results for each Nutanix file server on the target the Nutanix Acropolis Prism Element being monitored.

Descriptor: Nutanix File Server

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
No_of_NVMS Indicates the total number of file server VMs in this file server. Number

The detailed diagnosis of this measure lists the name of each file server VM, IP address, vCPUs and size of the memory allocated to each file server VM.

No_of_NTP Indicates the number of NTP servers that are synchornizing with the file server VMs in this file server. Number

Use the detailed diagnosis of this measure to figure out the names of the NTP servers that are synchronizing with the file server VMs.

SMB_conn Indicates the total number of SMB connections initiated by this file server. Number

 

Number_shares Indicates the number of file shares in this file server. Number

Use the detailed diagnosis of this measure to figure out the name of the file shares available in the file server.

ICAP_latency Indicates the time taken by this file server to connect to the ICAP server. Secs

A low value is desired for this measure.

Compare the value of this measure across file servers to identify the file server that is taking too long to connect to the ICAP server.

ICAP_scan_file Indicates the total number of files (available in this file server) that were scanned by the ICAP server. Number

Compare the value of this measure across file servers to identify the file server from which maximum number of files were scanned successfully by the ICAP server.

ICAP_quarantine Indicates the total number of files (from this file server) that were quarantined by the ICAP server. Number

Files that are infected are generally quarantined and cannot be accessed by the users. A consistently high value for this measure is a cause of concern.

Files of certain extensions such as .dat,.ini can be incorrectly quarantined by the ICAP server. Administrators need to analyze whether the files are incorrectly quarantined or whether the files are infected due to malicious attacks and initiate troubleshooting at the earliest.

Comparing the number of quarantined fies across file servers will help you in identifying the file server on which maximum number of files are quarantined.

ICAP_clean Indicates the number of files (from this file server) that were cleaned by the ICAP server. Number

Compare the value of this measure across file servers to identify the file server that topped the number of files cleaned by the ICAP server.

ICAP_disconnect Indicates the number of files that were disconnected (from this file server) by the ICAP server. Number

 

ICAP_qDepth Indicates the number of files (fromthis file server) that are in the scan queue waiting for an ICAP server. Number

A high value for this measure may indicate that the ICAP server is down.

ICAP_throughput Indicates the amount of data processed by the ICAP server while processing the files of this file server. MB

 

ICAP_thread Indicates the total number of scanning ICAP threads available for this file server. Number

 

Ctrl_IOBand Indicates the amount of data used to perform read and write I/O operations per second on this file server. KBps

Compare the value of this measure across file server to identify the file server on which maximum amount of data was used to perform read and write I/O operations.

Ctrl_IOPS Indicates the time taken to read from and write to this file server. Number

Compare the value of this measure across file servers to identify the file server that is busy in terms of read and write I/O operations.

Ctrl_latency Indicates the time taken to read from and write to this file server. Secs

A high value for this measure is a cause of concern.

Compare the value of this measure across file servers to identify the file server on which the read and write operations are taking too long to complete.

User_capcity Indicates the amount of storage that is allocated for the users accessing the file server VMs of this file server. GB

 

Total_capacity Indicates the total capacity/size of this file server. GB

 

Used_capacity Indicates the size of this file server that is already utilized. GB

A value close to the Total capacity measure indicates that the File Sever is running out of space.

Free_capacity Indicates the size of this file server that is available for use. GB

A high value is desired for this measure.

Used_perc Indicates the capacity of this file server that is already utilized, expressed as percentage. Percent

A value close to 100 percent indicates that the file server is running out of space.

Free_perc Indicates the capacity of this file server that is available for use, expressed as percentage. Percent

A high value is desired for this measure.

Snapshot_capcity Indicates the amount of space that is available for snapshots in this file server. GB