| Measurement |
Description |
Measurement Unit |
Interpretation |
| Filtering_threads |
Indicates the current number of filtering threads in the system. |
Number |
|
| Idle_threads |
Indicates the number of threads that are currently waiting for documents. |
Number |
These threads are not currently doing any work and will eventually be terminated. If you consistently have more than Max Threads/Hosts idle threads, you can schedule an additional crawl. If this number is 0, the crawler is starved for threads. Do not schedule another crawl during this time period; instead, analyze the durations of the crawls that run during it to see whether they meet your freshness goals. If your goals are not being met, you should reduce the number of crawls. |
| Network_threads |
Indicates the number of threads that are waiting for a response from the filter process. |
Number |
If this measure shows no activity and its value equals the value of the Filtering_threads measure, it indicates a network issue or that the server being crawled is unavailable. |
| Committing_threads |
Indicates the number of threads that are committing transactions. |
Number |
|
| Plugin_threads |
Indicates the number of threads currently waiting for plug-ins to complete an operation. |
Number |
These threads hold the filtered documents and are processing them in one of several plug-ins. This is when the index and property store are created. |
| Loading_threads |
Indicates the number of transactions that are loaded from the persisted crawl queue. |
Number |
|
| Link_processing_threads |
Indicates the number of threads that are processing links. |
Number |
|
| Active_filter_processes |
Indicates the number of filtering processes that are active in the system. |
Number |
|
| Filter_objects |
Indicates the number of filter objects in the system. |
Number |
|
| Active_document_queue |
Indicates the number of documents that are waiting for robot threads. |
Number |
If the value of this measure is 0, then it implies that all the threads are filtering threads. |
| Admin_clients |
Indicates the number of currently connected admin clients. |
Number |
|
| Performance_level |
Indicates the amount of resources that the Gatherer service is allowed to use. |
Number |
|
| Current_servers |
Indicates the number of servers that were recently accessed by the system. |
Number |
|
| Unavailable_servers |
Indicates the number of servers that are currently unavailable to the system. |
Number |
A server becomes unavailable if the requests made to it time out. |
| Stemmers_cached |
Indicates the number of cached stemmer instances in the system. |
Number |
Stemmers are components shared by the Search and Indexing engines that generate inflected forms of a word. Too many cached stemmer instances may indicate a resource usage problem. |
| System_IO_rate |
Indicates the rate of system I/O (disk traffic) detected during the back-off period. |
KB/sec |
During a back-off period, indexing is suspended. To manually back off the gatherer service, pause the search service. If the search service itself generates the back-off, an event will be recorded and the search service will be paused automatically. There is no automatic restart, so you must manually start the search service in order to end a back-off state. Note that there is little reason to start the search service until you have solved the problem that caused the back-off in the first place. |
| Time_outs |
Indicates the number of timeouts detected by the system during the last measurement period. |
Number |
Ideally, this value should be zero. |
| Documents_filtered_rate |
Indicates the rate at which the documents are filtered in the system. |
KB/sec |
If this rate is decreasing over time, you should perform some troubleshooting to find out why your server is not filtering documents. Look for memory issues, processor issues, network issues, or site hit frequency rules that slow the gatherer process. |
| Successful_filter_rate |
Indicates the rate at which documents are successfully filtered in the system. |
KB/sec |
|
| Delayed_documents |
Indicates the number of documents that are currently delayed due to site hit frequency rules. |
Number |
If you have many rules and this number is steadily increasing over time, consider relaxing or simplifying your site hit frequency rules. A very high number may indicate a conflict in the rules that the gatherer cannot resolve or follow efficiently. |
| Documents_in_memory |
Indicates the number of document entries that are currently available in the memory of the system. |
Number |
If the value of this measure is 0, it indicates that the indexing activity has been stopped. |
| Documents_filtered |
Indicates the total number of documents filtered in the system during the last measurement period. |
Number |
|
| Docs_successful_filtered |
Indicates the total number of documents that are successfully filtered in the system during the last measurement period. |
Number |
If the value of this measure is less than the value of the Documents_filtered measure, use the gatherer logs to determine why some documents are failing to be filtered. |
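
Several of the interpretations above reduce to simple threshold checks on a single sample of measures. The following Python sketch illustrates how such checks might be automated. It is not part of the product: the `interpret` function, the dictionary layout, and the sample values are hypothetical, and only the measure names and the conditions are taken from the table.

```python
def interpret(sample):
    """Return advisory notes for one sample of gatherer measures.

    `sample` is a hypothetical dict keyed by the measure names in the
    table above. Missing measures are simply skipped.
    """
    notes = []

    # Idle_threads of 0 means the crawler is starved for threads.
    if sample.get("Idle_threads") == 0:
        notes.append("Idle_threads is 0: starved, do not schedule another crawl")

    # Ideally, Time_outs should be zero.
    if sample.get("Time_outs", 0) > 0:
        notes.append("Time_outs is non-zero: timeouts were detected")

    # Documents_in_memory of 0 indicates indexing activity has stopped.
    if sample.get("Documents_in_memory") == 0:
        notes.append("Documents_in_memory is 0: indexing activity has stopped")

    # Fewer successful filters than attempted filters: check gatherer logs.
    filtered = sample.get("Documents_filtered")
    succeeded = sample.get("Docs_successful_filtered")
    if filtered is not None and succeeded is not None and succeeded < filtered:
        notes.append(
            f"{filtered - succeeded} document(s) failed filtering: check gatherer logs"
        )

    return notes


# Illustrative values only; they are not real measurements.
example = {
    "Idle_threads": 0,
    "Time_outs": 2,
    "Documents_in_memory": 5,
    "Documents_filtered": 10,
    "Docs_successful_filtered": 8,
}
for note in interpret(example):
    print(note)
```

Each rule is kept independent so that a sample missing some measures still yields notes for the measures it does contain. Rules that depend on trends over time, such as a steadily rising Delayed_documents value, would need a history of samples and are left out of this sketch.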