Measures reported by SPContentDBTest
Content databases are used to store SharePoint data. This data consists of sites, permissions, documents, lists, etc.
A content database is closely related to two other SharePoint objects: a web application and a site collection. A web application is backed by an Internet Information Services (IIS) web site, and it contains one or more content databases which can contain one or more site collections. Think of a site collection as the“container” for data and also as the security boundary. Security is defined at the site collection level allowing administrators to control access to the sites and their data.
Content databases can grow pretty quickly, and if this growth is not tracked and controlled, users may be left with no space for Sharepoint data. Sharepoint administrators should hence prudently and proactively plan their data storage needs, accordingly size the content databases, and effectively manage the space available in the databases, so that manageability, performance, and reliability issues do not arise. This is where the SPContentDBTest test helps!
Besides reporting the state of each content database where Sharepoint data is stored, this test also monitors the size, usage, and growth of every database, thus pointing administrators to those databases that are over-used or are exhibiting alarming growth patterns! In addition, the test provides hints for enhancing the overall performance of the content databases - will it help to cleanup the orphaned items? should the recycle bin storage space be reduced? should the content database host fewer site collections?
The measures made by this test are as follows:
| Measurement |
Description |
Measurement Unit |
Interpretation |
| Is_database_Use |
Indicates whether/not this content database is in use. |
|
The values that this measure can report and their corresponding numeric values are listed in the table below:
| Measure Value |
Numeric Value |
| Yes |
1 |
| No |
0 |
Note:
By default, the measure reports the Measure Values listed in the table above to indicate the usage state of this content database. However, in the graph of this measure, the same will be represented using the numeric equivalents only. |
| ContentDB_size |
Indicates the current size of this content database. |
GB |
Microsoft recommends that no content database be more than 200 GB in size.
Content databases of up to 4 TB are supported when the following requirements are met:
- Disk sub-system performance of 0.25 IOPs per GB. 2 IOPs per GB is recommended for optimal performance.
- You must have developed plans for high availability, disaster recovery, future capacity, and performance testing.
You should also carefully consider the following factors:
- Requirements for backup and restore may not be met by the native SharePoint Server backup for content databases larger than 200 GB. It is recommended to evaluate and test SharePoint Server solutions to determine the best solution for your specific environment.
- It is strongly recommended to have proactive skilled administrator management of the SharePoint Server and SQL Server installations.
- The complexity of customizations and configurations on SharePoint Server may necessitate refactoring (or splitting) of data into multiple content databases. Seek advice from a skilled professional architect and perform testing to determine the optimum content database size for your implementation. Examples of complexity may include custom code deployments, use of more than 20 columns in property promotion, or features listed as not to be used in the over 4 TB section below.
- Refactoring of site collections allows for scale out of a SharePoint Server implementation across multiple content databases. This permits SharePoint Server implementations to scale indefinitely. This refactoring will be easier and faster when content databases are less than 200 GB.
- It is suggested that for ease of backup and restore that individual site collections within a content database be limited to 100 GB.
|
| Disk_space_usage |
Indicates the percentage disk space in the SQL server that is used by this content database. |
Percent |
A high value for this measure is a cause for concern, as it indicates excessive disk space consumption by a content database.
Compare the value of this measure across content databases to identify that database which is eroding the disk space of the SQL server. |
| Growth_rate |
Indicates the percentage growth in the size of this content database since the last measurement period. |
Percent |
A consistent rise in the value of this measure is a sign that this content database is growing rapidly!
To curb this growth, you may want to consider the following measures:
- Use an ootb Record Center as an archive for old content: The users must manually send each document to the RC using e.g. move and leave a link; note that only the latest major version with metadata is kept - all version history is lost. The information management policies supported by SharePoint for retention and disposition can be used to automate the cleanup. As the RC has its own content databases, the live collaboration databases will grow slower or even shrink as outdated information is moved to the archive. Keeping the live databases small ensures shorter recovery time; while the recovery time for the archived content can be considerable, but not business critical. Search must be configured appropriately to cover both live and archived content.
- Use a third-party archiving solution for SharePoint. This has the same pros & cons as the previous option, but the functionality is probably better in relation to keeping version history and batch management of outdated content. Search must be configured appropriately to cover both live and archived content.
- Use a third-party remote blob storage (RBS) solution for SharePoint so that documents are registered in the database, but not stored there. This gives smaller content databases, but more complicated backup and recovery as the content now resides both in databases and on disk. Provided that you don't lose both at the same time, the recovery time should be shorter. Search will work as before, as all content is still logically in the “database”.
- Use powershell scripts or other code to implement the disposition of outdated content. The script can e.g. copy old documents to disk and delete old versions from the content database; the drawback being that all metadata will be lost and there is no link left in SharePoint. The databases size will shrink as data is actually deleted, and backup and recovery is more complicated as content is now both in the database and on disk (same as for option C). Search can be configured to also crawl and index the files on disk, but content ranking will suffer as the valuable metadata is lost.
- Use powershell scripts or other code to implement the disposition of outdated content. The script can e.g. copy old documents to disk and delete old versions from the content database; the drawback being that all metadata will be lost and there is no link left in SharePoint. The databases size will shrink as data is actually deleted, and backup and recovery is more complicated as content is now both in the database and on disk.Search can be configured to also crawl and index the files on disk, but content ranking will suffer as the valuable metadata is lost.
|
| Orphan_sites |
Indicates the number of orphaned sites in this content database. |
Number |
An Orphaned Site is where SharePoint only has partial information and not a complete set of data for a given site collection in your Windows SharePoint Services or SharePoint Portal Server content databases or configuration databases. The site may in fact still be viewable via the browser, but you may notice that many things are broken.
If the Growth_rate measure is increasing consistently, you may want to check the variations in the value of this measure over the same time period to figure out whether/not the existence of too many orphan sites is contributing to the growth in the size of the content database. If so, you may want to cleanup the orphan sites to right-size your database and to ensure optimum performance. |
| ContentDB_site_limit |
Indicates the maximum number of site collections that this content database can host. |
Number |
Microsoft strongly recommends limiting the number of site collections in a content database to 5,000. However, up to 10,000 site collections in a database are supported. Note that in a content database with up to 10,000 total site collections, a maximum of 2,500 of these can be non-Personal site collections. It is possible to support 10,000 Personal site collections if they are the only site collections within the content database.
These limits relate to speed of upgrade. The larger the number of site collections in a database, the slower the upgrade with respect to both database upgrade and site collection upgrades.
The limit on the number of site collections in a database is subordinate to the limit on the size of a content database that has more than one site collection. Therefore, as the number of site collections in a database increases, the average size of the site collections it contains must decrease.
Exceeding the 5,000 site collection limit puts you at risk of longer downtimes during upgrades. If you plan to exceed 5,000 site collections, Microsoft recommends that you have a clear upgrade strategy to address outage length and operations impact, and obtain additional hardware to speed up the software updates and upgrades that affect databases. |
| ContentDB_Conf_limit |
Indicates the percentage of the configured site limit that is used by this content database. |
Percent |
A value close to 100% indicates that the configured site limit is about to be reached.
By comparing the value of this measure across content databases, you can easily identify the database that hosts too many site collections. You may then have to reassess the ability of that content database to handle additional site collections, and accordingly decide whether to reconfigure the site limit or reduce the number of site collections hosted by the database. |
| Recyclebin_space |
Indicates the space used by the items present in the second stage recycle bin of this content database. |
MB |
Recycle Bins are used to help users protect and recover data. Microsoft SharePoint Server 2010 supports two stages of Recycle Bins: the first-stage Recycle Bin and second-stage Recycle Bin.
When a user deletes an item, the item is automatically sent to the first-stage Recycle Bin. By default, when an item is deleted from the first-stage Recycle Bin, the item is sent to the second-stage Recycle Bin.
A high value for this measure could indicate that a large amount of deleted data resides in the second stage recycle bin, unnecessarily consuming disk space and increasing the size of the content database. |
| Recyclebin_growthrate |
Indicates the percentage growth in the space used in the second stage recycle bin of this content database, since the last measurement period. |
Percent |
A consistent increase in the value of this measure indicates that deleted data is steadily accumulating in the recycle bin; this is a cause of concern, as data in the second stage recycle bin can add megabytes to the overall size of the content database!
Every site collection has a second stage recycle bin and the size of this bin must not grow beyond 50 percent of the quota set for that site collection. You may want to reduce this percentage to ensure that the recycle bin does not grow too unwieldy and impact the size and performance of the content database. |
| Needs_upgrade |
Indicates whether/not this database needs to be upgraded. |
|
The values that this measure can report and their corresponding numeric values are listed in the table below:
| Measure Value |
Numeric Value |
| Yes |
1 |
| No |
0 |
Note:
This measure reports the Measure Values listed in the table above to indicate whether/not a database needs an upgrade. In the graph of this measure however, the same is represented using the numeric equivalents only. |
|