eG Monitoring
 

Measures reported by HanaRepServiceTest

This test auto-discovers the secondary servers that are communicating with the target database server and for each secondary server, this test reports the amount of data that needs to be currently replicated. This test also throws light on the time taken for completing the replication and the time that elapsed since the last delta replication /last full replication were initiated on each secondary server. In addition, the difference in log position and savepoint log position between each primary server and the secondary server is reported. Using this test, administrators can determine the secondary server that is up to-date with the primary server and on which the replication is completed successfully.

Outputs of the test: One set of results for each Primary server:Secondary server combination communicating with the target SAP HANA database server being monitored.

Descriptor: Primary server: Secondary server

The measures made by this test are as follows:

Measurement Description Measurement Unit Interpretation
Repl_backlog_size Indicates the amount of data that needs to be currently replicated to this secondary server. MB

A high value for this measure indicates that the secondary server is not up to-date with the data available in the primary server. This may lead to data loss if the primary server goes down abruptly.

Compare the value of this measure across the secondary servers to figure out the secondary server that needs to be updated with the maximum amount of data.

The detailed diagnosis of this measure lists the active status of the secondary server, the connect time of the secondary server, the reconnect count of the secondary server, failover count of the secondary server, the async buffer full count, the backlog size, the maximum backlog size, the backlog time and maximum backlog time.

Repl_backlog_time Indicates the time taken to transfer the data that needs to be replicated to this secondary server. Seconds

A high value for this measure is a cause of concern. This may indicate that the secondary server is not up to-date with the data available in the primary server.

Compare the value of this measure across the secondary servers to figure out the server that takes too much of backlog time for data replication.

Last_deltaship_diff Indicates the time elapsed since the last delta replication was initiated on this secondary server. Seconds

The detailed diagnosis of this measure lists the active status of the secondary server, the connect time of the secondary server, the reconnect count of the secondary server, the shipped delta replica count, the shipped last delta replica size, the shipped last delta replica start time and the shipped last delta replica end time.

Last_fullship_diff Indicates the time elapsed since the last full replication was initiated on this secondary server. Seconds

The detailed diagnosis of this measure lists the active status of the secondary server, the connect time of the secondary server, the reconnect count of the secondary server, the last full replica size shipped, the last full replica shipped start time and the last full replica shipped end time.

A full set of the data created as HANA in-place snapshot on the disk of the primary is initially sent when system replication is set up.

Lag_ship_logpost Indicates the difference between the current log position of the primary server and the log position of this secondary server. Number

The log position is determined as the position from which logs must be read during restart.

The detailed diagnosis of this measure lists the active status of the secondary server, the connect time of the secondary server, the reconnect count of the secondary server, the last log position, the last log position time, the shipped log position and the shipped log position time.

Lag_ship_logpost_time Indicates the difference detected in the current log position time of the primary server and the log position time of this secondary server. Seconds

 

Lag_savpt_logpost Indicates the difference between the savepoint log position of the primary server and this secondary server. Number

SAP HANA persists in-memory data by using savepoints. Each SAP HANA service has its own separate savepoint. During a savepoint operation, SAP HANA database flushes all changed data from memory to the data volumes. The data belonging to a savepoint represents a consistent state of the data on disk and remains so until the next savepoint operation has completed. Redo log entries are written to the log volumes for all changes to persistent data. In the event of a database restart (for example, after a crash), the data from the last completed savepoint can be read from the data volumes, and the redo log entries written to the log volumes since the last savepoint can be replayed. Savepoints are also triggered automatically by a number of other operations such as data backup, and database shutdown and restart.

The detiailed diagnosis of this measure lists the active status of the secondary server, the connect time of the secondary server, the reconnect count of the secondary server, the last savepoint log position, the last savepoint start time, the shipped savepoint log position and the shipped savepoint start time.

Lag_savpt_logpost_time Indicates the difference in the savepoint log position time between the primary server and this secondary server. Seconds