HADR takeover service
The HADR takeover service is responsible to take over from a primary database when a connection problem occurs between IBM® Security Guardium® Key Lifecycle Manager master server and the primary database in the Multi-Master cluster. When the primary database is down, the takeover operation is initiated on a standby database so that the user operations are not hindered during the outage.
You can configure agent.takeover.svc.interval property in the <SKLM_HOME>/config/SKLMConfig.properties file, for example, C:\Program Files\IBM\WebSphere\AppServer\products\sklm\config\SKLMConfig.properties to set the time interval for running HADR takeover service. For more information about the configuration property, see agent.takeover.svc.interval.
DB2® High Availability Disaster Recovery (HADR) is used in IBM Security Guardium Key Lifecycle Manager Multi-Master cluster. Configuring DB2 HADR protects you against data loss by transmitting data changes from a primary database to standby databases. Under normal conditions, DB2 HADR keeps the DB2 HADR primary and standby databases in sync.
Agents are installed on all the master servers in the cluster. Agent services track the availability of IBM Security Guardium Key Lifecycle Manager related ports. If the primary database is down, takeover service instructs the HADR standby database to take over as the new HADR primary database.
For the takeover operation, the primary and standby databases are continuously synchronized by using a secure communication channel. A set of DB2 HADR and WebSphere® Application Server configuration parameters are automatically updated for the takeover operation by using the configuration services that the agent runs. For more information about the various configuration services, see Configuration services.
IBM Security Guardium Key Lifecycle Manager supports the failback option. You can configure the primary database to take over the primary role when it comes up.
- Takeover service of
Instance 1
(primary master server) checks the database status (Primary Database
) by using DB2 commands. - If the
Primary Database
is down,Instance 2
(standby master server) receives takeover request from the primary server. TheStandby Database
takes over as thePrimary Database
. - The primary master server receives a message from standby to indicate whether the takeover operation is successful. When the takeover operation fails, takeover service on the primary server sends takeover requests to the next standby if the cluster is configured with multiple standby servers.
- When the old primary database server is up, takeover service starts HADR on it as standby.
For more information about prerequisites for DB2 HADR configuration, see Database configuration for high availability disaster recovery (HADR).
Manually initiating takeover operation
- Locate the sklmTakeoverHADR script.
- Windows
- <SKLM_INSTALL_HOME>\agent
Default location is C:\Program Files\IBM\SKLMV41\agent.
- Linux®
- <SKLM_INSTALL_HOME>/agent
Default location is /opt/IBM/SKLMV41\agent.
- Open a command prompt and run the script.
- Windows
- Go to the <SKLM_INSTALL_HOME>\agent directory and run
the following
command:
For example,sklmTakeoverHADR.bat <WAS_HOME> [IP_HOSTNAME] [AGENT_PORT]
sklmTakeoverHADR.bat "C:\Program Files\IBM\WebSphere\AppServer" 9.113.37.10 60015
- Linux
- Go to the <SKLM_INSTALL_HOME>/agent directory and run
the following
command:
For example,sklmTakeoverHADR.sh <WAS_HOME> [IP_HOSTNAME] [AGENT_PORT]
./sklmTakeoverHADR.sh /opt/IBM/WebSphere/AppServer 9.113.37.10 60015