Replacing a failed node in a disaster recovery configuration

If you lose one of the appliances in a disaster recovery configuration, you can replace the appliance and restore the disaster recovery configuration by following this procedure.

About this task

Note: These instructions apply to disaster recovery configurations comprising two single appliances, for instructions for an HA configuration with a single DR appliance, see Replacing failed high availability nodes in a disaster recovery configuration. For instructions for a DR configuration consisting of two HA pairs, see Replacing a failed appliance in a DR/HA configuration.
If a disaster occurs such that the appliance in the main site is beyond repair, you delete the disaster recovery configuration while your queue manager runs on the recovery appliance. You then replace the appliance and restore the disaster recovery configuration.

If you lose the appliance at the recovery site, you run a command on the main appliance to recover the crtdrsecondary command that you used to create the secondary instance of the queue manager. You can then rerun the crtdrsecondary on the new recovery appliance.

Procedure

  • Following the loss of the primary queue manager at the main site, take the following steps:
    1. On the recovery appliance, run the following command:
      makedrprimary -m QMname
      
      Where QMname is the name of the queue manager.
    2. Replace the main appliance, and configure the replacement to look like the original.
    3. On the recovery appliance, run the following command to retrieve the crtdrsecondary command:
      dspdrsecondary -m QMname
      
      Where QMname is the name of the queue manager.
    4. On the replacement appliance run the crtdrsecondary command that was output in step 3. For example:
      crtdrsecondary -m QM1 -s 65536 -l myliveappl -i 198.51.100.24 -p 2015 -t a
      
      Synchronization of data from the recovery appliance to the replacement appliance begins. Use the status command to check that synchronization has completed before you proceed.
    5. On the recovery appliance, ensure that the queue manager is not running, and then make it the secondary queue manager:
      makedrsecondary -m QMName
      
    6. On the replacement appliance, make the queue manager the primary queue manager:
      makedrprimary -m QMName
      
    7. On the replacement appliance, start the queue manager:
      strmqm QMname
      
      You have now restored the configuration as it was before the failure at your main site.
  • Following the loss of the secondary queue manager at the recovery site, take the following steps:
    1. Replace the recovery appliance, and configure the replacement to look like the original.
    2. On the main appliance, run the following command to retrieve the crtdrsecondary command:
      dspdrsecondary -m QMname
      
      Where QMname is the name of the queue manager.
    3. On the recovery appliance run the crtdrsecondary command that was output in step 2. For example:
      crtdrsecondary -m QM1 -s 65536 -l myliveappl -i 198.51.100.24 -p 2015 -t a
      
      Synchronization of data from the main appliance to the recovery appliance begins. Use the status command to check that synchronization has completed.