IBM Support

IT29837: RDQM synchronization may become extremely slow or fail when resuming a suspended node.

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • When resuming a suspended RDQM HA node, the
    resynchronization stalled leaving the replication state in
    "Inconsistent"
    this prevents the queue managers' from moving back to their
    preferred node.
    
    Here is an example of the output from the drbdadm status
    command:
    
    [root@node3.localdomain.com ~]# drbdadm status
    ...
    rdqm02 role:Primary
      disk:UpToDate
      node1.localdomain.com role:Secondary
        peer-disk:UpToDate
      node2.localdomain.com role:Secondary
        replication:SyncSource peer-disk:Inconsistent done:98.35
    
    and
    
    [root@node3.localdomain.com ~]# rdqmstatus -m rdqm02
    Node:                                   node3.localdomain.com
    Queue manager status:                   Running
    CPU:                                    0.00%
    Memory:                                 180MB
    Queue manager file system:    1983MB used, 98.3GB allocated [2%]
    HA role:                                Primary
    HA status:                              Mixed
    HA control:                             Enabled
    HA current location:                    This node
    HA preferred location:                  node2.localdomain.com
    This node
    HA floating IP interface:               None
    HA floating IP address:                 None
    
    Node:
    node1.localdomain.com
    HA status:                              Normal
    
    Node:
    node2.localdomain.com
    HA status:                           Synchronization in progress
    HA synchronization progress:            100.0%
    HA estimated time to completion:        2019-07-25 12:20:52
    ----------------------------------------------------------------
    

Local fix

Problem summary

  • ****************************************************************
    USERS AFFECTED:
    This issue affects users of IBM MQ RDQM
    
    
    Platforms affected:
    Linux on x86-64
    
    ****************************************************************
    PROBLEM DESCRIPTION:
    A timing condition triggered a defect which prevented the disk
    synchronization process from completing.
    This occurs after suspending and resuming node(s) in the RDQM
    configuration.
    

Problem conclusion

  • The timing condition has been fixed.
    
    ---------------------------------------------------------------
    The fix is targeted for delivery in the following PTFs:
    
    Version    Maintenance Level
    v9.1 CD    9.1.4
    v9.1 LTS   9.1.0.4
    
    The latest available maintenance can be obtained from
    'WebSphere MQ Recommended Fixes'
    http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006037
    
    If the maintenance level is not yet available information on
    its planned availability can be found in 'WebSphere MQ
    Planned Maintenance Release Dates'
    http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006309
    ---------------------------------------------------------------
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT29837

  • Reported component name

    IBM MQ BASE MP

  • Reported component ID

    5724H7271

  • Reported release

    910

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2019-07-25

  • Closed date

    2019-09-27

  • Last modified date

    2019-09-27

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    IBM MQ BASE MP

  • Fixed component ID

    5724H7271

Applicable component levels

[{"Business Unit":{"code":"BU053","label":"Cloud & Data Platform"},"Product":{"code":"SSYHRD","label":"IBM MQ"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"910","Edition":"","Line of Business":{"code":"LOB45","label":"Automation"}}]

Document Information

Modified date:
27 September 2019