IBM Support

PI33497: WMQ: CSQE034E CSQ0 CSQERWI2 RECOVERY PHASE 2 FAILED FOR STRUCTURE RC=00000004 REASON=00000412

A fix is available

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • An outage occurred for the Queue Sharing Group (QSG).  One
    memeber of the QSG received
     CSQE034E CSQ0 CSQERWI2 Recovery phase 2 failed for
     structure CICS connection name CSQEMQGPCSQ002, RC=00000004
     reason=00000412
    
    xxxx0412 means
     IxlRsnCodeLockHeldBySys:  For a LOCKOPER=READNEXT or
     LOCKOPER=TEST request, the lock is not generally available.
     The lock is not held by any connection, but instead is held by
     the system.
    
    Another member of the QSG received
     Dump Title: CSQL,ABN=5C6-00C510AD,U=SYSOPR  ,C=R3600.710.CFM
     -CSQEOCRQ,M=CSQGFRCV,LOC=CSQELPLM.CSQEOCRQ+00000C1A
    
     Dump Title: CSQL,ABN=026-08110102,U=SYSOPR  ,C=R3600.710.CFM
     -CSQERWI2,M=CSQGFRCV,PSW=470C400081442702
    
     Dump Title: ABEND=S026,REASON=08118001,CONNECTOR HANG:
     CONNAME=CSQEMQGPCSQL06,JOBNAME=CSQLMSTR
    
    In response to an EEPLDISCFAILCONNECTION event, a queue manager
    may need to get locks on the list headers in the structure in
    order to perform recovery processing.  This results in an
    unconditional list lock request being made from phase 2
    recovery. If the lock was held by one of the failed
    connections, XES will not release the lock until the connection
    failure event has been resolved.  This causes the structure
    recovery processing to hang in the unconditional list lock
    request.  This task remains stuck until XES hang resolution
    processing abends the queue manager task with the  026 abend.
    
    
    
    Additional Symptom(s) Search Keyword(s):
    
    00C510AD 08110102 08118001 ABEND026 ABENDS026 026 S026 S0026
    0412
    

Local fix

  • Restart of the failed queue manager should be successful
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED: All users of WebSphere MQ for z/OS Version 7 *
    *                 Release 1 Modification 0.                    *
    ****************************************************************
    * PROBLEM DESCRIPTION: Peer level recovery (PLR) for a failed  *
    *                      connector to a structure hangs waiting  *
    *                      for a lock unconditionally, leading to  *
    *                      abend 026-08110101 in CSQELPLM.CSQERWI2 *
    *                      initially, followed later by abend      *
    *                      026-08110102 in CSQELPLM.CSQERWI2 and   *
    *                      abnormal queue manager termination      *
    *                      6C6 with REASON=00C510AB.               *
    *                      While the connector is hung, other      *
    *                      queue managers attempting to lock the   *
    *                      same list header can fail with return   *
    *                      code IxlRsnCodeLockHeldBySys (00000412) *
    ****************************************************************
    * RECOMMENDATION:                                              *
    ****************************************************************
    During Peer Level Recovery (PLR) for a failed connection to a
    structure, CSQERWI2 issues an unconditional request for a lock
    on a list header in the structure.
    While CSQERWI2 waited for the lock to become available, the
    current holder of the lock also failed, leading to XCF taking
    ownership of the lock until the failure events have been
    responded to.
    However, while CSQERWI2 is waiting for the lock, it is unable
    to respond to any failure events, leading to a deadlock
    condition occurring.
    

Problem conclusion

  • CSQERWI2 is changed to handle the holder of the required lock
    failing during PLR.
    100Y
    CSQEQUER
    CSQERWI2
    

Temporary fix

  • *********
    * HIPER *
    *********
    

Comments

APAR Information

  • APAR number

    PI33497

  • Reported component name

    WMQ Z/OS V7

  • Reported component ID

    5655R3600

  • Reported release

    100

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    YesHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2015-01-26

  • Closed date

    2015-07-23

  • Last modified date

    2015-09-02

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

    PI44852 UI29667

Modules/Macros

  • CSQEQUER CSQERWI2
    

Fix information

  • Fixed component name

    WMQ Z/OS V7

  • Fixed component ID

    5655R3600

Applicable component levels

  • R100 PSY UI29667

       UP15/08/11 P F508 ¢

Fix is available

  • Select the PTF appropriate for your component level. You will be required to sign in. Distribution on physical media is not available in all countries.

[{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG19M","label":"APARs - z\/OS environment"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"7.1","Edition":"","Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
02 September 2015