IBM Support

IC68184: DURING REMOTE CATCHUP, STANDBY MAY HANG INDEFINITELY IF PRIMARY COULD NOT FIND A LOG.

Subscribe

You can track all active APARs for this component.

APAR status

  • Closed as program error.

Error description

  • During normal operation, an HADR Standby requests logs from the
    HADR Primary while in 'Remote Catchup' state.  If the requested
    logs are not available on the Primary then the Standby will shut
    down.  In rare, random, cases the Standby may hang during its
    shut down.
    
    Typical db2diag.log messages that you would see on the Standby
    
    server:
    
    2010-03-17-23.01.21.363452-300 I139143A368        LEVEL: Error
    PID     : 692376               TID  : 8997        PROC : db2sysc
    0
    INSTANCE: db2p51               NODE : 000
    EDUID   : 8997                 EDUNAME: db2hadrs (P51) 0
    FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduS,
    probe:21210
    MESSAGE : HADR: Missing log on primary. Standby shutting down
    
    2010-03-17-23.01.21.363657-300 I139512A398        LEVEL: Error
    PID     : 692376               TID  : 8997        PROC : db2sysc
    0
    INSTANCE: db2p51               NODE : 000
    EDUID   : 8997                 EDUNAME: db2hadrs (P51) 0
    FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduS,
    probe:21210
    RETCODE : ZRC=0x87800148=-2021654200=HDR_ZRC_BAD_LOG
              "HADR standby found bad log"
    
    2010-03-17-23.01.21.363808-300 I139911A342        LEVEL: Warning
    PID     : 692376               TID  : 8997        PROC : db2sysc
    0
    INSTANCE: db2p51               NODE : 000
    EDUID   : 8997                 EDUNAME: db2hadrs (P51) 0
    FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduS,
    probe:20342
    MESSAGE : Info: Standby Finished.
    
    2010-03-17-23.01.21.588793-300 I146338A350        LEVEL: Warning
    PID     : 692376               TID  : 8997        PROC : db2sysc
    0
    INSTANCE: db2p51               NODE : 000
    EDUID   : 8997                 EDUNAME: db2hadrs (P51) 0
    FUNCTION: DB2 UDB, High Availability Disaster Recovery,
    hdrEduEntry, probe:21145
    MESSAGE : Waiting for hdrEDUTerminate
    
    Even though we see 'Standby Finished' message in db2diag.log, we
    would still see db2hadrs thread running, which can be confirmed
    using the
    'db2pd -edus' command. The rest of the threads like db2shred,
    db2redow,
    db2redom will have exited.
    
    Typical stack trace for db2hadrs thread would look like:
    <StackTrace>
    -------Frame------ ------Function + Offset------
    0x09000000000D6FD8 thread_wait + 0x98
    0x090000000557A90C sqloWaitEDUWaitPost + 0x1A4
    0x0900000007312C24 @146@hdrEduEntry__FPcUi + 0x250
    0x090000000549AD7C sqloEDUEntry + 0x1C
    </StackTrace>
    

Local fix

  • db2_kill
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * All users of DB2 High Availability Disaster Recovery (HADR)  *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * The HADR Standby hangs during a shut down (deactivation).    *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Upgrade to DB2 V9.7 Fixpack 3.                               *
    ****************************************************************
    

Problem conclusion

  • The HADR Standby won't hang during shut down.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IC68184

  • Reported component name

    DB2 FOR LUW

  • Reported component ID

    DB2FORLUW

  • Reported release

    970

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2010-04-26

  • Closed date

    2010-10-19

  • Last modified date

    2010-10-19

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    DB2 FOR LUW

  • Fixed component ID

    DB2FORLUW

Applicable component levels

  • R970 PSN

       UP



Document information

More support for: DB2 for Linux, UNIX and Windows

Software version: 9.7

Reference #: IC68184

Modified date: 19 October 2010