IBM Support

IC74771: SMALL TIMING WINDOW WHERE PARALLEL RECOVERY (ROLLFORWARD OR CRASH RECOVERY) MAY HANG AFTER ALL LOG RECORDS HAVE BEEN PROCESSED.

APAR status

  • Closed as program error.

Error description

  • Crash recovery or rollforward stops making progress after it
    appears that all log records have been processed.  If you
    generate call stacks, you may see something similar to the
    following.
    
    The redo master process is waiting:
    
    0x00002ACC32851DCD ossSleep + 0x0059
                    (/home/eunicec2/sqllib/lib64/libdb2osse.so.1)
    0x00002ACC2C4C770F sqlorest + 0x008f
                    (/home/eunicec2/sqllib/lib64/libdb2e.so.1)
    0x00002ACC2C832662 _Z15sqlpPRecReadLogP8sqeAgentP8SQLP_ACBP14sqlpMasterDbcb + 0x5de8
                    (/home/eunicec2/sqllib/lib64/libdb2e.so.1)
    0x00002ACC2C7FA31E _Z20sqlpParallelRecoveryP8sqeAgentP5sqlca + 0x0d54
                    (/home/eunicec2/sqllib/lib64/libdb2e.so.1)
    0x00002ACC2A6F7AA1 _Z26sqleSubCoordProcessRequestP8sqeAgent + 0x00f3
    
    and so is the redo worker:
    
    0x00002ACC2C59AAA3 sqloDumpDiagInfoHandler + 0x01bb
                    (/home/eunicec2/sqllib/lib64/libdb2e.so.1)
    0x00002ACC27BE3D60 address: 0x00002ACC27BE3D60 ; dladdress: 0x00002ACC27BD6000 ; offset in lib: 0x000000000000DD60 ;
                    (/lib64/libpthread.so.0)
    0x00002ACC3325C25A semtimedop + 0x000a
                    (/lib64/libc.so.6)
    0x00002ACC2C47C5A6 _ZN13sqlo_waitlist11timeoutWaitEi + 0x018c
                    (/home/eunicec2/sqllib/lib64/libdb2e.so.1)
    0x00002ACC2C47B7A9 sqloWaitEDUWaitPost + 0x025b
                    (/home/eunicec2/sqllib/lib64/libdb2e.so.1)
    0x00002ACC2C836FA1 _Z19sqlprWaitDuringPRecP8sqeAgentP16SQLO_EDUWAITPOST + 0x0095
                    (/home/eunicec2/sqllib/lib64/libdb2e.so.1)
    0x00002ACC2C83B30D _Z14sqlprFindQueueP8sqeAgentP9SQLP_DBCBmmPm + 0x04dd
                    (/home/eunicec2/sqllib/lib64/libdb2e.so.1)
    0x00002ACC2C832E6B _Z15sqlpPRecProcLogP8sqeAgentP8SQLP_ACBP14sqlpMasterDbcb + 0x02c7
                    (/home/eunicec2/sqllib/lib64/libdb2e.so.1)
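
    To compare a captured call stack against the two stacks above, a small
    script can look for the same frame names in the same order. The following
    is a minimal sketch, not an IBM tool: the function names
    (contains_in_order, classify_stack) and the returned labels are
    hypothetical, the frame lists are copied from the stacks in this APAR, and
    it assumes the stack text lists the innermost frame first, as shown above.
    A match only indicates the same wait pattern; it does not prove that this
    APAR is the cause.

```python
# Frame names copied from the call stacks in this APAR, innermost first.
MASTER_FRAMES = (
    "ossSleep",
    "sqlorest",
    "sqlpPRecReadLog",
    "sqlpParallelRecovery",
)
WORKER_FRAMES = (
    "semtimedop",
    "sqloWaitEDUWaitPost",
    "sqlprWaitDuringPRec",
    "sqlprFindQueue",
    "sqlpPRecProcLog",
)


def contains_in_order(stack_text, frames):
    """Return True if every frame name occurs in stack_text in the given order.

    Plain substring search is used so that mangled C++ symbols such as
    _Z15sqlpPRecReadLogP8sqeAgent... still match on the function name.
    """
    pos = 0
    for name in frames:
        idx = stack_text.find(name, pos)
        if idx < 0:
            return False
        pos = idx + len(name)
    return True


def classify_stack(stack_text):
    """Label a stack as matching the redo master or redo worker wait, else None."""
    if contains_in_order(stack_text, MASTER_FRAMES):
        return "redo master waiting"
    if contains_in_order(stack_text, WORKER_FRAMES):
        return "redo worker waiting"
    return None
```

    Calling classify_stack() on the text of each stack from the recovery
    process and checking that one stack matches each label is one way to
    confirm that both the redo master and a redo worker are in the wait state
    described here.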
    

Local fix

  • Kill DB2 and restart the recovery.  DB2 resumes from the point
    it had previously reached and completes the recovery
    successfully.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * All                                                          *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * Crash recovery or rollforward stops making progress after it *
    * appears that all log records have been processed. The redo   *
    * master process and the redo worker are both waiting; see the *
    * call stacks under "Error description".                       *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Apply Version 9.5 fixpack 7                                  *
    ****************************************************************
    

Problem conclusion

  • Apply Version 9.5 fixpack 7
    

Temporary fix

Comments

APAR Information

  • APAR number

    IC74771

  • Reported component name

    DB2 FOR LUW

  • Reported component ID

    DB2FORLUW

  • Reported release

    970

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2011-03-01

  • Closed date

    2011-03-01

  • Last modified date

    2011-03-01

  • APAR is sysrouted FROM one or more of the following:

    IC72907

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    DB2 FOR LUW

  • Fixed component ID

    DB2FORLUW

Applicable component levels

  • R950 PSY

       UP



Document information

More support for: DB2 for Linux, UNIX and Windows

Software version: 9.7

Reference #: IC74771

Modified date: 01 March 2011