IBM Support

PM87976: After a grid restart, one or more catalog servers occasionally encounter an OOM exception from DCS.


APAR status

  • Closed as program error.

Error description

  • After a grid restart, one or more catalog servers occasionally
    encounter an OOM exception from DCS.
    

Local fix

  • N/A.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:  Users with topologies that generate a large *
    *                  number of placement work units within a     *
    *                  one-minute window.                          *
    ****************************************************************
    * PROBLEM DESCRIPTION: The catalog server might                *
    *                      run out of memory due to a high number  *
    *                      of canceled TimerTasks that have not    *
    *                      been purged from java.util.Timer.       *
    ****************************************************************
    * RECOMMENDATION:                                              *
    ****************************************************************
    Tracking placement work units involves the use of TimerTask
    objects on the catalog server.  These tasks are frequently
    canceled.  However, java.util.Timer does not purge the objects
    at the time of cancellation.  As a result, thousands of
    canceled TimerTask references might build up in the Timer, and
    this buildup might cause an OutOfMemoryError on the catalog
    server.
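
The retention behavior described above can be reproduced with plain java.util.Timer, outside of the product. The following is a minimal sketch, not the catalog server code: the class name TimerPurgeDemo, the method cancelAndPurge, and the delays are hypothetical. It schedules far-future tasks, cancels them, and then calls Timer.purge(), which returns the number of canceled tasks it removed from the queue. A live "sentinel" task is kept at the head of the queue so that the timer thread never discards the canceled tasks on its own.

```java
import java.util.Timer;
import java.util.TimerTask;

public class TimerPurgeDemo {

    /**
     * Schedules `count` tasks, cancels all of them, and returns the number
     * of canceled tasks that Timer.purge() removes from the timer queue.
     */
    static int cancelAndPurge(int count) {
        Timer timer = new Timer("demo-timer", true); // daemon timer thread
        try {
            // Assumption for the demo: a live task that is due earlier than
            // everything else stays at the head of the queue, so canceled
            // tasks behind it are never reached by the timer thread.
            timer.schedule(new TimerTask() {
                @Override public void run() { }
            }, 30L * 60L * 1000L);

            TimerTask[] tasks = new TimerTask[count];
            for (int i = 0; i < count; i++) {
                tasks[i] = new TimerTask() {
                    @Override public void run() { }
                };
                timer.schedule(tasks[i], 60L * 60L * 1000L); // due in one hour
            }

            // cancel() only marks the task; the Timer still references it.
            for (TimerTask task : tasks) {
                task.cancel();
            }

            // purge() removes the canceled tasks and reports how many.
            return timer.purge();
        } finally {
            timer.cancel(); // stop the timer thread and drop remaining tasks
        }
    }

    public static void main(String[] args) {
        System.out.println("purged " + cancelAndPurge(10000) + " canceled tasks");
    }
}
```

Until purge() is called, every canceled task in this sketch remains reachable from the Timer, which is the kind of buildup the problem description attributes to the catalog server.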
    

Problem conclusion

  • This fix purges canceled tasks from the java.util.Timer at a
    regular interval.
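
One way to purge a Timer on an interval is to schedule a repeating task on the same Timer that calls Timer.purge(). The sketch below only illustrates that idea under stated assumptions and is not the shipped fix: the class name PeriodicTimerPurge, the helper methods, and the 50 ms interval are hypothetical, and the APAR does not state the interval the product uses.

```java
import java.util.Timer;
import java.util.TimerTask;
import java.util.concurrent.atomic.AtomicInteger;

public class PeriodicTimerPurge {

    /**
     * Schedules a repeating task on `timer` that purges canceled tasks.
     * `purged` accumulates the number of tasks removed so far.
     */
    static TimerTask schedulePurge(final Timer timer, long intervalMillis,
                                   final AtomicInteger purged) {
        TimerTask purgeTask = new TimerTask() {
            @Override public void run() {
                // Calling purge() from a task on the same timer is permitted.
                purged.addAndGet(timer.purge());
            }
        };
        timer.schedule(purgeTask, intervalMillis, intervalMillis);
        return purgeTask;
    }

    /**
     * Demo: schedules and immediately cancels `count` far-future tasks,
     * then waits (up to 5 seconds) for the periodic purge to remove them.
     * Returns the total number of tasks purged.
     */
    static int demo(int count) {
        Timer timer = new Timer("work-unit-timer", true); // daemon thread
        AtomicInteger purged = new AtomicInteger();
        try {
            schedulePurge(timer, 50L, purged); // hypothetical 50 ms interval
            for (int i = 0; i < count; i++) {
                TimerTask task = new TimerTask() {
                    @Override public void run() { }
                };
                timer.schedule(task, 60L * 60L * 1000L); // due in one hour
                task.cancel(); // marked canceled, still held by the timer
            }
            long deadline = System.currentTimeMillis() + 5000L;
            while (purged.get() < count && System.currentTimeMillis() < deadline) {
                Thread.sleep(10L);
            }
            return purged.get();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return purged.get();
        } finally {
            timer.cancel();
        }
    }

    public static void main(String[] args) {
        System.out.println("purged " + demo(10000) + " canceled tasks");
    }
}
```

A shorter interval bounds the number of canceled tasks held at any moment more tightly, at the cost of scanning the timer queue more often; purge() runs in time proportional to the queue size.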
    

Temporary fix

Comments

APAR Information

  • APAR number

    PM87976

  • Reported component name

    WS EXTREME SCAL

  • Reported component ID

    5724X6702

  • Reported release

    850

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2013-04-28

  • Closed date

    2013-05-17

  • Last modified date

    2013-07-05

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    WS EXTREME SCAL

  • Fixed component ID

    5724X6702

Applicable component levels

  • R850 PSY

       UP

  • R860 PSY

       UP


Document Information

Modified date:
05 July 2013