IBM Support

PI56154: After a network bronwn out recovers, the routetable contains unr eachable entries

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • After a network brownout recovers, the routetable contains
    unreachable or entries pointing. The routetable entries that are
    unreachable match the locations of shards on servers that were
    stopped or restarted during brownout recovery.
    

Local fix

Problem summary

  • ****************************************************************
    * USERS AFFECTED:  WebSphere eXtreme Scale or WebSphere        *
    *                  DataPower                                   *
    *                  XC10 Appliance users who experience a       *
    *                  network                                     *
    *                  brownout (a temporary network failure       *
    *                  lasting                                     *
    *                  longer than 30 seconds or the configured    *
    *                  failover detection time).                   *
    ****************************************************************
    * PROBLEM DESCRIPTION: After a network bronwn out recovers,    *
    *                      the                                     *
    *                      routetable contains unreachable         *
    *                      entries                                 *
    ****************************************************************
    * RECOMMENDATION:                                              *
    ****************************************************************
    After a network brownout recovers, the routetable contains
    unreachable or entries pointing. The routetable entries that
    are
    unreachable match the locations of shards on servers that were
    stopped or restarted during brownout recovery.
    This can happen if a catalog server is isolated with a group of
    containers and does some shard movement while islanded,
    incremented the placement work ID for their grids. If the group
    rejoins the domain or collective, the containers can push their
    route table entries to the catalog servers. If the same
    containers are then restarted to recover from the brownout, then
    bad route table entries can be left in the route table and new
    routes temporarily rejected because they appear to be stale or
    old.
    

Problem conclusion

  • The route table processing double checks if potentially stale
    route entries are actually valid (match the current placement
    plan). This allows "bad" routes from a network brownout failure
    be overridden.
    

Temporary fix

Comments

APAR Information

  • APAR number

    PI56154

  • Reported component name

    WS EXTREME SCAL

  • Reported component ID

    5724X6702

  • Reported release

    860

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2016-01-29

  • Closed date

    2016-03-03

  • Last modified date

    2016-03-03

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    WS EXTREME SCAL

  • Fixed component ID

    5724X6702

Applicable component levels

  • R860 PSY

       UP

[{"Business Unit":{"code":"BU053","label":"Cloud & Data Platform"},"Product":{"code":"SSTVLU","label":"WebSphere eXtreme Scale"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"860","Edition":"","Line of Business":{"code":"LOB45","label":"Automation"}}]

Document Information

Modified date:
03 March 2016