IBM Support

IV59209: LOSS OF REPOS AND REBOOT IN UNICAST CLUSTER CAUSES DMS TIMEOUT APPLIES TO AIX 6100-09

A fix is available

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • With a HA 7.1.3 unicast 2-node cluster, if the node
    with the storage having the LUN for the rep disk fails,
    the entire cluster may go down if the other node is
    rebooting/down. This is due to CAA unable to do a send
    via
    tcpsock connections (since other node is rebooting) and
    unable to do a send via dpcomm (since rep disk failed),
    causing a deadman switch timeout, despite ethernet and/or
    sancomm connections being healthy.
    

Local fix

Problem summary

  • 2-node unicast cluster, if the node with the LUN for
    the repository is lost (e.g., power loss to the
    building), the other node will go down as well due
    to deadman switch timeout.
    

Problem conclusion

  • If using unicast and repository is down and an ethernet and/or
    sancomm interface is healthy, send deadman swith heartbeat,
    preventing deadman switch timeout.
    

Temporary fix

Comments

  • 6100-09 - use AIX APAR IV59209
    6100-09 - use AIX APAR IV59209
    6100-09 - use AIX APAR IV59209
    7100-03 - use AIX APAR IV58961
    7100-04 - use AIX APAR IV59681
    

APAR Information

  • APAR number

    IV59209

  • Reported component name

    AIX 610 STD EDI

  • Reported component ID

    5765G6200

  • Reported release

    610

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Submitted date

    2014-04-14

  • Closed date

    2014-04-14

  • Last modified date

    2016-05-10

  • APAR is sysrouted FROM one or more of the following:

    IV58961

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    AIX 610 STD EDI

  • Fixed component ID

    5765G6200

Applicable component levels

  • R610 PSY U856008

       UP14/05/21 I 1000

PTF to Fileset Mapping

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSLLZP","label":"AIX Standard Edition"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"610","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SSMV87","label":"AIX 6.1 Enterprise Edition"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"610","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}},{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSMVAX","label":"AIX Express Edition"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"610","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SSAUMY","label":"IBM AIX Enterprise Edition"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"610","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11Q","label":"AIX 6.1 HIPERS, APARs and Fixes"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"610","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11R","label":"APARs - AIX 7.1 environment"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"610","Edition":"","Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
10 May 2016