A fix is available
APAR status
Closed as program error.
Error description
With an HA 7.1.3 unicast 2-node cluster, if the node attached to the storage that holds the LUN for the repository disk fails, the entire cluster may go down if the other node is rebooting or down. CAA cannot send via tcpsock connections (because the other node is rebooting) and cannot send via dpcomm (because the repository disk has failed). This causes a deadman switch timeout even though the ethernet and/or sancomm connections are healthy.
Local fix
Problem summary
In a 2-node unicast cluster, if the node with the LUN for the repository is lost (e.g., power loss to the building), the other node also goes down because of a deadman switch timeout.
Problem conclusion
If the cluster uses unicast, the repository is down, and an ethernet and/or sancomm interface is healthy, the deadman switch heartbeat is now sent, which prevents the deadman switch timeout.
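The rule in the problem conclusion can be read as a simple boolean condition. The following is only an illustrative sketch of that condition, not the CAA implementation: the `LinkState` type, its field names, and the function `should_send_dms_heartbeat` are hypothetical names introduced for this example.

```python
from dataclasses import dataclass


@dataclass
class LinkState:
    """Hypothetical snapshot of the inputs named in the problem conclusion."""
    unicast: bool           # cluster is configured for unicast
    repository_up: bool     # repository disk (dpcomm path) is reachable
    ethernet_healthy: bool  # an ethernet interface is healthy
    sancomm_healthy: bool   # a sancomm interface is healthy


def should_send_dms_heartbeat(state: LinkState) -> bool:
    """Model only the case this APAR adds: unicast, repository down,
    and at least one ethernet or sancomm interface still healthy."""
    return (
        state.unicast
        and not state.repository_up
        and (state.ethernet_healthy or state.sancomm_healthy)
    )


# Repository lost, ethernet still healthy: the heartbeat is sent.
print(should_send_dms_heartbeat(LinkState(True, False, True, False)))
```

The sketch deliberately covers only the condition described here; how the deadman switch is serviced when the repository is up is outside the scope of this record.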
Temporary fix
Comments
6100-09 - use AIX APAR IV59209
7100-03 - use AIX APAR IV58961
7100-04 - use AIX APAR IV59681
APAR Information
APAR number
IV59681
Reported component name
AIX V7.1
Reported component ID
5765H4000
Reported release
710
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Submitted date
2014-04-23
Closed date
2014-04-23
Last modified date
2016-05-10
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Fix information
Fixed component name
AIX V7.1
Fixed component ID
5765H4000
Applicable component levels
R710 PSY U861566 UP15/11/22 I 1000
PTF to Fileset Mapping
U861566 bos.cluster.rte 7.1.4.0
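The mapping above ships the fix in bos.cluster.rte at level 7.1.4.0. As a rough sketch, an installed fileset level can be compared against that level once it has been obtained as a dotted string (for example from `lslpp -L bos.cluster.rte` output). The helper names `parse_vrmf` and `has_fix_level` are hypothetical, and the sketch assumes that any later level on the same release also carries the fix.

```python
from typing import Tuple

# Level of bos.cluster.rte delivered by PTF U861566, per the mapping above.
FIXED_LEVEL = "7.1.4.0"


def parse_vrmf(level: str) -> Tuple[int, ...]:
    """Split a dotted fileset level such as '7.1.4.0' into integers."""
    return tuple(int(part) for part in level.strip().split("."))


def has_fix_level(installed: str, fixed: str = FIXED_LEVEL) -> bool:
    """Return True when the installed level is at or above the fixed level.

    Assumption: a numerically higher level on the same release
    includes the fix.
    """
    return parse_vrmf(installed) >= parse_vrmf(fixed)


print(has_fix_level("7.1.3.46"))
print(has_fix_level("7.1.4.0"))
```

Comparing integer tuples rather than raw strings avoids ordering mistakes such as treating "7.1.10.0" as lower than "7.1.4.0".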
Document Information
Modified date:
10 May 2016