IBM Support

Nodes in Metro or Global Mirror Inter-cluster Partnerships May Reboot if the Inter-cluster Link Becomes Overloaded

Flashes (Alerts)


Abstract


SVC, Storwize V7000 or Storwize V7000 Unified nodes in Metro or Global Mirror inter-cluster partnerships may experience lease expiry reboot events if an inter-cluster link to a partner system becomes overloaded. These reboot events may occur on all nodes simultaneously, leading to a temporary loss of host access to Volumes.

Content

If an inter-cluster link becomes severely and abruptly overloaded, the local fibre channel fabric can become so congested that no fibre channel ports on the local SVC, Storwize V7000 or Storwize V7000 Unified nodes are able to perform local intra-cluster heartbeat communication. This may result in the nodes experiencing lease expiry events, in which a node reboots in an attempt to re-establish communication with the other nodes in the system. If all nodes experience lease expiry simultaneously, host access to Volumes may be lost for the duration of the reboot events.

Workaround

The default zoning recommendation for inter-cluster Metro and Global Mirror partnerships has been revised to ensure that, if link-induced congestion occurs, only two of the four fibre channel ports on each node can be exposed to this congestion. The remaining two ports on each node are unaffected, and can therefore continue performing intra-cluster heartbeat communication without interruption.



The revised zoning recommendation is as follows:

  • For each node in a clustered system, exactly two fibre channel ports should be zoned to exactly two fibre channel ports from each node in the partner system.
  • If dual-redundant ISLs are available, then the two ports from each node should be split evenly between the two ISLs, i.e. exactly one port from each node should be zoned across each ISL.
  • Local system zoning should continue to follow the standard requirement for all ports on all nodes in a clustered system to be zoned to one another.

This implies that for each system, there will be two ports on each SVC node that have no remote zones, only local zones.
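
The zoning rules above can be checked mechanically against a planned set of remote zones. The following is only an illustrative sketch, not an IBM tool: the function name, node labels, port numbers and ISL labels are all hypothetical, and it assumes the plan is described as one tuple per zoned port pair, with dual-redundant ISLs available. It covers the remote zoning rules only; local zoning is not checked.

```python
from collections import defaultdict


def check_intercluster_zoning(local_nodes, partner_nodes, remote_zones):
    """Return a list of rule violations (empty if the plan matches the guidance).

    remote_zones is an iterable of hypothetical tuples:
    (local_node, local_port, partner_node, partner_port, isl)
    """
    local_ports = defaultdict(set)    # local node -> local ports with remote zones
    partner_ports = defaultdict(set)  # (local node, partner node) -> partner ports zoned
    port_isls = defaultdict(set)      # (local node, local port) -> ISLs that port crosses
    for lnode, lport, pnode, pport, isl in remote_zones:
        local_ports[lnode].add(lport)
        partner_ports[(lnode, pnode)].add(pport)
        port_isls[(lnode, lport)].add(isl)

    problems = []
    for lnode in local_nodes:
        used = sorted(local_ports[lnode])
        # Rule 1a: exactly two ports per node carry remote zones.
        if len(used) != 2:
            problems.append(f"{lnode}: {len(used)} ports carry remote zones, expected 2")
        # Rule 1b: those ports reach exactly two ports on each partner node.
        for pnode in partner_nodes:
            seen = partner_ports[(lnode, pnode)]
            if len(seen) != 2:
                problems.append(f"{lnode}: zoned to {len(seen)} ports on {pnode}, expected 2")
        # Rule 2: each remote port uses one ISL, and the ports use different ISLs.
        isls = [port_isls[(lnode, port)] for port in used]
        if any(len(s) != 1 for s in isls) or len(set().union(*isls)) != len(used):
            problems.append(f"{lnode}: remote ports are not split one per ISL")
    return problems


# Hypothetical two-node systems; ports 1 and 3 are chosen arbitrarily for remote zones,
# which leaves ports 2 and 4 on every node with local zones only.
local = ["A-node1", "A-node2"]
partner = ["B-node1", "B-node2"]
plan = [
    (ln, port, pn, port, isl)
    for ln in local
    for pn in partner
    for port, isl in ((1, "ISL-1"), (3, "ISL-2"))
]
print(check_intercluster_zoning(local, partner, plan))
```

An empty list means the plan satisfies the remote zoning rules as modelled here. Zoning a third port on any node to the partner system would be reported as a violation.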


Please refer to What's new in the IBM System Storage SAN Volume Controller Information Center for more information on SVC terminology changes.


Document Information

Modified date:
28 March 2023

UID

ssg1S1003634