IBM Support

IJ49540: POTENTIAL FOR RESOURCE LEAK WHEN USING RDMA

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • When a RDMA connection to a remote node has to be shutdown due
    to network errors (e.g. network link goes down) it can sometimes
    happen that the affected RDMA connection will not be closed and
    all resources assigned to this RDMA connection (memory, VERBS
    Queue Pair, ...) are not freed.
    

Local fix

Problem summary

  • When a RDMA connection to a remote node has to be shutdown due
    to network errors (e.g. network link goes down) it can sometimes
    happen that the affected RDMA connection will not be closed and
    all resources assigned to this RDMA connection (memory, VERBS
    Queue Pair, ...) are not freed.
    

Problem conclusion

  • This problem is fixed in 5.1.2.15
    To see all Spectrum Scale APARs and their respective
    Fix solutions refer to page:
    https://public.dhe.ibm.com/storage/spectrumscale/spectrum_scale_
    apars.html
    
    Benefits of the solution:
    Fixed code so that the resource leak can no longer happen.
    
    Work around:
    No work around available.
    
    Problem trigger:
    verbsRdmaSend must be enabled. Loss of a RDMA connection to a
    node because of network errors in the RDMA fabric.
    
    Symptom:
    Unexpected Results/Behavior
    
    Platforms affected:
    ALL Linux OS environments
    
    Functional Area affected:
    RDMA
    
    Customer Impact:
    Suggested
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ49540

  • Reported component name

    SPEC SCALE STD

  • Reported component ID

    5737F33AP

  • Reported release

    512

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2023-12-14

  • Closed date

    2023-12-14

  • Last modified date

    2023-12-14

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SPEC SCALE STD

  • Fixed component ID

    5737F33AP

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"STXKQY"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"512","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
15 December 2023