IBM Support

IJ46533: A deadlock of AcquireBRTHandlerThread or RangeRevokeWorkerThread waiting for other threads to exit fast path

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • There is a code issue that could result in that an AIO
    completion event could be not handled by the AIO completion
    thread, then form a deadlock with long waiter for the thread of
    AcquireBRTHandlerThread or RangeRevokeWorkerThread waiting for
    other threads to exit fast path. In addition, such miss-handling
     for AIO completion could also cause the file system cannot be
    quiesced and memory leak issue.
    

Local fix

Problem summary

  • There is a code issue that could result in that an AIO
    completion event could be not handled by the AIO completion
    thread, then form a deadlock with long waiter for the thread of
    AcquireBRTHandlerThread or RangeRevokeWorkerThread waiting for
    other threads to exit fast path. In addition, such miss-handling
     for AIO completion could also cause the file system cannot be
    quiesced and memory leak issue.
    

Problem conclusion

  • This problem is fixed in 5.1.2.11
    To see all Spectrum Scale APARs and their respective
    Fix solutions refer to page:
    https://public.dhe.ibm.com/storage/spectrumscale/spectrum_scale_
    apars.html
    
    Benefits of the solution:
    Fix the deadlock issue caused by the AIO completion missing
    handled issue.
    
    Work around:
    No
    
    Problem trigger:
    Doing AIO reads/writes from one node and then start normal
    buffer I/O load from the other nodes against the same files.
    
    Symptom:
    Deadlock
    
    Platforms affected:
    Linux Only
    
    Functional Area affected:
    AIO only
    
    Customer Impact:
    Critical
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ46533

  • Reported component name

    SPEC SCALE STD

  • Reported component ID

    5737F33AP

  • Reported release

    512

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2023-04-21

  • Closed date

    2023-05-15

  • Last modified date

    2023-05-15

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SPEC SCALE STD

  • Fixed component ID

    5737F33AP

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"STXKQY"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"512","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
15 May 2023