IBM Support

IV51337: CRASH IN LVM USING ASYNC GLVM AFTER REMOTE SITE FAILURE APPLIES TO AIX 7100-03

A fix is available

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • When using async glvm, after some problem causes
    the remote site to fail, the local
    host may crash with the below stack:
    
    pvthread+041000 STACK:
    [0001BF20]abend_trap+000000 ()
    [0054E128]slock+0005A8 (??, ??)
    [000090C4].disable_lock+0000C0 ()
    [044259AC]hd_ca_wrt+00006C ()
    [044186B8]hd_sched_queue+0001B8 (??, ??)
    [0441E1F8]hd_strategy+0000D8 (??)
    [002D23E4]std_devstrat+000364 (??)
    [002D2A10]devstrat+000050 (??)
    [00014F50].kernel_add_gate_cstack+000030 ()
    [F1000000C024D7F0]rpvc_mark_aio_group_complete+0001D0
    (??, ??)
    [F1000000C024C284]rpvc_fail_aio_write_done+0000E4 (??)
    [00014D70].hkey_legacy_gate+00004C ()
    [04433720]hd_aiocbdone+0000C0 (??, ??, ??)
    [044335B0]hd_async_stalepp_end+000170 (??)
    [04428384]hd_sa_rtn+000144 (??, ??)
    [044289C8]hd_sa_whladv+000088 (??)
    [044286E0]hd_sa_iodone+0002E0 (??)
    [00188BA8]internal_iodone_offl+0000C8 (??, ??)
    [00188A84]iodone_offl+000064 ()
    [0018C27C]i_softmod+00043C ()
    [001B9B0C]flih_util+000258 ()
    

Local fix

Problem summary

  • When using Async GLVM, and a write to the remote glvm site
    fails, certain timing of I/O during the error handling path
    can result in the local node to crash with the stack trace
    above.
    

Problem conclusion

  • Correct some logic with reguards to locking to not crash in
    this scenario
    

Temporary fix

Comments

  • 6100-07 - use AIX APAR IV51028
    6100-08 - use AIX APAR IV57817
    6100-09 - use AIX APAR IV51122
    6100-09 - use AIX APAR IV51122
    6100-09 - use AIX APAR IV51122
    7100-01 - use AIX APAR IV57510
    7100-02 - use AIX APAR IV56962
    7100-03 - use AIX APAR IV51337
    

APAR Information

  • APAR number

    IV51337

  • Reported component name

    AIX V7.1

  • Reported component ID

    5765H4000

  • Reported release

    710

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Submitted date

    2013-10-24

  • Closed date

    2013-10-24

  • Last modified date

    2014-08-14

  • APAR is sysrouted FROM one or more of the following:

    IV51028

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    AIX V7.1

  • Fixed component ID

    5765H4000

Applicable component levels

  • R710 PSY U859005

       UP14/05/22 I 1000

PTF to Fileset Mapping

[{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SSMV87","label":"AIX 6.1 Enterprise Edition"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"710","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}},{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSMVAX","label":"AIX Express Edition"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"710","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11R","label":"AIX 7.1 HIPERS, APARs and Fixes"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"710","Edition":"","Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
14 August 2014