IBM Support

IJ00360: SYSTEM CRASH OR HANG DURING ERROR RECOVERY ON SOME FC ADAPTERS APPLIES TO AIX 6100-09

 

APAR status

  • Closed as program error.

Error description

  • **************************************************************
    * USERS AFFECTED:
    * Systems running the AIX 6100-09 Technology Level.
      **************************************************************
    * ERROR DESCRIPTION:
    * For some Fibre Channel adapters, when error recovery happens,
    * the driver may crash or hang the partition.
    * Adapters using the emfcdd driver are exposed.
    *
    * To find out if a system has any adapters using an affected
    * driver with lsdev, for example:
    * # lsdev -F "name ddins" | grep emfcdd
    * Note: on a VIOS, this must be run as root using
    * 'oem_setup_env'.
      **************************************************************
    * RECOMMENDATION:
    * Install APAR IJ00360.
    * Prior to fix availability, an interim fix is available from
    * either
    * ftp://aix.software.ibm.com/aix/ifixes/ij00360/
    * https://aix.software.ibm.com/aix/ifixes/ij00360/
    * Installation of the ifix requires a reboot.
      **************************************************************
    

Local fix

Problem summary

  • 1. Possible hang with similar stack below during error
    recovery.
    
     04314D68 emfc_find_abt_info_active_xri+000068
     0431388C emfc_sm_abort_proc_ctl_elem+0000EC (??, ??, ??)
     0430E558 emfc_proc_cqe_proto+000598 (??, ??, ??)
     043102AC emfc_proc_cq_wq+0002AC (??, ??)
     04310F00 emfc_proc_eq+000140 (??, ??)
     04311444 emfc_intr+000144 (??)  002646C4 i_poll_soft+000204
    (??)  00263E50 i_softmod+000670 ()  0020292C flih_util+000258
    ()
    
    OR
    
     05A6B318 emfc_log_err+000778 (??, ??, ??, ??, ??, ??)
     05AE7724 emfc_sm_abort_proc_xri_aborted+000264 (??, ??)
     05AE5318 emfc_proc_cq_wq+0002B8 (??, ??)
     05AE5F70 emfc_proc_eq+000150 (??, ??)
     05AE64B8 emfc_intr+000138 (??)  002BFA54 i_poll_soft+0001F4
    (??)  002BF1E4 i_softmod+000624 ()  001B0338 flih_util+000258
    ()
    
    2. And possible system crash with similar stack below during
    error recovery.
    
    pvthread+085900 STACK:   00024BE0 abend_trap+000000 ()
     05AE9190 emfc_sm_abort_build_rrq+0001D0 (??, ??)
     05AE9A9C emfc_sm_abort+0003DC (??, ??)
     05AE79E0 emfc_sm_abort_proc_xri_aborted+000520 (??, ??)
     05AE5318 emfc_proc_cq_wq+0002B8 (??, ??)
     05AE5F70 emfc_proc_eq+000150 (??, ??)
     05AE64B8 emfc_intr+000138 (??)  002BFA54 i_poll_soft+0001F4
    (??)  002BF1E4 i_softmod+000624 ()  001B0338 flih_util+000258
    ()
    

Problem conclusion

  • Fixed code to avoid hang and crash.
    

Temporary fix

  •   *********
      * HIPER *
      *********
    

Comments

  • 6100-09 - use AIX APAR IJ00360
    6100-09 - use AIX APAR IJ00360
    7100-04 - use AIX APAR IJ00519
    7100-05 - use AIX APAR IJ01304
    7200-01 - use AIX APAR IV96360
    7200-02 - use AIX APAR IJ01262
    

APAR Information

  • APAR number

    IJ00360

  • Reported component name

    AIX 6.1 EXT SUP

  • Reported component ID

    5765G6299

  • Reported release

    61E

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    YesHIPER

  • Submitted date

    2017-09-27

  • Closed date

    2017-11-02

  • Last modified date

    2018-09-21

Fix information

  • Fixed component name

    AIX 6.1 EXT SUP

  • Fixed component ID

    5765G6299

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSLLZP","label":"AIX Standard Edition"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"61E","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11Q","label":"AIX 6.1 HIPERS, APARs and Fixes"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"61E","Edition":"","Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
21 September 2018