IBM Support

IV19178: SYSTEM CRASH DUE TO FREED SOCKET WHEN SOCKETPAIR() CALL USED APPLIES TO AIX 5300-12

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • When socketpair calls are used on the system we could see
    a crash in socket code path (close, send, etc.) due to
    one
    of the sockets in the freelist.
    Stack could be something like,
    
    [00009514].simple_lock+000014 ()
    [0049EB08]unp_disconnect@AF32_12+000228 (??, ??, ??, ??,
     ??)
    [0049C574]uipc_usrreq+0005D4 (??, ??, ??, ??, ??)
    [004C75A8]sodisconn+0001A8 (??)
    [004C6DE4]soclose2+000BE4 (??, ??)
    [004C8070]soclose+000010 (??)
    [004D2734]soo_close+000294 (??)
    [00630F7C]closef+00005C (??)
    [00547890]closefd+000150 (??, ??)
    [00547C44]closex+000304 (??, ??)
    [00547EBC]close+00011C (??)
    [00003850]ovlya_addr_sc_flih_main+000130 ()
    [kdb_get_virtual_memory] no real storage @ 111694A50
    [900000000B192F4]0900000000B192F4 ()
    [kdb_read_mem] no real storage @ FFFFFFFFFFF6380
    
    This is because the socket in unp_conn is already freed.
    

Local fix

Problem summary

  • ****************************************************************
    * USERS AFFECTED:
    * Systems running the AIX 5300-12 Technology Level with the
    * kernels (bos.mp, bos.mp64) below the 5.3.12.6 level.
    *
    * Customers running Oracle RAC 11.2.0.3 or newer, with
    * Oracle Cluster Health Monitor (CHM), are likely to see
    * this problem.
    *
    ****************************************************************
    * PROBLEM DESCRIPTION:
    * System crash using socketpair() with a stack similar to:
    *
    *  00009514 .simple_lock+000014 ()
    *  0049EB08 unp_disconnect@AF32_12+000228 (??, ??, ??, ??, ??)
    *  0049C574 uipc_usrreq+0005D4 (??, ??, ??, ??, ??)
    *  004C75A8 sodisconn+0001A8 (??)
    *  004C6DE4 soclose2+000BE4 (??, ??)
    *  004C8070 soclose+000010 (??)
    *  004D2734 soo_close+000294 (??)
    *  00630F7C closef+00005C (??)
    *  00547890 closefd+000150 (??, ??)
    *  00547C44 closex+000304 (??, ??)
    

Problem conclusion

  • The code in socketpair() system call modifid so that in
    multithreaded environment, another thread is not able to close
    one of the sockets during critical pairing phase.
    

Temporary fix

  • *********
    * HIPER *
    *********
    

Comments

  • 5300-12 - use AIX APAR IV19178
    6100-06 - use AIX APAR IV16603
    7100-00 - use AIX APAR IV21131
    

APAR Information

  • APAR number

    IV19178

  • Reported component name

    AIX 5.3

  • Reported component ID

    5765G0300

  • Reported release

    530

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    YesHIPER

  • Submitted date

    2012-04-09

  • Closed date

    2012-04-09

  • Last modified date

    2013-03-26

  • APAR is sysrouted FROM one or more of the following:

    IV16603

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    AIX 5.3

  • Fixed component ID

    5765G0300

Applicable component levels

  • R530 PSY

       UP

[{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11P","label":"APARs - AIX 5.3 environment"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"530","Edition":"","Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
26 March 2013