IBM Support

IT23996: DISCONNECTION OF A SESSION DURING DATA INGEST TO CONTAINER STORAGE CAN CAUSE IT TO HANG.

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • If a session is disconnected due to a hard failure or a cancel
    session command, the session may not end but hang.  This can
    happen when the disconnect occurs during the ingest of data to a
     directory storage pool. (ie writing to a container)
    
    The problem occurs when the session thread and the control
    thread become out of sync.
    
    The  'Show Threads' could show:
    
    Thread 10324, Parent 274: psSessionThread, Storage 11044506,
    AllocCnt 18003 HighWaterAmt 47947886
       tid=140730650932992, ptid=140733150521088, det=1, zomb=0,
    join=0, result=0, sess=3270, procToken=0, sessToken=3270
       lwp=34289
        Awaiting cond sessP->bufQFullCond (0x0x21b2590), using mutex
    sessP->mutex (0x0x7ffbf41ba848), at sdbuf.c(746)
       Thread context:
         COMMMETHOD: Tcp/Ip
         SRC_STRATEGY: CONTAINER
         SRC_STGPOOL_NAME: CONTAINER_POOL
         SERVER_NAME: SERVER1
         THREAD_TYPE: SESSION
         SESSION_TYPE: SERVER
    
        Thread 10471, Parent 10324: SdCQSinkThread, Storage
    3124105988, AllocCnt 411707 HighWaterAmt 3127747871
         tid=140736141965056, ptid=140730650932992, det=1, zomb=0,
    join=0, result=0, sess=0, procToken=0, sessToken=3270
         lwp=46319
          Awaiting cond sessP->bufQEmptyCond (0x0x21b26e0), using
    mutex sessP->mutex (0x0x7ffbf41ba848), at sdbuf.c(1081)
         Thread context:
           COMMMETHOD: Tcp/Ip
           SRC_STRATEGY: CONTAINER
           SERVER_NAME: SERVER1
           SRC_STGPOOL_NAME: CONTAINER_POOL
           THREAD_TYPE: SESSION
           SESSION_TYPE: SERVER
    
    We see that the first (parent) thread is awaiting for
    bufQFullCond but the second (child) thread is awaiting
    bufQEmptyCond - these are opposite conditions which will never
    be met and hence the session does not end.
    
    The hung session may or may not be cancellable once this state
    is met
    
    
    IBM Spectrum Protect versions affected:   Server version 7.1.3
    and higher, and 8.1 and higher on all platforms
    
    
    Initial Impact:
    Medium
    
    Additional Keywords:
    Tivoli Storage Manager  TSM IBM Spectrum Protect hanging
    

Local fix

  • Halt and restart the server
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * All IBM Spectrum Protect server users of container type      *
    * storage pools.                                               *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See error description.                                       *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Apply fixing level when available. This problem is currently *
    * projected to be                                              *
    * fixed in level 8.1.5. Note that this is subject to change at *
    * the discretion of IBM.                                       *
    ****************************************************************
    

Problem conclusion

  • This problem was fixed.
    
    Affected platforms for reported release:  AIX, Linux, and
    Windows.
    Platforms fixed:   AIX, Linux, Windows.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT23996

  • Reported component name

    TSM SERVER

  • Reported component ID

    5698ISMSV

  • Reported release

    81L

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2018-02-19

  • Closed date

    2018-03-27

  • Last modified date

    2018-03-27

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    TSM SERVER

  • Fixed component ID

    5698ISMSV

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSGSG7","label":"Tivoli Storage Manager"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"81L","Edition":"","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
27 March 2018