APAR status
Closed as program error.
Error description
Client session is hung in MediaW state. New client and admin sessions hang in the Run state and in time the server can run out of sessions preventing new client/admin sessions. These symptoms are the same as APAR IC99934, but the cause and identifiers are different. Tivoli Storage Manager Versions Affected: All V62, V63 and V7 Servers Customer/L2 Diagnostics: Query Session will show growing number of client sessions in the run state: Sess Comm. Sess Wait Sess Number Method State Time Type ------- ------ ------ ------ ------- 1,336,400 Tcp/Ip SendW 0 S Node 1,337,000 Tcp/Ip MediaW 4.7 M Node 1,337,301 Tcp/Ip Run 0 S Node 1,337,302 Tcp/Ip Run 0 S Node 1,337,303 Tcp/Ip Run 0 S Node 1,337,304 Tcp/Ip Run 0 S Node 1,337,305 Tcp/Ip Run 0 S Node 1,337,306 Tcp/Ip Run 0 S Admin ... Collect the servermonV6.pl script data during the time you get traces. Although this problem is difficult to trap with a trace, the following flags can be used: TRACE ENABLE INSTR AS MMS PVRNOIO Tracing with just INSTR and PVR seemed to help trigger the problem more frequently. The traces will show a latch deadlock. The deadlocked threads will both have the same stack. Example: Thread 3631144, Parent 1048: psSessionThread, Storage 7797835, AllocCnt 1621 HighWaterAmt 7867735 tid=828, ptid=2218, det=1, zomb=0, join=0, result=0, sess=871784 Awaiting cond latchP->sFree (0x110cdc6f0), using mutex PVRV->mutex (0x111b8c148), at latch.c(256) Stack trace: 0x09000000004b9ab0 _cond_wait_global 0x09000000004ba63c _cond_wait 0x09000000004bb32c pthread_cond_wait 0x00000001000079f4 pkWaitConditionTracked 0x000000010068a51c SmNodeSession 0x000000010053e39c HandleNodeSession 0x0000000100535cb4 DoNodeGeneral 0x0000000100530ce0 smExecuteSession 0x000000010017827c psSessionThread 0x000000010000c264 StartThread Trace data: 06:56:18.143 [3631144][latch.c][238][AcquireLatchSpecific]:LATCH Acquire ATTEMPTED for latch 12740150, mode 0, using mutex 11b8c008 by requestor pvrclass.c(3631144). 06:56:18.143 [3631144][latch.c][294][AcquireLatchSpecific]:LATCH Acquire SUCCESSFUL for latch 12740150, mode 0, using mutex 11b8c008 by requestor pvrclass.c(3631144). 06:56:18.142 [3631155][latch.c][238][AcquireLatchSpecific]:LATCH Acquire ATTEMPTED for latch 12750d90, mode 1, using mutex 11b8c008 by requestor pvrmp.c(938)(3631155). ----> Another thread requesting for Xlatch blocked the 3631144's Slatch request Initial Impact: High Additional Keywords: Hang hung Login failed session client backups fail
Local fix
Restart the Tivoli Storage Manager server
Problem summary
**************************************************************** * USERS AFFECTED: * * All Tivoli Storage Manager server users. * **************************************************************** * PROBLEM DESCRIPTION: * * See ERROR DESCRIPTION. * **************************************************************** * RECOMMENDATION: * * Apply fixing level when available. This problem is currently * * projected to be fixed in levels 6.3.5 and 7.1.1. Note that * * this is subject to change at the discretion of IBM. * ****************************************************************
Problem conclusion
This problem was fixed. Affected platforms: AIX, HP-UX, Solaris, Linux, and Windows.
Temporary fix
Comments
APAR Information
APAR number
IT02590
Reported component name
TSM SERVER
Reported component ID
5698ISMSV
Reported release
63A
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Special Attention
NoSpecatt
Submitted date
2014-06-16
Closed date
2014-06-23
Last modified date
2014-06-23
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Fix information
Fixed component name
TSM SERVER
Fixed component ID
5698ISMSV
Applicable component levels
R63A PSY
UP
R63H PSY
UP
R63L PSY
UP
R63S PSY
UP
R63W PSY
UP
R71A PSY
UP
R71H PSY
UP
R71L PSY
UP
R71S PSY
UP
R71W PSY
UP
[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSGSG7","label":"Tivoli Storage Manager"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"63A","Edition":"","Line of Business":{"code":"LOB26","label":"Storage"}}]
Document Information
Modified date:
23 June 2014