APAR status
Closed as program error.
Error description
The Tivoli Storage Manager server can hang during the upgradedb process when upgrading to any 5.5.X.X release on the Linux platform. After executing the upgrade install_server script, the command process can hang, and can only be terminated by issuing kill -xx <pid> command, where xx is the kill signal sent to terminate the process. To generate a core dump issue signal 11 so the core file can be used to verify this APAR. During a normal upgrade of the Tivoli Storage Manager, the server will complete initialization process, and then issue the HALT command to end the upgrade process. Under an unknown circumstance there is a possibility that the HALT command will be issued before the Storage Manager can complete the initialization process, and this will block the upgrade from terminating. The core stack will show the main thread is not finished initializing duringa call to function dbUpgradeComplete when HALT is called, and is blocked in function LvmCheckpoint(). The call stack is as below: #0 0x00000200000c638e in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/tls/libpthread.so.0 #1 0x000000008088481a in pkWaitConditionTracked () #2 0x00000000804af9c0 in LvmCheckpoint () #3 0x00000000804ace10 in lvmWriteRestartRecord () #4 0x00000000802d9d50 in dbUpgradeComplete () #5 0x00000000801396e4 in admStartServer () #6 0x0000000080087be8 in main () The current code is written so that function LvmCheckpoint() will be in a wait condition forever when the HALT command is in progress. If a core file is not available, or if the upgrade is currently in a hung state the APAR can be verified by checking the "tsm_upgrade.log". This log is located in the "server/bin" directory, and can be checked by looking for the following messages: 1. ANR0129I Database upgrade completed successfully. Even though the upgrade hangs, this message shows that the upgrade is successful, and the 5.5.3.0 Storage Manager server is ready to use. 2. ANR0993I Server initialization complete. This message is missing from the log file during a hang condition, and when missing means the server did not complete initialization. 3. ANR2017I Administrator SERVER_CONSOLE issued command: HALT HALT is last message displayed in the log file during the hang, and should never be issued until initialization has completed. This message should always follow message ANR0993I. 4. ANR0991I Server shutdown complete. Server shutdown complete is always the last message displayed in the log and comes right after the HALT message. In a hang condition this message will not be displayed in the log. This hang condition has been seen on the zLinux platform, but could effect other Linux or Unix platforms, and other versions of the Tivoli Storage Manager. The problem is completely random and is not easily reproduced. Initial Impact: Low - the upgrade is successful. Additional Keywords: Hung Hang upgrade Linux install_server
Local fix
None - the upgrade is successful and the Tivoli Storage Manager server is ready for use.
Problem summary
**************************************************************** * USERS AFFECTED: All Tivoli Storage Manager server users. * **************************************************************** * PROBLEM DESCRIPTION: See error description. * **************************************************************** * RECOMMENDATION: Apply fixing level when available. This * * problem is currently projected to be fixed * * in levels 5.5.5. Note that this is * * subject to change at the discretion of IBM. * **************************************************************** *
Problem conclusion
This problem was fixed. Affected platforms: AIX, HP-UX, Sun Solaris, Linux, z/OS, and Windows.
Temporary fix
Comments
APAR Information
APAR number
IC62990
Reported component name
TSM SERVER
Reported component ID
5698ISMSV
Reported release
55Z
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Special Attention
NoSpecatt
Submitted date
2009-09-04
Closed date
2009-10-23
Last modified date
2009-10-23
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Fix information
Fixed component name
TSM SERVER
Fixed component ID
5698ISMSV
Applicable component levels
R55A PSY
UP
R55H PSY
UP
R55L PSY
UP
R55S PSY
UP
R55W PSY
UP
R55Z PSY
UP
[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSGSG7","label":"Tivoli Storage Manager"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"55Z","Edition":"","Line of Business":{"code":"LOB26","label":"Storage"}}]
Document Information
Modified date:
23 October 2009