APAR status
Closed as program error.
Error description
The Monitoring Agent for UNIX OS does not collect any data when the Process attribute group and the Data Collection Status attribute group both are collecting data. With tracing set to (UNIT:kux ALL) in the agent RAS1 log file will see: Will show kux03agt calling init_psTable_once: (5D13C1DF.0127-4:kux03agt.cpp,981,"init_psTable_once") Calling pthread_mutex_init for lockPsTable (5D13C1DF.0128-4:kux03agt.cpp,986,"init_psTable_once") Locking psTable (5D13C1DF.0129-4:kux03agt.cpp,992,"init_psTable_once") psTable initialized for node: itmmtlpm16:KUX. Unlocking psTable Will show kuxcputhrd (thread -4) calling Locking PsTable followed by kux03agt calling Locking psTable: (5D13C1DF.012B-4:kuxcputhrd.cpp,108,"updateProcessStats") Locking psTable (5D13C1EC.007D-7:kux03agt.cpp,530,"TakeSample") Locking psTable - It is a complete datacollection Then before kuxcputhred calls "unlocking psTable" the kux42agt.cpp thread does another psTable initialization: (5D13C1E0.0678-1D:kux42agt.cpp,404,"init_psTable_once_2") Locking psTable (5D13C1E0.0679-1D:kux42agt.cpp,411,"init_psTable_once_2") psTable initialized for node: itmmtlpm16:KUX. Unlocking psTable (5D13C1DF.012C-4:kuxcputhrd.cpp,115,"updateProcessStats") Freeing and unlocking psTable Will not see kux03agt issuing the Unlocking psTable or other processing in kux03agt.cpp (5D13C1EC.042C-7:kux03agt.cpp,963,"TakeSample") Unlocking psTable
Local fix
Stop the UNIXDCSTAT situation/historical collections from starting.
Problem summary
Agent does not collect any data when UNIXDCSTAT attribute is collected. The Monitoring Agent for UNIX OS does not collect any data when the Process attribute group and the Data Collection Status attribute group both are collecting data and there are enough processes running that it takes more than a second to process them (e.g. 7k processes).
Problem conclusion
Two different threads were initializing the locking structure. The code has been changed to only initialize it once. With tracing set to (UNIT:kux ALL) in the agent RAS1 log file will see: Will show kux03agt calling init_psTable_once: (5D13C1DF.0127-4:kux03agt.cpp,981,"init_psTable_once") Calling pthread_mutex_init for lockPsTable (5D13C1DF.0128-4:kux03agt.cpp,986,"init_psTable_once") Locking psTable (5D13C1DF.0129-4:kux03agt.cpp,992,"init_psTable_once") psTable initialized for node: itmmtlpm16:KUX. Unlocking psTable Will show kuxcputhrd (thread -4) calling Locking PsTable followed by kux03agt calling Locking psTable: (5D13C1DF.012B-4:kuxcputhrd.cpp,108,"updateProcessStats") Locking psTable (5D13C1EC.007D-7:kux03agt.cpp,530,"TakeSample") Locking psTable - It is a complete datacollection Then before kuxcputhred calls "unlocking psTable" the kux42agt.cpp thread does another psTable initialization: (5D13C1E0.0678-1D:kux42agt.cpp,404,"init_psTable_once_2") Locking psTable (5D13C1E0.0679-1D:kux42agt.cpp,411,"init_psTable_once_2") psTable initialized for node: itmmtlpm16:KUX. Unlocking psTable (5D13C1DF.012C-4:kuxcputhrd.cpp,115,"updateProcessStats") Freeing and unlocking psTable Will not see kux03agt issuing the Unlocking psTable or other processing in kux03agt.cpp (5D13C1EC.042C-7:kux03agt.cpp,963,"TakeSample") Unlocking psTable The fix for this APAR is contained in the following maintenance packages: | service pack | 6.3.0.7-TIV-ITM-SP0002
Temporary fix
Stop the situation/historical collection for UNIXDCSTAT attribute group.
Comments
APAR Information
APAR number
IJ16977
Reported component name
ITM AGENT UNIX
Reported component ID
5724C040U
Reported release
630
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Special Attention
NoSpecatt / Xsystem
Submitted date
2019-06-20
Closed date
2019-10-23
Last modified date
2019-10-23
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Fix information
Fixed component name
ITM AGENT UNIX
Fixed component ID
5724C040U
Applicable component levels
[{"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSTFXA","label":"Tivoli Monitoring"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"630","Line of Business":{"code":"LOB45","label":"Automation"}}]
Document Information
Modified date:
08 March 2023