PH12258: Client ObjectGrid instances can leak when they are not destroyed.

APAR status

Closed as program error.

Error description

System outages can occur when applications are coded to create
many client ObjectGrid instances and not destroy them when
done with them. An example symptom can be OutOfMemoryError
exceptions occuring in your application that is a client to an
eXtreme Scale data grid. Also when stopping client applications
with many ObjectGrid instances for the same ObjectGrid, errors
can occur on the containers side like those below:

XSThreadPool W CWOBJ7853W: Detected a hung thread named
"XIOPrimaryPool : 107" TID:434 BLOCKED. Executing since
5/16/2019 01:02:09:699 +0200.
Stack Trace:
com.ibm.ws.xs.pubsub.publication.Publisher.dispatchException(Pub
lisher.java:772)
com.ibm.ws.xsspi.xio.actor.DispatchExceptionListenerRegistry.dis
patchExceptionEncountered(DispatchExceptionListenerRegistry.java
:249)
com.ibm.ws.xsspi.xio.actor.DispatchExceptionListenerRegistry.rec
eive(DispatchExceptionListenerRegistry.java:78)
com.ibm.ws.xs.xio.actor.impl.XIOReferableImpl.dispatch(XIORefera
bleImpl.java:110)
com.ibm.ws.xsspi.xio.actor.XIORegistry.sendToTarget(XIORegistry.
java:973)
com.ibm.ws.xsspi.xio.dispatch.MessageDispatcher.tell(MessageDisp
atcher.java:168)
com.ibm.ws.xs.xio.actor.impl.ActorRefImpl.tell(ActorRefImpl.java
:66)
com.ibm.ws.xsspi.xio.dispatch.DispatchExceptionRunnable.sendErro
r(DispatchExceptionRunnable.java:203)
com.ibm.ws.xsspi.xio.dispatch.DispatchExceptionRunnable.run(Disp
atchExceptionRunnable.java:159)
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExec
utor.java:1164)
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExe
cutor.java:634)
com.ibm.ws.objectgrid.thread.XSThreadPool$Worker.run(XSThreadPoo
l.java:309)

Followed by connection lost between Catalog and containers.

00010def LeaderManager I CWOBJ7218I: This catalog server based
on an explicit attempt at communication has determined that
C000WXSP\C000WXSPAS603\C000WXSCONTAINERP603 is lost.

Containers need to be restarted to recover service.

Local fix

Limit number of clients started concurrently.

Problem summary

****************************************************************
* USERS AFFECTED:  Users of eXtreme Scale who use the Spring   *
*                  Cache integration, near cache invalidation  *
*                  or continuous query functions.              *
****************************************************************
* PROBLEM DESCRIPTION: When applications do not                *
*                      appropriately destroy their             *
*                      ObjectGrid instances, they remain in    *
*                      memory and can cause problems.          *
****************************************************************
* RECOMMENDATION:  Use a single ObjectGrid instance per        *
*                  application instead of continually using    *
*                  ObjectGridManager.connect() or destroy      *
*                  ObjectGrid instances when they are          *
*                  no longer needed by your application.       *
****************************************************************
When using Spring Cache integration, near cache invalidation
or continuous query functions, client ObjectGrid instances
would not garbage collect when no longer referenced by an
application.  It is the application's responsibility to destroy
an ObjectGrid instance or disconnect a ClientClusterContext
when the ObjectGrid instance or connection is no longer
needed. However, Spring Cache function does not have the
ability to destroy ObjectGrid instances or disconnect.  To
handle applications not re-using ObjectGrid or Spring Cache
instances appropriately or not destroying instances when done
with them, the Spring Cache integration, near cache
invalidation and continuous query functions were updated to
utilize WeakReferences to allow for instances to appropriate
garbage collect and notify containers that the client
ObjectGrid instances are destroyed.

Problem conclusion

An interim fix is available for this APAR upon request.

Temporary fix

Comments

APAR Information

APAR number
PH12258
Reported component name
WS EXTREME SCAL
Reported component ID
5724X6702
Reported release
860
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Special Attention
NoSpecatt / Xsystem
Submitted date
2019-05-20
Closed date
2020-02-24
Last modified date
2020-02-24

APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:

Fix information

Fixed component name
WS EXTREME SCAL
Fixed component ID
5724X6702

Applicable component levels

[{"Business Unit":{"code":"BU053","label":"Cloud & Data Platform"},"Product":{"code":"SSTVLU","label":"WebSphere eXtreme Scale"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"860","Edition":"","Line of Business":{"code":"LOB45","label":"Automation"}}]

Document Information

Modified date:
27 March 2020

Tips

PH12258: Client ObjectGrid instances can leak when they are not destroyed.

Subscribe to this APAR

APAR status

Closed as program error.

Error description

Local fix

Problem summary

Problem conclusion

Temporary fix

Comments

APAR Information

APAR number

Reported component name

Reported component ID

Reported release

Status

PE

HIPER

Special Attention

Submitted date

Closed date

Last modified date

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Fix information

Fixed component name

Fixed component ID

Applicable component levels

Document Information

Share your feedback

Need support?