Troubleshooting WebSphere MQ clusters


Technote (troubleshooting)


Problem (Abstract)

Your WebSphere MQ cluster is having a problem and you need to know how to troubleshoot it. This document describes several cluster issues and how to address them.

Resolving the problem

Cluster Hints and Tips

  • Avoid using the REFRESH CLUSTER command until you have exhausted other options.

  • If you are running WebSphere MQ V7.0.1.0 or later, make sure you have a SYSTEM.CLUSTER.HISTORY.QUEUE in place before using REFRESH CLUSTER. If IBM support is involved in the problem, the history saved to this queue can help determine the root cause of your clustering issue.

  • Make sure your cluster objects have either the CLUSTER or the CLUSNL attribute set: use CLUSTER for an object that appears in a single cluster, and CLUSNL (a namelist of cluster names) for an object that appears in more than one cluster.

  • When setting up or extending a cluster, you should define a CLUSSDR channel from each partial repository to a full repository, never the other way around.

  • Be aware that clustering allows you to send messages to queues elsewhere in the cluster. However, you can get messages only from queues on the local queue manager.
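
As a rough illustration of the channel and attribute hints above, the following MQSC sketch joins a partial repository to a cluster and advertises a queue in it. The cluster names (DEMO, DEMO2), channel names, namelist name, and host names are placeholders invented for this example, so substitute your own values. The CLUSSDR channel name is assumed to match the CLUSRCVR channel defined on the full repository.

    * Run on the partial repository. All names below are placeholders.
    DEFINE CHANNEL(DEMO.PARTIAL1) CHLTYPE(CLUSRCVR) TRPTYPE(TCP) CONNAME('partial1.example.com(1414)') CLUSTER(DEMO)
    DEFINE CHANNEL(DEMO.FULL1) CHLTYPE(CLUSSDR) TRPTYPE(TCP) CONNAME('full1.example.com(1414)') CLUSTER(DEMO)
    * Queue that appears in one cluster: set CLUSTER.
    DEFINE QLOCAL(YOUR.Q.NAME) CLUSTER(DEMO)
    * Queue that appears in more than one cluster: blank out CLUSTER and set CLUSNL.
    DEFINE NAMELIST(DEMO.CLUSTERS) NAMES(DEMO, DEMO2)
    ALTER QLOCAL(YOUR.Q.NAME) CLUSTER(' ') CLUSNL(DEMO.CLUSTERS)

CLUSTER and CLUSNL are mutually exclusive on an object, which is why the last command blanks one while setting the other.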


Check the Cluster Status

  • Make sure the cluster repository process (amqrrmfa) for your queue manager is running. If this process has ended abnormally, restart the queue manager in order to get it running again.

  • Make sure your cluster queue managers and cluster channels are in a good working status:

    DISPLAY CLUSQMGR(*) ALL
    DISPLAY CHSTATUS(*) WHERE(CHLTYPE EQ CLUSSDR) ALL
    DISPLAY CHSTATUS(*) WHERE(CHLTYPE EQ CLUSRCVR) ALL

  • If your cluster channels are not working, or your cluster queue managers show a "SYSTEM.TEMPUUID" value, which indicates a communications problem, review the WebSphere MQ channel troubleshooting page for advice on clearing up channel problems.

  • Make sure you can see the cluster queues you are using:

    DISPLAY QCLUSTER(YOUR.Q.NAME) ALL
    DISPLAY Q(YOUR.Q.NAME) CLUSINFO

  • Be aware that partial repository queue managers will not display cluster queues which they have not accessed recently. If you run a program locally which accesses (MQOPENs) the cluster queue, you should then see it displayed.
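
When the ALL output from the commands above is too verbose, a narrower set of attributes can make problems easier to spot. This is only a sketch: the attributes chosen here are one reasonable selection, not a required set, and YOUR.Q.NAME is a placeholder.

    * Show repository type, channel state and suspend state for each cluster queue manager.
    DISPLAY CLUSQMGR(*) QMTYPE STATUS SUSPEND
    * Show only the state of each cluster-sender channel.
    DISPLAY CHSTATUS(*) WHERE(CHLTYPE EQ CLUSSDR) STATUS
    * Show which queue managers host the cluster queue, and in which cluster.
    DISPLAY QCLUSTER(YOUR.Q.NAME) CLUSQMGR CLUSTER

A QMTYPE of REPOS marks a full repository and NORMAL marks a partial repository. A SUSPEND value of YES means the queue manager has been suspended from the cluster.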

Workload Balancing and Round-Robin Processing

  • Your cluster queue should have the parameter DEFBIND set to NOTFIXED. Otherwise, a program that opens the queue with the default binding sends all of its messages to a single instance of the queue rather than spreading them across the instances in the cluster.

  • Any MQI application sending messages should use the MQOPEN option MQOO_BIND_NOT_FIXED for precisely the same reason.

  • Any MQI application opening a cluster queue should leave the queue manager name empty in the object descriptor. If the application sets the MQOD.ObjectQMgrName field, then instances of the cluster queue on other queue managers will be ineligible to receive messages.

  • If your queue manager has a local instance of a cluster queue, local applications will default to sending all of their messages to it. You can change this behavior by modifying the queue manager:

    DISPLAY QMGR CLWLUSEQ
    ALTER QMGR CLWLUSEQ(ANY)

  • Make sure your cluster channels are running properly in order to achieve an even distribution of messages. Use CLWLRANK rather than CLWLPRTY if you want the cluster workload algorithm to ignore cluster channel status when distributing messages to cluster queues.
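
The queue and channel settings described in this section could be checked and changed along the following lines. This is a minimal sketch, assuming a cluster-receiver channel named DEMO.QM1 (a placeholder) and a rank value picked only for illustration. Repeat the queue changes on every queue manager that hosts an instance of the queue.

    * Check the current binding and local-queue preference of the queue.
    DISPLAY QLOCAL(YOUR.Q.NAME) DEFBIND CLWLUSEQ
    * Let messages be spread across all instances of the queue.
    ALTER QLOCAL(YOUR.Q.NAME) DEFBIND(NOTFIXED)
    * Rank a destination by its cluster-receiver channel (0 to 9, higher is preferred).
    ALTER CHANNEL(DEMO.QM1) CHLTYPE(CLUSRCVR) CLWLRANK(1)

The queue-level CLWLUSEQ attribute defaults to QMGR, in which case the queue manager setting shown earlier applies.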

Product Alias/Synonym

WebSphere MQ WMQ

Copyright and trademark information

IBM, the IBM logo and ibm.com are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.

Document information

WebSphere MQ

Clustering


Software version:
5.3, 6.0, 7.0, 7.1, 7.5


Operating system(s):
AIX, HP-UX, IBM i, Linux, Solaris, Windows


Software edition:
All Editions


Reference #:
1620882


Modified date:
2012-12-26
