Monitoring overall system health

Monitoring overall system health is important to ensure your system is performing well. This includes web servers, application servers, databases, back-end systems, and any other systems critical to running your web site.

Before you begin

If any system has a problem, it might cause the servlet is slow message to appear. IBM® and several other business partners leverage the WebSphere® APIs to capture performance data and to incorporate it into an overall 24-by-7 monitoring solution. WebSphere Application Server provides Performance Monitoring Infrastructure (PMI) data to help monitor the overall health of the WebSphere Application Server environment. PMI provides average statistics on WebSphere Application Server resources, application resources, and system metrics. Many statistics are available in WebSphere Application Server, and you might want to understand the ones that most directly measure your site's resources to detect problems.

About this task

Table 1. Monitoring overall system health . To monitor overall system health, monitor the following statistics at a minimum:
Metric Meaning
Average response time Include statistics, for example, servlet or enterprise beans response time. Response time statistics indicate how much time is spent in various parts of WebSphere Application Server and might quickly indicate where the problem is (for example, the servlet or the enterprise beans).
Number of requests (transactions) Enables you to look at how much traffic is processed by WebSphere Application Server, helping you to determine the capacity that you have to manage. As the number of transactions increase, the response time of your system might be increasing, showing the need for more system resources or the need to retune your system to handle increased traffic.
Number of live HTTP sessions The number of live HTTP sessions reflects the concurrent usage of your site. The more concurrent live sessions, the more memory is required. As the number of live sessions increase, you might adjust the session time-out values or the Java™ virtual machine (JVM) heap available.
[AIX Solaris HP-UX Linux Windows][IBM i]Web server thread pools [AIX Solaris HP-UX Linux Windows][IBM i]Interpret the web server thread pools, the web container thread pools, and the Object Request Broker (ORB) thread pools, and the data source or connection pool size together. These thread pools might constrain performance due to their size. The thread pools setting can be too small or too large, therefore causing performance problems. Setting the thread pools too large impacts the amount of memory that is needed on a system or might cause too much work to flow downstream if downstream resources cannot handle a high influx of work. Setting thread pools too small might also cause bottlenecks if the downstream resource can handle an increase in workload.
[AIX Solaris HP-UX Linux Windows][IBM i]The web and Enterprise JavaBeans (EJB) thread pools
Database and connection pool size
Java virtual memory (JVM) Use JVM metrics to understand the JVM heap dynamics, including the frequency of garbage collection. This data can assist in setting the optimal heap size. In addition, use the metric to identify potential memory leaks.
CPU Observe these metrics to know if you are at or near the maximum capacity of your system resources.
I/O
System paging

[z/OS]WebSphere Application Server for z/OS® relies on WLM services to collect some of the accounting and performance data.

[z/OS]Resource Measurement Facility (RMF) and RMF-written System Management Facility (SMF) records present performance and accounting information to the WebSphere Application Server. In addition, the WebSphere Application Server for z/OS has SMF records that collect additional domain-specific information

[z/OS]Turn off the SMF records or RMF data using the administrative console and the SMFPRMxx parmlib member if you do not need the information. Use the SMFPRMxx parmlib member to control the detail of the WebSphere Application Server for z/OS SMF records. If you need SMF information, review the SMF records to ensure you are collecting only the record types and details that you need.

[z/OS]Setting up your workload manager goals and filtering criteria is beyond the scope of this section. You can classify work into service classes based on user ID and server name. Classify the control regions as reasonably high-performing system tasks

To monitor several of these statistics, WebSphere Application Server provides the Performance Monitoring Infrastructure to obtain the data, and provides the Tivoli® Performance Viewer in the administrative console to view this data.

Procedure

  1. Enable PMI through the administrative console to begin data collection.
  2. Use Tivoli Performance Viewer or other performance monitoring and management solutions to monitor performance. These other solutions include wsadmin scripting, PerfServlet, IBM Tivoli Composite Application Manager for WebSphere Application Server, third-party performance monitoring and management tools, or your own monitoring applications.
  3. Extend monitoring capabilities by developing your own monitoring applications or extending PMI.