Viewing the performance of runtime servers

As an administrator, use Insight Monitor to check that your runtime servers are running as you expect. The interface provides information useful to troubleshoot your cluster.

About this task

Insight Monitor provides useful information to troubleshoot your runtime servers if the cluster is not running as you expect. The Events page shows the number of events, a list of the event types, and the rate at which they are processed. The event load can also be viewed by its distribution across the cluster. The Memory and CPU pages provide information about the capacity allocation usage and how much processing is being consumed by the servers.

Procedure

  1. In the menu bar, select Event to display the three Event submenus:
    Note: The Event pages display event totals from when the runtime server was started. If a server is restarted, the event counts is reset. The event counts also includes time triggers, which the system schedules, so it can be higher than expected.
    1. Click Event Rates to view the events rates. The event rates give information about the actual number of events that are processed per second based on the most current information available.
    2. Click Event Distribution to view the event distribution across all of the runtime. The distribution is a percentage calculated by the event count per server divided by the total number of events processed.
    3. Click Event Count to view the total number of events processed per server. The event counts also include retries of event processing as a result of processing errors.
  2. Click Memory to view the memory utilization.

    It shows the total, used, and free heap usage on each server. It indicates whether the system is running low on Java heap. If the memory usage is near the configured amount, you might want to add more RAM and increase the size of the Java virtual machines.

  3. Click CPU to view the CPU utilization.

    It displays the central Processing Unit (CPU) percentage that the server is using. It shows that the system is running efficiently. If the system has a high event rate, the CPU utilization must be near 100%, which indicates that the system is processing the backlog of events as quickly as possible.

Example

The following figure shows the Event Counts page. The AirportSolution has a single rule agent that processed all of the event count for the solution, and the majority of the events are processed by the second server. The event types are listed below the agent name. The event type named airport.FlightMovementEvent shows 99% of the total event count.

This image shows the total number of events processed per server. The event type named airport.FlightMovementEvent shows 99% of the total event count.

The following figure shows the Event Rates page. The event rates vary across the servers and can be used to monitor the processing rate of events on each servers in real-time. 

This image shows the event rates processed by four servers. The event type named airport.FlightMovementEvent shows 99% of the total event rates.