Configuring the default Failure Detection Protocol for a core group
The default Failure Detection Protocol monitors the core group network connections that the default Discovery Protocol establishes, and notifies the default Discovery Protocol if a connection failure occurs.
Before you begin
- Understand the concepts that are described in the topic Core group discovery and failure detection protocols.
- Check your operating system settings that are relevant to TCP/IP socket closing events.
- Determine your failure detection goals and which settings must change to accomplish these goals.
The value that you specify for the heartbeat timeout period should equal the value of the IBM_CS_FD_PERIOD_SECS property multiplied by the value of the IBM_CS_FD_CONSECUTIVE_MISSED property.
- The heartbeat transmission period specifies the frequency at which a core group member sends a heartbeat packet over every established connection. The default value for the heartbeat transmission period is 30 seconds.
- The heartbeat timeout period specifies the failure detection time. If no packets are received during the specified time period, a failure is declared. The default value for the heartbeat timeout period is 180 seconds.
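The relationship between these values can be sketched in a few lines of Python. This is only an illustration: the function name is hypothetical, and the count of six missed heartbeats is inferred from the 30-second and 180-second defaults rather than stated in this topic.

```python
def heartbeat_timeout_secs(period_secs, consecutive_missed):
    """Return the failure detection time implied by the two values.

    Assumes the timeout is the transmission period multiplied by the
    number of consecutive missed heartbeats (hypothetical helper).
    """
    if period_secs <= 0 or consecutive_missed <= 0:
        raise ValueError("both values must be positive")
    return period_secs * consecutive_missed


# A 30-second period with six missed heartbeats (an inferred count)
# corresponds to the 180-second default timeout.
default_timeout = heartbeat_timeout_secs(30, 6)
print(default_timeout)
```

Shortening the transmission period while keeping the same count shortens the failure detection time proportionally, at the cost of more heartbeat traffic.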
About this task
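You might need to change the default Failure Detection Protocol settings in the following situations:
- You want to change the failover characteristics of your system.
- Your core groups are large and analysis indicates that excessive CPU time is spent monitoring heartbeats.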
The heartbeat transmission period and heartbeat timeout period are configurable. Use the administrative console or the wsadmin tool to adjust these settings if the default values are not appropriate for your environment, unless you are running in a mixed cell environment that includes core groups that contain a mixture of Version 7.0 and Version 6.x processes.
- In the administrative console, click core_group_name. Then, in the Additional Properties section, click .
- In the name field, specify either IBM_CS_FD_PERIOD_SECS or IBM_CS_FD_CONSECUTIVE_MISSED, and then specify a new value for the property in the value field.
The IBM_CS_FD_PERIOD_SECS custom property specifies how frequently the Failure Detection Protocol checks the core group network connections that the Discovery Protocol establishes.
The IBM_CS_FD_CONSECUTIVE_MISSED custom property specifies the number of consecutive heartbeats that a member can miss before communication with that member is discontinued.
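The effect of the consecutive missed heartbeat count can be illustrated with a small Python simulation. This is a hedged sketch, not the product's implementation: the class and method names are hypothetical, and it assumes that a failure is declared once the number of consecutive missed heartbeats reaches the configured count.

```python
class MissedHeartbeatCounter:
    """Hypothetical model of a consecutive-missed-heartbeat check."""

    def __init__(self, consecutive_missed_limit):
        self.limit = consecutive_missed_limit
        self.missed = 0
        self.failed = False

    def on_period_elapsed(self, heartbeat_received):
        """Record one check period and return True once a failure is declared."""
        if self.failed:
            return True
        if heartbeat_received:
            # Any received heartbeat resets the consecutive count.
            self.missed = 0
        else:
            self.missed += 1
            if self.missed >= self.limit:
                self.failed = True
        return self.failed


counter = MissedHeartbeatCounter(3)
# Two misses, one heartbeat (resets the count), then three misses in a row.
observations = [False, False, True, False, False, False]
results = [counter.on_period_elapsed(seen) for seen in observations]
print(results)
```

In this model only an unbroken run of misses triggers a failure, so an occasional late heartbeat does not cause a member to be dropped.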
Remember, when you use the administrative console or the wsadmin tool to configure the Failure Detection Protocol, you configure the heartbeat transmission period and the heartbeat timeout period. However, if you use the custom properties to configure the Failure Detection Protocol, you configure the heartbeat transmission period and the number of consecutive missed heartbeats.
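To show how the two configuration styles relate, the following Python sketch translates a transmission period and timeout period into the equivalent pair of custom property values. The function name is hypothetical, and the requirement that the timeout be a whole multiple of the period is an assumption of this sketch, not a documented rule.

```python
def to_custom_properties(transmission_period_secs, timeout_period_secs):
    """Translate period/timeout settings into custom property values.

    Hypothetical helper: assumes the timeout is a whole multiple of the
    transmission period.
    """
    if transmission_period_secs <= 0:
        raise ValueError("transmission period must be positive")
    missed, remainder = divmod(timeout_period_secs, transmission_period_secs)
    if remainder != 0 or missed < 1:
        raise ValueError("timeout must be a whole multiple of the period")
    # Custom property values are supplied as strings.
    return {
        "IBM_CS_FD_PERIOD_SECS": str(transmission_period_secs),
        "IBM_CS_FD_CONSECUTIVE_MISSED": str(missed),
    }


props = to_custom_properties(30, 180)
print(props)
```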
To use the administrative console to change the settings for the default Failure Detection Protocol, complete the following steps.