K-Means Cluster Analysis Options

Statistics. You can select the following statistics: initial cluster centers, ANOVA table, and cluster information for each case.

  • Initial cluster centers. First estimate of the variable means for each of the clusters. By default, a number of well-spaced cases equal to the number of clusters is selected from the data. Initial cluster centers are used for a first round of classification and are then updated.
  • ANOVA table. Displays an analysis-of-variance table which includes univariate F tests for each clustering variable. The F tests are only descriptive and the resulting probabilities should not be interpreted. The ANOVA table is not displayed if all cases are assigned to a single cluster.
  • Cluster information for each case. Displays for each case the final cluster assignment and the Euclidean distance between the case and the cluster center used to classify the case. Also displays Euclidean distance between final cluster centers.

Missing Values. Available options are Exclude cases listwise or Exclude cases pairwise.

  • Exclude cases listwise. Excludes cases with missing values for any clustering variable from the analysis.
  • Exclude cases pairwise. Assigns cases to clusters based on distances that are computed from all variables with nonmissing values.

Specifying Options

This feature requires the Statistics Base option.

  1. From the menus choose:

    Analyze > Classify > K-Means Cluster...

  2. In the K-Means Cluster dialog box, click Options.