![]() ![]() ![]() ![]() ![]() ![]() ![]() |
The SPSS Cluster procedure (Analyze, Cluster, Hierarchical Cluster Analysis) generates all possible clusters of sizes 1...K, but may be used only for relatively small samples. One may wish to use the Cluster procedure on a sample of cases (ex., 200) to inspect results for different numbers of clusters. The optimum number of clusters depends on the research purpose. Identifying "typical" types may call for few clusters and indentifying "exceptional" types may call for many clusters. After using Cluster to determine the desired number of clusters, the researcher may wish then to analyze the entire dataset with the Quick Cluster procedure (Analyze, Cluster, K-Means Cluster Analysis), specifying that number of clusters.
In forward clustering, small clusters are formed by using a high similarity index cut-off (ex., .9). Then this cut-off is relaxed to establish broader and broader clusters in stages until all cases are in a single cluster at some low similarity index cut-off. There are a variety of different measures of inter-observation distances and inter-clustances to use as criteria when merging nearest clusters into broader and broader stages. The merging of clusters is visualized using a tree format.
Backward clustering is the same idea, but starting with a low cut-off and working toward a high cut-off. Forward and backward methods need not generate the same results.
In addition, Clustan includes the following non-hierarchical clustering methods:
Clustan also includes the related procedures: