Cluster analysis is an exploratory data analysis tool for solving classification problems. Its object is to sort cases (people, things, events, etc) into groups, or clusters, so that the degree of association is strong between members of the same cluster and weak between members of different clusters. A cluster is a group of relatively homogenous cases or observations. Each cluster thus describes, in terms of the data collected, the class to which its members belong; and this description may be abstracted through use from the particular to the general class or type.
Cluster analysis can be applied to data that exhibits "natural" groupings. This analysis sorts through the raw data and groups them into clusters.
The diagram below illustrates the results of a survey that studied drinkers' perceptions of spirits (alcohol). Each point represents the results from one respondent. The research indicates there are four clusters in this market. |