Cafe Cerebral - CHAID

CHAID, or Chi-squared Automatic Interaction Detection, is an exploratory method, or more precisely an algorithm, for studying the relationship between a dependent variable and a series of predictor variables. The algorithm selects the predictors, and the interactions among them, that best predict the dependent variable. The end result is a classification tree. The dependent variable can be qualitative or quantitative.

The CHAID model, or CHAID diagram, can be thought of as an inverted tree trunk that splits into branches and sub-branches. Initially the "tree trunk" is the totality of all the participants in the study. A series of predictor variables is examined to see whether splitting the sample on these predictors leads to a statistically significant discrimination in the dependent variable. For this, chi-square tests (for a qualitative dependent variable) or F tests (for a quantitative one) are performed and their p-values are calculated. If the p-value for a pair of categories of a predictor is not statistically significant, the algorithm merges those categories. If the difference is statistically significant, a split is made. This becomes the first branching of the tree. Then, for each of the resulting groups, we ask whether it can be further split into subgroups that differ significantly on the dependent variable.
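The split decision described above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not a full CHAID implementation: it handles a qualitative dependent variable only, it skips the category-merging step and any multiple-comparison adjustment of the p-values, and the function names (`chi2_sf`, `chi_square_test`, `best_split`), the 0.05 threshold, and the toy data are all made up for this example.

```python
import math
from collections import Counter


def chi2_sf(x, df):
    """Upper-tail probability P(X >= x) of a chi-square variable with integer df."""
    if x <= 0:
        return 1.0
    half = x / 2.0
    if df % 2 == 0:
        # Even df: exp(-x/2) times a finite sum of (x/2)^k / k!.
        term, total = 1.0, 1.0
        for k in range(1, df // 2):
            term *= half / k
            total += term
        return math.exp(-half) * total
    # Odd df: an erfc term plus a finite sum over half-integer powers.
    total = sum(half ** (k - 0.5) / math.gamma(k + 0.5)
                for k in range(1, (df - 1) // 2 + 1))
    return math.erfc(math.sqrt(half)) + math.exp(-half) * total


def chi_square_test(predictor, outcome):
    """Pearson chi-square test of independence; returns (statistic, df, p_value)."""
    n = len(predictor)
    cells = Counter(zip(predictor, outcome))   # observed cell counts
    row_totals = Counter(predictor)
    col_totals = Counter(outcome)
    stat = 0.0
    for r, r_n in row_totals.items():
        for c, c_n in col_totals.items():
            expected = r_n * c_n / n
            stat += (cells[(r, c)] - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    if df == 0:
        # A predictor or outcome with a single category cannot discriminate.
        return stat, 0, 1.0
    return stat, df, chi2_sf(stat, df)


def best_split(predictors, outcome, alpha=0.05):
    """Return (name, p_value) of the most significant predictor, or None."""
    p_values = {name: chi_square_test(values, outcome)[2]
                for name, values in predictors.items()}
    name = min(p_values, key=p_values.get)
    return (name, p_values[name]) if p_values[name] < alpha else None


# Hypothetical toy sample: 40 participants, outcome = bought the product or not.
outcome = ["yes"] * 16 + ["no"] * 4 + ["yes"] * 4 + ["no"] * 16
predictors = {
    "region": ["north"] * 20 + ["south"] * 20,  # strongly related to outcome
    "gender": ["m", "f"] * 20,                  # unrelated to outcome
}

split = best_split(predictors, outcome)
print(split)  # "region" is selected as the first branching of the tree
```

In a complete tree-building procedure the same test would be applied again inside each resulting group (here, north and south) until no predictor gives a significant split.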

At the end of the tree-building process we have a series of groups that are significantly different from one another on the dependent variable. CHAID analysis is advantageous when we are looking for patterns in complicated datasets. The variables can be categorical or interval in nature. Moreover, not all the predictor variables need to be measured at the same level of measurement.

CHAID is a useful method of summarizing data and is analogous to stepwise regression. Companies use CHAID extensively for segmenting the market, as a tool for identifying the characteristics of the consumers of a certain product or service. It is also used in market research where identifying customer profiles is necessary for acquiring customers through direct mailing. A lot of psychological research has used CHAID to model patterns of human behavior.

CHAID plays a major role in the field of data mining too.

© 2005 - 2009 Mu Sigma. All rights reserved