ANOVA or Analysis of Variance is an important tool in
statistics which tests the significance of difference of means for
two or more groups. A t-test is used normally when
two groups of observations needs to be compared. If the
number of groups increase, for e.g. five groups, then
we would have to perform in all 5C2 or ten different
t-tests.The drawback with using this method is that, as the number of groups increase, the
number of t-tests increase geometrically and leads to increase in
Type I error. In such a situation we resort to the technique of
ANOVA. One would consider the name ANOVA a misnomer as because we
are comparing means. But in reality, the name comes from the fact
that in order to test the significance of difference of means we are
actually analyzing the variances.
The kernel of ANOVA lies in partitioning its total variation into
two groups, variation within groups and variation between groups.
This is also known as partitioning of its total sum of squares. The
variation within groups is known as sum of squares due to error or
SSE, as this variation cannot be explained and that for variation
between groups is known as sum of squares due to effect or
SS(Effect). We consider the mean sum of squares respectively and
take their ratio. If there is no significant difference in means
then we can expect MS(Effect) and MSE to be of the same order or
consequently their ratio, very small. But if there is actually a
significant difference between the groups, then this difference
would be reflected in the MS(Effect). Then our null hypothesis (that
there is no significant differences between groups in the
population) would be rejected. The variable that is measured is
called the dependent variable and the variables that are manipulated
or controlled are called factors or independent variables. In
comparison to t-tests, this method is more powerful and more
flexible. Moreover, ANOVA allows us to detect interaction effects
between variables too. |