The Goodness of Fit or Chi - Squared Distribution

Thetest can only be used the goodness of fit of a data set to a hypothesised probability distribution.

The observed data is sorted into frequency classes, and for each frequency class, the expected number of observations that would fall into that frequency class is calculated.

The difference between each observed frequency O-i and expected frequency E-i is calculated, squared and divided by the expected frequency. This calculation is performed for each frequency class and the results are all added to give a single number, the test statistic, equal to

If the expected frequency for a frequency class is less than 5,then this group is combined with other frequency classes so that all frequency classes have expected frequencies at least equal to 5.

The significance of the test statistic depends on the number of degrees of freedom,of the data, which is equal to the number of frequency classes AFTER any classes have been combined, c, minus 1, so that