Confidence Interval
Confidence intervals make a reasoned statement about the true mean of a population based on a random sample. When taking the mean (X) of a random population sample; most likely, the mean of that sample will not be the true mean of the population ( µ ), but rather an estimate. The confidence interval represents a range of values around the sample mean that include the true mean.
| Confidence intervals are written with a percentage; what does this percentage represent? If the researcher were to take 100 random samples with a 95% confidence interval for each sample, then he or she expects that for 95 of the 100 samples (95%), the range of values produced by the confidence interval procedure will include the true mean of the population. Although the researcher in practice only has a single sample, the researcher is confident that their interval contains the true mean due to the process used to calculate the confidence interval. |
A Random Sample from this Population |
The following Java Applet is a visual
representation of this notion of confidence. In the Applet there are 20 different
data sets, which are separated by the green lines. The blue lines are the
data points in the data sets. Each red line represents the confidence range.
The small red dash in the center of the red line is the mean and the lines at
the top and bottom
are the limits of the confidence interval. The pink line is the population mean
and is thus measured from 0 because
this is a "standard normal" population.
Run the Applet 5 times (which gives you information on 100 data sets) with the
alpha level left at its default (.05). Count the number of times the single red
line
in a data set does not intersect the pink line for every run. After 5 runs, this
number should be
5, which gives you your 95% Confidence Interval (100%-5%=95%). For a
series of 5 runs, you may get a few more or less than 5 since confidence
intervals are based on probability theory. An alpha of .01 is a 99% Confidence Interval (note the larger red lines),
which means only 1 line should not intersect in 5 runs (100%-1%=99%), and an alpha
of .1 is a 90% Confidence Interval (note the smaller red lines), which means 10
lines should not intersect in 5 runs (100%-10%=90%). Again, remember that
because of probability, you may get a few more or less than the target number
in
any given series of 5 runs.
The width of the confidence interval increases as the confidence level increases, since with a greater width one is more likely to have included the true mean.
In the previous menu, the Baby Weight Activity uses a confidence interval. The activity assumes that you have access to Excel, a TI-83 calculator or another software package capable of performing descriptive tests. To see the actual math used in finding a confidence interval, go to Confidence Interval Math.
Copyright © 2002 Central Virginia Governor's School, Lynchburg, VA