Two Sample t-test A t-test is used to determine if there is a
"statistical" difference Testing at the River
Now, let's assume the researcher wants to test another site down river to see if an industrial complex is polluting the river and reducing its DO content. She collects 12 sample readings from the original site and calculates the mean of the data set. She then collects 12 sample readings at a site down river from the industrial complex and calculates the mean for the set? Is the difference between these two means due to normal random variations of the DO in the water or is it due to pollution from the industrial complex? Figure 1 - T-Test
A t-test does a statistical analysis that looks at the size of the difference between the two means. If the river is not polluted, the sites above and below the industrial complex have the same water quality. Therefore, the difference in the sample means are due to normal random variations just like the difference in the sample means taken from the same spot. The samples are taken from the same "population" of DO values and the range of mean differences expected due to normal random variations in the "population" can be calculated statistically. On the other hand, if the river is being polluted, the samples taken down river from the industrial complex will represent a different "population" of DO values. The difference between the means of the data sets taken above and below the industrial complex represent the influence of an outside factor that altered the DO in the water. Hence, the mean difference between the two sites will be larger than the differences expected due to random normal variations in the original population. Bottom Line
In the previous menu, the Coliform and Football activities use t-tests to analyze data sets. The activities assume that you have access to Excel, a TI-83 calculator or another software package capable of performing inferential tests. Copyright © 1997 Central Virginia Governor's School for Science and Technology Lynchburg, VA |