Table of Contents

Statistics - (Student's) t-test (Mean Comparison)

About

The t-test is a test that compares means.

T-tests:

The two last ones are really common.

A t-test is a parametric statistic.

History

The Sutdent's t-test was developed by William Gossett in 1908. He was not only a statistics expert, but he was also a beer expert. He monitored the quality of stout beer at the Guinness brewery in Dublin, Ireland. He wanted to be empirical about it, and he sort of randomly assigned different things to different batches and then, developed this test to compare means.

But the management at Guinness were so protective of their brand, that they did not want William Gossett to share his procedures with anyone else. They didn't even want him to publish this in an academic journal.

They convinced him to publish under the name student.

More: Wine, Beer, and Statistics

Test statistic

Definition

The z-test and the single sample t-test are pretty rare because the population parameters are typically not known.

Calculation

The z or t test statistic is some observed value minus an expected value relative to standard error.

<MATH> \text{z or t} = \frac{(\text{Observed} - \text{Expected})}{\href{Standard_Error}{\text{Standard Error}}} </MATH>

Test Observed Expected Standard Error
z Sample mean Population mean Standard Error of the mean
t (single sample) Sample mean Population mean Standard Error of the mean
t (dependent) Sample mean of difference score Population mean of difference score Standard Error of the mean difference
t (independent) Difference between two sample mean Difference between two population mean Standard Error of the difference between mean

In the dependent test, the observed is the sample mean of difference scores. As we have the same people measured twice, we can calculate the difference score for each individual, and then average the difference scores. That's not the case in the independent t-test as we have two different groups. In the independent t-test, we get the mean for the group one and the mean for group two, and then, look at the difference between the means.

p-value

The p-values for z and t (back to the central limit theorem) is dependent on:

degree of freedom

Test degree of freedom
z Non Applicable
t (single sample) <math>N - 1</math>
t (dependent) <math>N - 1</math>
t (independent) <math>(N_1 - 1) + (N_2 - 1)</math>

Example

Preferences comparison for a political candidate.

???? Taking a sample size of 10, you find that your group performs 1.5 units of standard error higher the population mean for the test. You can then conclude that your group is not significantly different from the population. ????