data_mining:t-test

About

The t-test is a test that compares means.

NHST can be conducted yielding to a p-value
Effect Size can be calculated like in multiple regression.
Confidence Interval around the mean can also be reported

T-tests:

t-test (single sample)
t-test (dependent)
t-test (independent)

The two last ones are really common.

A t-test is a parametric statistic.

Articles Related

History

The Sutdent's t-test was developed by William Gossett in 1908. He was not only a statistics expert, but he was also a beer expert. He monitored the quality of stout beer at the Guinness brewery in Dublin, Ireland. He wanted to be empirical about it, and he sort of randomly assigned different things to different batches and then, developed this test to compare means.

But the management at Guinness were so protective of their brand, that they did not want William Gossett to share his procedures with anyone else. They didn't even want him to publish this in an academic journal.

They convinced him to publish under the name student.

More: Wine, Beer, and Statistics

Test statistic

Definition

the z-test is used when comparing a sample mean to a population mean, and the population standard deviation is known.
the single sample t-test is used when comparing a sample mean to a population mean and the population standard deviation is not known.

The z-test and the single sample t-test are pretty rare because the population parameters are typically not known.

A dependent t-test is appropriate when we have the same people measured twice. We can calculate a different score and then see if the mean different score is signifigantly different from zero. If so, we have a significant change.

The independent t-test is appropriate when we want to compare two separate groups of people or two separate samples. For instance, if we want to compare one condition to another as men versus women.

Calculation

The z or t test statistic is some observed value minus an expected value relative to standard error.

<MATH> \text{z or t} = \frac{(\text{Observed} - \text{Expected})}{\href{Standard_Error}{\text{Standard Error}}} </MATH>

Test	Observed	Expected	Standard Error
z	Sample mean	Population mean	Standard Error of the mean
t (single sample)	Sample mean	Population mean	Standard Error of the mean
t (dependent)	Sample mean of difference score	Population mean of difference score	Standard Error of the mean difference
t (independent)	Difference between two sample mean	Difference between two population mean	Standard Error of the difference between mean

In the dependent test, the observed is the sample mean of difference scores. As we have the same people measured twice, we can calculate the difference score for each individual, and then average the difference scores. That's not the case in the independent t-test as we have two different groups. In the independent t-test, we get the mean for the group one and the mean for group two, and then, look at the difference between the means.

p-value

The p-values for z and t (back to the central limit theorem) is dependent on:

whether the test is directional or non-directional
on the sample size. degree of freedom. Different t-distributions for different sample sizes.

degree of freedom

Test	degree of freedom
z	Non Applicable
t (single sample)	<math>N - 1</math>
t (dependent)	<math>N - 1</math>
t (independent)	<math>(N_1 - 1) + (N_2 - 1)</math>

Example

Preferences comparison for a political candidate.

In the 1st study you recruit liberals and conservatives and have them rate their likelihood to vote for the candidate. → independent t-test
In the 2nd study you ask a group to rate their likelihood to vote for the candidate before and after the candidate makes a speech. dependent t-test
In the 3rd study, you compare your group’s preference for the candidate to the voting population. → single sample t-test

???? Taking a sample size of 10, you find that your group performs 1.5 units of standard error higher the population mean for the test. You can then conclude that your group is not significantly different from the population. ????

Table of Contents

Statistics - (Student's) t-test (Mean Comparison)

About

Articles Related

History

Test statistic

Definition

Calculation

p-value

degree of freedom

Example