What is a T Test?
A t test is a statistical hypothesis test that assesses sample means to draw conclusions about population means. Frequently, analysts use a t test to determine whether the population means for two groups are different. For example, it can determine whether the difference between the treatment and control group means is statistically significant.
There are three types of t tests. They all evaluate sample means using t-values, t-distributions, and degrees of freedom to calculate statistical significance. It is a parametric analysis that compares one or two group means.
The following are the standard t tests:
- One-sample: Compares a sample mean to a reference value.
- Two-sample: Compares two sample means.
- Paired: Compares the means of matched pairs, such as before and after scores.
In this post, you’ll learn about the different types of t tests, when you should use each one, and their assumptions. Additionally, I interpret an example of each type.
Which T Test Should I Use?
To choose the correct t test, you must know whether you are assessing one or two group means. If you’re working with two group means, do the groups have the same or different items/people? Use the table below to choose the proper analysis.
Number of Group Means | Group Type | T Test |
One | One sample t test | |
Two | Different items in each group | Two sample t test |
Two | Same items in both groups | Paired t test |
Now, let’s review each t test to see what it can do!
Imagine we’ve developed a drug that supposedly boosts your IQ score. In the following sections, we’ll address the same research question, and I’ll show you how the various t tests can help you answer it.
One Sample T Test
Use a one-sample t test to compare a sample mean to a reference value. It allows you to determine whether the population mean differs from the reference value. The reference value is usually highly relevant to the subject area.
For example, a coffee shop claims their large cup contains 16 ounces. A skeptical customer takes a random sample of 10 large cups of coffee and measures their contents to determine if the mean volume differs from the claimed 16 ounces using a one-sample t test.
One-Sample T Test Hypotheses
- Null hypothesis (H0): The population mean equals the reference value (µ = µ0).
- Alternative hypothesis (HA): The population mean DOES NOT equal the reference value (µ ≠ µ0).
Reject the null when the p-value is less than the significance level (e.g., 0.05). This condition indicates the difference between the sample mean and the reference value is statistically significant. Your sample data support the idea that the population mean does not equal the reference value.
The above hypotheses are two-sided analyses. Alternatively, you can use one-sided hypotheses to find effects in only one direction. Learn more in my article, One- and Two-Tailed Hypothesis Tests Explained.
Related posts: Null Hypothesis: Definition, Rejecting & Examples and Understanding Significance Levels
Example
We want to evaluate our IQ boosting drug using a one-sample t test. First, we draw a single random sample of 15 participants and administer the medicine to all of them. Then we measure all their IQs and calculate a sample average IQ of 109.
In the general population, the average IQ is defined as 100. So, we’ll use 100 as our reference value. Is the difference between our sample mean of 109 and the reference value of 100 statistically significant? The t test output is below.
In the output, we see that our sample mean is 109. The procedure compares the sample mean to the reference value of 100 and produces a p-value of 0.036. Consequently, we can reject the null hypothesis and conclude that the population mean for those who take the IQ drug is higher than 100.
Two-Sample T Test
Use a two-sample t test to compare the sample means for two groups. It allows you to determine whether the population means for these two groups are different. For the two-sample procedure, the groups must contain different sets of items or people.
For example, you might compare averages between males and females or treatment and controls.
Two-Sample T Test Hypotheses
- Null hypothesis (H0): Two population means are equal (µ1 = µ2).
- Alternative hypothesis (HA): Two population means are not equal (µ1 ≠ µ2).
Again, when the p-value is less than or equal to your significance level, reject the null hypothesis. The difference between the two means is statistically significant. Your sample data support the theory that the two population means are different.
Learn more about the two-sample t test.
Related posts: How to Interpret P Values and Statistical Significance
Example
For our IQ drug, we collect two random samples, a control group and a treatment group. Each group has 15 subjects. We give the treatment group the medication and a placebo to the control group.
We’ll use a two-sample t test to evaluate if the difference between the two group means is statistically significant. The t test output is below.
In the output, you can see that the treatment group (Sample 1) has a mean of 109 while the control group’s (Sample 2) average is 100. The p-value for the difference between the groups is 0.112. We fail to reject the null hypothesis. There is insufficient evidence to conclude that the IQ drug has an effect.
Paired Sample T Test
Use a paired t-test when you measure each subject twice, such as before and after test scores. This procedure determines if the mean difference between paired scores differs from zero, where zero represents no effect. Because researchers measure each item in both conditions, the subjects serve as their own controls.
For example, a pharmaceutical company develops a new drug to reduce blood pressure. They measure the blood pressure of 20 patients before and after administering the medication for one month. Analysts use a paired t-test to assess whether there is a statistically significant difference in pressure measurements before and after taking the drug.
Paired T Test Hypotheses
- Null hypothesis: The mean difference between pairs equals zero in the population (µD = 0).
- Alternative hypothesis: The mean difference between pairs does not equal zero in the population (µD ≠ 0).
Reject the null when the p-value is less than or equal to your significance level (e.g., 0.05). Your sample provides sufficiently strong evidence to conclude that the mean difference between pairs does not equal zero in the population.
Learn more about the paired t test.
Example
Back to our IQ boosting drug. This time, we’ll draw one random sample of 15 participants. We’ll measure their IQ before taking the medicine and then again afterward. The before and after groups contain the same people. The procedure subtracts the After — Before scores to calculate the individual differences. Then it calculates the average difference.
If the drug increases IQs effectively, we should see a positive difference value. Conversely, a value near zero indicates that the IQ scores didn’t improve between the Before and After scores. The paired t test will determine whether the difference between the pre-test and post-test is statistically significant.
The t test output is below.
The mean difference between the pre-test and post-test scores is 9 IQ points. In other words, the average IQ increased by 9 points between the before and after measurements. The p-value of 0.000 causes us to reject the null. We conclude that the difference between the pre-test and post-test population means does not equal zero. The drug appears to increase IQs by an average of 9 IQ points in the population.
T Test Assumptions
For your t test to produce reliable results, your data should meet the following assumptions:
You have a random sample
Drawing a random sample from your target population helps ensure it represents the population. Representative samples are crucial for accurately inferring population properties. The t test results are invalid if your data do not reflect the population.
Related posts: Random Sampling and Representative Samples
Continuous data
A t test requires continuous data. Continuous variables can take on all numeric values, and the scale can be divided meaningfully into smaller increments, such as fractional and decimal values. For example, weight, height, and temperature are continuous.
Other analyses can assess additional data types. For more information, read Comparing Hypothesis Tests for Continuous, Binary, and Count Data.
Your sample data follow a normal distribution, or you have a large sample size
A t test assumes your data follow a normal distribution. However, due to the central limit theorem, you can waive this assumption when your sample is large enough.
The following sample size guidelines specify when normality becomes less of a restriction:
- One-Sample and Paired: 20 or more observations.
- Two-Sample: At least 15 in each group.
Related posts: Central Limit Theorem and Skewed Distributions
Population standard deviation is unknown
A t test assumes you have a sample estimate of the standard deviation. In other words, you don’t know the precise value of the population standard deviation. This assumption is almost always true. However, if you know the population standard deviation, use the Z test instead. However, when n > 30, the difference between the t and Z tests becomes trivial.
Related post: Standard Deviations
Thank you for your awesome work.
Your explanation is comprehensive even to non-statisticians
Thanks so much, Daniel. So glad my blog post could help!