Welch's t-Test#
Welch's t-test (also known as the unequal variances t-test) is a variant of the independent samples t-test that does not assume equal variances. It uses the Welch-Satterthwaite approximation to adjust the degrees of freedom and delivers reliable results even with unequal variances and unequal sample sizes. Many statisticians now recommend it as the default procedure instead of the classic Student's t-test.
When to Use#
- You are comparing the means of two independent groups
- The variances in the two groups are not equal (Levene's test significant) or you do not want to make this assumption
- The sample sizes are unequal β this is precisely where Welch's test outperforms the classic t-test
- You want a robust test that loses only minimal power even when variances happen to be equal
- Data are approximately normally distributed but variances are heteroscedastic
Assumptions#
- Normal distribution in both groups (Shapiro-Wilk test, QQ plot)
- Independence of observations (no repeated measures)
- Continuous (interval or ratio scaled) dependent variable
- Homogeneity of variances is NOT required
Note: The key advantage of Welch's t-test over the classic t-test is that it does not require equal variances. When variances are equal, it yields nearly identical results to the classic t-test (only marginally more conservative). Therefore, Delacre et al. (2017) recommend using Welch's test as the default approach.
Formula#
The test statistic of Welch's t-test:
Where and are the group means, and are the group variances, and and are the sample sizes.
The degrees of freedom are calculated using the Welch-Satterthwaite approximation:
These degrees of freedom are typically not whole numbers and fall between and .
Example#
Practical Example: Salary Comparison Between Departments
An HR analyst compares salaries in the Marketing department (n = 45) and the IT department (n = 120). The sample sizes and variances differ considerably.
- Marketing: ,
- IT: ,
Levene's test is significant (), confirming unequal variances. The classic t-test would be inappropriate here.
Welch's t-test:
- (Welch-Satterthwaite)
- (two-tailed)
- (medium effect)
The IT department earns significantly more than the Marketing department, even after accounting for unequal variances and sample sizes.
Effect Size#
The effect size is calculated with Cohen's d, the same as for the classic t-test:
| Effect Size | |d| |
|---|---|
| Small | 0.20 |
| Medium | 0.50 |
| Large | 0.80 |
When variances differ substantially, Glass's delta () can be used as an alternative, which uses only the standard deviation of the control group in the denominator.
Further Reading
- Delacre, M., Lakens, D. & Leys, C. (2017). Why psychologists should by default use Welch's t-test instead of Student's t-test. International Review of Social Psychology, 30(1), 92β101.
- Ruxton, G. D. (2006). The unequal variance t-test is an underused alternative to Student's t-test and the MannβWhitney U test. Behavioral Ecology, 17(4), 688β690.
- Zimmerman, D. W. (2004). A note on preliminary tests of equality of variances. British Journal of Mathematical and Statistical Psychology, 57(1), 173β181.