PickMyTest

Paired Samples t-Test

Compare means of two related measurements

Paired Samples t-Test#

The paired samples t-test (also: dependent samples t-test) tests whether the means of two related measurements differ significantly. Typical use cases include before-and-after measurements on the same subjects.

When to Use#

Use the paired t-test when you want to:

  • Compare two related measurements (e.g., before vs. after)
  • The dependent variable is metric (continuous)
  • The differences between paired values are approximately normally distributed
  • Each observation in one group can be matched to exactly one observation in the other group

Assumptions#

  • Paired observations (each measurement has a corresponding pair)
  • Metric scale of the dependent variable
  • Normal distribution of the differences (apply Shapiro-Wilk test to the differences)
  • Independence of pairs (the pairs themselves are independent of each other)

Note: The normality assumption refers to the differences of the paired values, not the raw values themselves. If this assumption is violated, the Wilcoxon signed-rank test is the appropriate alternative.

Formula#

The test statistic is based on the differences di=X1iβˆ’X2id_i = X_{1i} - X_{2i}:

t=dˉsd/nt = \frac{\bar{d}}{s_d / \sqrt{n}}

where:

  • dΛ‰\bar{d} is the mean of the differences
  • sds_d is the standard deviation of the differences
  • nn is the number of pairs

The mean and standard deviation of the differences are calculated as:

dΛ‰=1nβˆ‘i=1ndi\bar{d} = \frac{1}{n} \sum_{i=1}^{n} d_i sd=1nβˆ’1βˆ‘i=1n(diβˆ’dΛ‰)2s_d = \sqrt{\frac{1}{n-1} \sum_{i=1}^{n} (d_i - \bar{d})^2}

The test statistic follows a t-distribution with df=nβˆ’1df = n - 1 degrees of freedom.

Example#

Practical Example: Training Effect on Endurance

A sports physician wants to investigate whether an 8-week endurance training program improves maximum oxygen uptake (VOβ‚‚max). They measure VOβ‚‚max in 25 participants before and after the training.

  • Measurement 1 (Before): VOβ‚‚max before the training program
  • Measurement 2 (After): VOβ‚‚max after 8 weeks of training

Since each person is measured twice, the data are paired. The paired t-test checks whether the mean difference in VOβ‚‚max values differs significantly from zero.

Effect Size#

Cohen's dzd_z as a measure of effect size for paired designs:

dz=dˉsdd_z = \frac{\bar{d}}{s_d}
Effect SizeCohen's d_z
Small0.2
Medium0.5
Large0.8

Tip: Cohen's dzd_z refers to the standardized mean difference. Alternatively, Cohen's davd_{av} can be calculated, which uses the average standard deviation of the two measurement points.

Further Reading

  • Student (1908). The probable error of a mean. Biometrika, 6(1), 1–25.
  • Field, A. (2018). Discovering Statistics Using IBM SPSS Statistics (5th ed.). SAGE.