Partial Correlation#
Partial correlation measures the linear association between two variables and after statistically removing the influence of one or more control variables . It is an essential tool for uncovering spurious correlations and isolating the "true" relationship between two variables. When a third variable influences both and , the simple Pearson correlation can paint a misleading picture β partial correlation adjusts for this confounding effect.
When to Use#
- You suspect the association between and may be distorted by a third variable
- You want to check whether an observed correlation is a spurious correlation
- You want to report the adjusted association between two variables
- All variables are metric (interval or ratio scaled)
- You want to control for a confounding variable without running a full regression analysis
Assumptions#
- Metric scale level for all variables (X, Y, and Z)
- Linear relationship between all variable pairs
- Approximate normality of all variables
- Independent observations
- No perfect multicollinearity between variables
Formula#
The first-order partial correlation (controlling for one variable ) is computed from the three bivariate Pearson correlations:
Here, is the correlation between and , is the correlation between and , and is the correlation between and .
Significance is tested using a t-test:
When controlling for variables, .
Example#
Practical Example: Ice Cream Sales, Drowning, and Temperature
A study finds a high positive correlation between ice cream sales () and the number of drowning incidents () at public pools. Does eating more ice cream increase the risk of drowning? Of course not β temperature () is the common cause driving both variables.
Correlations (n = 50 summer days):
- (ice cream sales β drowning incidents)
- (ice cream sales β temperature)
- (drowning incidents β temperature)
Calculating the partial correlation:
Interpretation: The originally strong correlation of drops to after controlling for temperature β a small and likely non-significant association. The observed correlation was largely a spurious correlation caused by the shared confounding variable temperature.
Effect Size#
The partial correlation is itself an effect size measure and is interpreted using the same conventions as Pearson's :
| | Interpretation | |---|---| | 0.10 | Small effect | | 0.30 | Medium effect | | 0.50 | Large effect |
Additionally, the proportion of uniquely explained variance can be calculated:
This value indicates how much variance in is explained by after the influence of has already been accounted for. In the example above, ice cream sales explain only of the variance in drowning incidents after controlling for temperature.
Further Reading
- Cohen, J., Cohen, P., West, S. G. & Aiken, L. S. (2003). Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences (3rd ed.). Lawrence Erlbaum Associates.
- Field, A. (2018). Discovering Statistics Using IBM SPSS Statistics (5th ed.). SAGE Publications.
- Agresti, A. & Finlay, B. (2009). Statistical Methods for the Social Sciences (4th ed.). Pearson.