Correlation-Causation Fallacy
What is it?
Correlation-Causation Fallacy is a bias that makes people believe that when two events happen together, one causes the other.
The correlation-causation fallacy occurs when someone mistakenly assumes that because two variables are correlated, one must cause the other. In reality, correlation simply means that two variables are related in some way, but it doesn't necessarily imply that one causes the other. There could be other factors involved, or the correlation could be purely coincidental.
Here are two simple examples to help you understand the correlation-causation fallacy:
Ice cream sales and drowning incidents: Suppose you notice that as ice cream sales increase, so do the number of drowning incidents. It would be a correlation-causation fallacy to assume that eating ice cream causes people to drown. In reality, both ice cream sales and drowning incidents are likely to increase during summer months when the weather is warm, people spend more time at the beach or pool, and ice cream is more appealing. The common factor here is the warm weather, not the ice cream causing drowning.
Shoe size and reading ability: Imagine you find that children with larger shoe sizes tend to have better reading skills. It would be a correlation-causation fallacy to assume that having larger feet causes children to become better readers. Instead, both shoe size and reading ability are related to a child's age—older children generally have larger feet and more advanced reading skills. The common factor is age, not shoe size causing better reading skills.
To avoid the correlation-causation fallacy, it's essential to consider other potential explanations for a correlation and not jump to conclusions about causation without further evidence.
The correlation-causation fallacy, also known as cum hoc ergo propter hoc (Latin for "with this, therefore because of this"), is a logical error that occurs when someone incorrectly assumes that because two variables are correlated, one must cause the other. While correlation can provide evidence of a relationship between two variables, it does not imply causation, and inferring causality from correlation alone can lead to erroneous conclusions.
The correlation-causation fallacy is related to several other principles and scientific topics, including:
Spurious correlations: Correlations that arise due to chance or the presence of a common underlying factor, known as a confounding variable, rather than a causal relationship. Spurious correlations can lead to the correlation-causation fallacy when individuals mistakenly infer causation from these coincidental relationships.
Confounding variables: Factors that are related to both the independent and dependent variables in a study and can distort the observed relationship between them. Accounting for confounding variables is crucial to establish causality, as they can create or obscure the true causal relationship between variables.
Experimental design: Proper experimental design, particularly through the use of randomized controlled trials, can help establish causality by controlling for potential confounding variables and isolating the effect of the independent variable on the dependent variable. Observational studies, on the other hand, can only demonstrate correlations, making it difficult to infer causality without additional evidence or analyses.
Granger causality: A statistical concept used to test whether one time series can predict another, which can provide evidence of a causal relationship between variables. While Granger causality can help identify potential causal relationships in time series data, it cannot establish causality definitively, as other factors may still confound the relationship.
Understanding the correlation-causation fallacy and its connections to other scientific concepts can help researchers and decision-makers avoid drawing erroneous conclusions from correlational data and instead seek more rigorous evidence to establish causality.
References
- Holland, P. W. (1986). Statistics and causal inference. Journal of the American Statistical Association, 81(396), 945-960.
- Pearl, J. (2009). Causality: Models, Reasoning, and Inference (2nd ed.). Cambridge University Press.
- Granger, C. W. J. (1969). Investigating Causal Relations by Econometric Models and Cross-spectral Methods. Econometrica, 37(3), 424-438.
- Shadish, W. R., Cook, T. D., & Campbell, D. T. (2002). Experimental and Quasi-Experimental Designs for Generalized Causal Inference. Houghton Mifflin.