How to lie with statistics [4] - Darrell Huff
Chapter 4 - Much Ado about Practically Nothing
This chapter delves into the concept of statistical significance and how it can be misinterpreted or misused to make results seem more important than they actually are.
Key Points:
Statistical Significance vs. Practical Significance: The chapter emphasizes the distinction between statistical significance and practical significance. A result might be statistically significant, meaning it's unlikely to have occurred by chance, but the actual difference or effect might be so small that it's not meaningful in a practical sense. For example, a study might find that a new drug lowers cholesterol levels by an average of 2 mg/dL compared to a placebo. This difference could be statistically significant with a large enough sample size, but the practical significance is questionable. A 2 mg/dL reduction might not have a noticeable impact on a person's health.
Small Differences: Even a tiny difference between two groups can be statistically significant if the sample size is large enough. However, this doesn't necessarily mean the difference is important or relevant in the real world. For example, a new drug might be shown to be statistically significantly more effective than an existing drug, but the actual improvement in patient outcomes might be negligible. For instance, a survey of thousands of people might find that those who use a particular brand of shampoo have 0.5% fewer split ends than those who don't. While this difference might be statistically significant due to the large sample size, it's unlikely to be noticeable or meaningful to individuals.
Large Sample Sizes: The chapter highlights how large sample sizes can amplify the appearance of statistical significance. With a large enough sample, even minor differences can become statistically significant, even if they have no practical implications. This can lead to the misinterpretation of results and the overemphasis of trivial findings. For instance, a study with a massive sample size might find that people who drink a cup of coffee a day live 3 days longer on average than those who don't. While this might be statistically significant, the effect size is small and could be due to other factors not accounted for in the study
Misleading Interpretations: Statistical significance is often misinterpreted as proof of a causal relationship. However, correlation does not equal causation. Just because two variables are statistically associated doesn't mean that one causes the other. There could be other factors at play or the relationship could be purely coincidental. For example, a study might find a correlation between ice cream sales and drowning deaths. While this correlation is statistically significant, it doesn't mean that eating ice cream causes drowning. The underlying factor is likely the hot weather, which leads to increased ice cream consumption and more people swimming.
The Importance of Context:
The chapter emphasizes the importance of considering the context and magnitude of the effect when interpreting statistical significance. It's crucial to ask:
Is the effect size large enough to be meaningful in the real world?
Are there other factors that could be influencing the results?
Is the statistically significant finding practically significant?
By understanding the limitations of statistical significance and considering the broader context, readers can avoid being misled by seemingly impressive but ultimately meaningless results.