Statistics - Correlation does not imply causation

Card Puncher Data Processing


Correlation does not imply causation

Studies have shown that people who have more birthdays live longer.



Eating ice cream

In the late 1940s, public health experts recommended that people stop eating ice cream as part of an anti-polio diet. It turned out however that there was only a correlation between polio incidence and ice-cream consumption, because outbreaks were most common in the summer.

Seven Countries Study

wiki/Seven Countries Study shows a correlation between the fat calories consumed and deaths per 1,000 from degenerative heart disease.

The experiment failed to consider other factors that might cause deaths from degenerative heart disease.

It appears in this study that there's a correlation between per capital annual sugar consumption in pounds, and deaths per 1,000 of degenerative heart disease.

Including additional data or factors would have likely led Ancel Keys to a different conclusion.

Facebook vs princeton

Debunking Princeton “we are even more concerned about the fate of the planet, because Google Trends for 'air' have also been climbing steadily, and our projection show that by the year 2060, there we know air left at all.”

Google Search Trend Air

vs Princeton: Epidemiological modeling of online social network dynamic (John Cannarella, Joshua A. Spechler)

Discover More
P Value Pipeline
Data Mining - (Life cycle|Project|Data Pipeline)

Data mining is an experimental science. Data mining reveals correlation, not causation. With good data, you will make good algorithm. The most preferable solution is then to work on good features....
Thomas Bayes
Data Science - History

A brief history of data analysis Fisher proposed a design of experiments along with his statistical tests ANOVA, and Fisher's exact tests. He's also credited with the quotation, “Correlation does...
Thomas Bayes
Statistics - Causation - Causality (Cause and Effect) Relationship

Cause and Effect Relationship. Nothing beats a simple, elegant, controlled, randomized experiment if you want to make strong claims causality. Causal inference is a difficult and slippery topic, which...
Statistics - Correlation (Coefficient analysis)

Correlation is a statistical analysis used to measure and describe the relationship betweentwo variables. The Correlations coefficient is a statistic and it can range between +1 and -1 +1 is a perfect...
Time Serie - Correlation

Two time series that are not related can have a strong, but spurious, correlation when a trend is added. This strong correlation is just the fact that they're both dependent on the time (X) They both...

Share this page:
Follow us:
Task Runner