about · email me · subscribe

spurious correlations

correlation is not causation

discover · random · spurious scholar
← previous page · next page →

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is UFO sightings in Maryland and the second variable is Liquefied petroleum gas used in Thailand.  The chart goes from 1980 to 2021, and the two variables track closely in value over that time. Small Image
View details about correlation #1,115


Unidentified Frying Objects: A Close Encounter of the LPG Kind
Perhaps the UFOs were emitting some kind of intergalactic energy that inadvertently improved the chemical composition of the LPG during its transit through the Earth's atmosphere. Or maybe the aliens were just big fans of Thai cuisine and their presence somehow enhanced the LPG used in cooking, leading to a tastier Pad Thai for everyone. Who knew that extraterrestrial visitors could have such a sizzling impact on the culinary world!




What else correlates?
UFO sightings in Maryland · all random state specific
Liquefied petroleum gas used in Thailand · all energy

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Popularity of the first name Malika and the second variable is Customer satisfaction with Macy's.  The chart goes from 2005 to 2020, and the two variables track closely in value over that time. Small Image
View details about correlation #2,613


Making Mirthful Metrics: Malika's Moniker and Macy's Merriment
Every time someone with the name Malika walked into Macy's, all the mannequins would mysteriously strike the perfect pose, creating an awe-inspiring atmosphere of fashion perfection. This inadvertently led to a significant boost in customer satisfaction as shoppers couldn't help but be impressed by the runway-ready displays and impeccable clothing arrangements. It was like having a personal stylist for every customer, courtesy of the Malika effect!




What else correlates?
Popularity of the first name Malika · all first names
Customer satisfaction with Macy's · all weird & wacky

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is The distance between Neptune and Uranus and the second variable is SAP SE's stock price (SAP).  The chart goes from 2002 to 2023, and the two variables track closely in value over that time. Small Image
View details about correlation #1,275


Neptunian Nonsense: Navigating the Nexus between Neptune's Nearness and SAP SE Stock Price
As the gap between the two far-off planets widened, it created a ripple effect in the cosmic energy market, leading to an unexpected boost in SAP SE's stock price on Earth's stock exchanges. It's a stellar reminder that even astronomical distances can't eclipse the power of stock market fluctuations!




What else correlates?
The distance between Neptune and Uranus · all planets
SAP SE's stock price (SAP) · all stocks

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Google searches for 'why isnt 11 pronounced onety one' and the second variable is The number of tax collectors in New Mexico.  The chart goes from 2010 to 2022, and the two variables track closely in value over that time. Small Image
View details about correlation #2,209


Pondering Onety-One and Tax Fun: A Correlation in New Mexico Unspun
Because as the search for the onety-grail of numerical pronunciation continued, it inadvertently led people to question all forms of taxing language, creating a taxing demand for those who could audit, refund, or 1040-solve any tax-related issues in New Mexico. The connection between 11 and 'onety one' sparked a wealth of curiosity that multiplied like the interest on a good investment, ultimately adding to the state's revenue in an unexpected and punexpected way.




What else correlates?
Google searches for 'why isnt 11 pronounced onety one' · all google searches
The number of tax collectors in New Mexico · all cccupations

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Popularity of the 'call me maybe' meme and the second variable is Average number of comments on Numberphile YouTube videos.  The chart goes from 2012 to 2023, and the two variables track closely in value over that time. Small Image
View details about correlation #4,850


The Meme and the Metrics: Mapping the Marriage of 'Call Me Maybe' Popularity and the Proliferation of Comments on Numberphile
As the 'call me maybe' meme faded into obscurity, so did the interest in mathematical concepts set to its catchy tune. Without the internet's ongoing obsession with the meme, there were fewer viewers seeking out number-related content, leading to a decline in comments on Numberphile videos. It seems the equation for online engagement had shifted, and the solution just didn't add up without the meme factor.




What else correlates?
Popularity of the 'call me maybe' meme · all memes
Average number of comments on Numberphile YouTube videos · all YouTube

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is The number of movies Amy Poehler appeared in and the second variable is Total Number of Successful Mount Everest Climbs.  The chart goes from 1996 to 2011, and the two variables track closely in value over that time. Small Image
View details about correlation #5,864


Laughing Matters: The Amy Poehler Effect on Everest Ascents
As the number of Amy Poehler movies increased, so did the levels of laughter and positivity in the world. This led to a corresponding rise in the overall happiness of people, including mountaineers. The sheer joy and motivation they received from Amy's performances served as the ultimate catalyst for their successful ascents of Mount Everest. In a way, Amy Poehler's on-screen brilliance became the real-life inspiration that helped climbers reach new heights, quite literally.




What else correlates?
The number of movies Amy Poehler appeared in · all films & actors
Total Number of Successful Mount Everest Climbs · all weird & wacky

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is US kids in public school and the second variable is Fossil fuel use in Grenada.  The chart goes from 1990 to 2021, and the two variables track closely in value over that time. Small Image
View details about correlation #1,047


The Fuel of Knowledge: Connecting the Dots Between U.S. Public School Kids and Fossil Fuel Use in Grenada
As the number of kids in US public schools grew, so did the demand for school buses. The increased production of school buses led to a spike in the consumption of fossil fuels, inadvertently affecting the global market. This phenomenon, now known as the "Diesel-Powered Domino Effect," has left Grenada wondering why their fuel usage is being driven by American education.




What else correlates?
US kids in public school · all education
Fossil fuel use in Grenada · all energy

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Popularity of the first name Stevie and the second variable is Netflix's stock price (NFLX).  The chart goes from 2003 to 2022, and the two variables track closely in value over that time. Small Image
View details about correlation #3,268


Stevie's Stock Surprises: The Silly Saga of Netflix's NFLX
The soothing sound of the name Stevie resulted in a nation-wide decrease in stress levels, leading to more people unwinding with Netflix, and therefore driving up the stock price. Plus, there's a rumor that Stevie Wonder's secret side gig is writing Netflix original series, and everyone's loving them.




What else correlates?
Popularity of the first name Stevie · all first names
Netflix's stock price (NFLX) · all stocks

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is The number of art directors in Arkansas and the second variable is Yearly peak of NYSE composite index.  The chart goes from 2003 to 2012, and the two variables track closely in value over that time. Small Image
View details about correlation #2,581


Art Directing the Stock Market: An Unconventional Connection between Arkansas and the NYSE Composite Index
As the number of art directors in Arkansas increased, so did the demand for their creative services. This led to a spike in advertising and marketing campaigns, which captured the public's attention in innovative ways. As consumer confidence and spending rose, the companies behind these artistic endeavors saw record profits, driving up stock prices and ultimately boosting the NYSE composite index. It's as if the state's newfound artistic flair painted a masterpiece of economic growth, creating a bull market fueled by a blend of creativity and commerce.




What else correlates?
The number of art directors in Arkansas · all cccupations
Yearly peak of NYSE composite index · all stocks

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Average views of SciShow Space YouTube videos and the second variable is Popularity of the 'willy wonka' meme.  The chart goes from 2014 to 2023, and the two variables track closely in value over that time. Small Image
View details about correlation #5,013


The Milky Way: Exploring the Correlation Between SciShow Space YouTube Video Views and the Golden Ticket to 'Willy Wonka' Meme Popularity
It turns out, as people watched fewer space videos, their interest in all things 'wonka' dwindled. It seems there's a hole in the market for space-themed memes, and without those astronomical numbers, the 'willy wonka' meme just couldn't rocket to the same heights of popularity. It's a classic case of when the SciShow Space views go down, the willy wonka meme just doesn't have the same universal appeal. In other words, without the stellar content from SciShow Space, the 'willy wonka' meme couldn't maintain its Milky Way of popularity. It's a meme-orable connection, for sure!




What else correlates?
Average views of SciShow Space YouTube videos · all YouTube
Popularity of the 'willy wonka' meme · all memes

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is GMO use in corn grown in Minnesota and the second variable is Pirate attacks globally.  The chart goes from 2009 to 2022, and the two variables track closely in value over that time. Small Image
View details about correlation #2,052


Grain and Rum: Unearthing the Nexus Between GMO Corn Cultivation in Minnesota and Global Pirate Raids
As GMO use in Minnesota decreased, the corn stalks started resembling traditional masts, confusing the pirates. This led to a global decrease in pirate attacks as they couldn't distinguish between actual ships and the corn fields.




What else correlates?
GMO use in corn grown in Minnesota · all food
Pirate attacks globally · all weird & wacky

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Google searches for 'how much wood can a woodchuck chuck' and the second variable is Kerosene used in Venezuela.  The chart goes from 2004 to 2021, and the two variables track closely in value over that time. Small Image
View details about correlation #4,245


How Much Kerosene for a Woodchuck Spleen: The Correlation Between Google Searches for 'How Much Wood Can a Woodchuck Chuck' and Kerosene Usage in Venezuela
As the woodchucks realized the futility of their chucking efforts, they shifted their focus to alternative energy sources, leading to a surplus of kerosene in Venezuela. One-liner: It's a case of supply and demand being influenced by woodchuck contemplation!




What else correlates?
Google searches for 'how much wood can a woodchuck chuck' · all google searches
Kerosene used in Venezuela · all energy

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is The number of movies Tom Hanks appeared in and the second variable is The number of special education teachers in Georgia.  The chart goes from 2012 to 2022, and the two variables track closely in value over that time. Small Image
View details about correlation #5,857


Forrest Grump: The Impact of Tom Hanks' Movie Appearances on Special Education Teachers in Georgia
As the number of Tom Hanks movies climbs, more people are inspired to pursue careers in the film industry. This leads to a higher demand for special effects and production crew members. With Georgia being a popular filming location, the need for specialized education in these fields also grows, prompting an increase in special education teachers to meet the unique learning needs of future film professionals.




What else correlates?
The number of movies Tom Hanks appeared in · all films & actors
The number of special education teachers in Georgia · all cccupations

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Popularity of the first name Katherine and the second variable is Burglaries in Hawaii.  The chart goes from 1985 to 2022, and the two variables track closely in value over that time. Small Image
View details about correlation #3,951


Breaking and Entering Katherine: An Unconventional Connection Between Name Popularity and Burglaries in Hawaii
As the saying goes, "Kat's out of the bag," and it seems that also applies to burglars in Hawaii! With fewer Katherines around, there were less Kat burglars trying to pull off heists in the sunny state. It appears that the name Katherine was previously a common alias for cat burglars with a penchant for pilfering pineapples. However, with this name falling out of favor, it seems the purr-petrators have also disappeared, leading to a decrease in burglaries. It's a feline mystery, but it looks like Hawaii can rest easy knowing that the Katherine connection has been pawsitively purr-vented!




What else correlates?
Popularity of the first name Katherine · all first names
Burglaries in Hawaii · all random state specific

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Master's degrees awarded in Education and the second variable is US bank failures.  The chart goes from 2012 to 2021, and the two variables track closely in value over that time. Small Image
View details about correlation #2,718


Masters in Money Matters: Mapping the Marvelous Mistake in the Marriage of Education and Economic Meltdown
As the number of Master's degrees awarded in Education decreased, there were fewer people able to comprehend the concept of "fractional reserve banking." This led to a decrease in risky financial practices and ultimately contributed to a lower rate of US bank failures. After all, you can't spell "financial stability" without "STEM education" - or so the bankers now realize!




What else correlates?
Master's degrees awarded in Education · all education
US bank failures · all weird & wacky

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is American cheese consumption and the second variable is Alphabet's stock price (GOOGL).  The chart goes from 2005 to 2021, and the two variables track closely in value over that time. Small Image
View details about correlation #5,884


The Big Cheese Squeeze: How American Cheese Consumption Swings Alphabet's Stock Price
As American cheese consumption melted, so did the hearts of investors, causing a ripple effect in the stock market. As more people savored the idea of a cheesy investment, the demand for Alphabet's stock increased, leading to a gouda rise in their stock price. It seems like the secret to their success was simply to brie-lieve in the power of dairy deliciousness!




What else correlates?
American cheese consumption · all food
Alphabet's stock price (GOOGL) · all stocks

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Air pollution in Albuquerque and the second variable is Google searches for 'who is prince william'.  The chart goes from 2008 to 2023, and the two variables track closely in value over that time. Small Image
View details about correlation #2,863


A Breath of Fresh Heir: Investigating the Relationship Between Air Pollution in Albuquerque and Google Searches for 'Who is Prince William'
As air pollution in Albuquerque increased, more people sought refuge indoors. With limited entertainment options, they turned to browsing the internet. This led to an uptick in searches for 'who is prince william' as a way to pass the time. This is a prime example of how environmental factors can directly impact our curiosity about the British royal family.




What else correlates?
Air pollution in Albuquerque · all weather
Google searches for 'who is prince william' · all google searches

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is UFO sightings in New Mexico and the second variable is Patents granted in the US.  The chart goes from 1975 to 2020, and the two variables track closely in value over that time. Small Image
View details about correlation #3,099


The New Mexico Extraterrestrial Effect: Unveiling The Otherworldly Influence on Patent Granting in the United States
The influx of intergalactic inspiration in New Mexico led to an explosion of creative scientific ideas, spurring a wave of groundbreaking inventions and technological advancements. It turns out, alien innovation is the real secret behind some of our greatest patented achievements!




What else correlates?
UFO sightings in New Mexico · all random state specific
Patents granted in the US · all weird & wacky

next page →

← Previous page
Discover a new correlation
View random correlation
View all correlations
View all research papers
Get permission to re-use these charts

Why this works

  1. Data dredging: I have 25,237 variables in my database. I compare all these variables against each other to find ones that randomly match up. That's 636,906,169 correlation calculations! This is called “data dredging.” Fun fact: the chart used on the wikipedia page to demonstrate data dredging is also from me. I've been being naughty with data since 2014.
    Instead of starting with a hypothesis and testing it, I instead tossed a bunch of data in a blender to see what correlations would shake out. It’s a dangerous way to go about analysis, because any sufficiently large dataset will yield strong correlations completely at random.
  2. Lack of causal connection: There is probably no direct connection between these variables, despite what the AI says above. Because these pages are automatically generated, it's possible that the two variables you are viewing are in fact causually related. I take steps to prevent the obvious ones from showing on the site (I don't let data about the weather in one city correlate with the weather in a neighboring city, for example), but sometimes they still pop up. If they are related, cool! You found a loophole.
    This is exacerbated by the fact that I used "Years" as the base variable. Lots of things happen in a year that are not related to each other! Most studies would use something like "one person" in stead of "one year" to be the "thing" studied.
  3. Observations not independent: For many variables, sequential years are not independent of each other. You will often see trend-lines form. If a population of people is continuously doing something every day, there is no reason to think they would suddenly change how they are doing that thing on January 1. A naive p-value calculation does not take this into account. You will calculate a lower chance of "randomly" achieving the result than represents reality.

    To be more specific: p-value tests are probability values, where you are calculating the probability of achieving a result at least as extreme as you found completely by chance. When calculating a p-value, you need to assert how many "degrees of freedom" your variable has. I count each year (minus one) as a "degree of freedom," but this is misleading for continuous variables.

    This kind of thing can creep up on you pretty easily when using p-values, which is why it's best to take it as "one of many" inputs that help you assess the results of your analysis.
  4. Y-axes doesn't start at zero: I truncated the Y-axes of the graphs above. I also used a line graph, which makes the visual connection stand out more than it deserves. Nothing against line graphs. They are great at telling a story when you have linear data! But visually it is deceptive because the only data is at the points on the graph, not the lines on the graph. In between each point, the data could have been doing anything. Like going for a random walk by itself!
    Mathematically what I showed is true, but it is intentionally misleading. If you click on any of the charts that abuse this, you can scroll down to see a version that starts at zero.
  5. Confounding variable: Confounding variables (like global pandemics) will cause two variables to look connected when in fact a "sneaky third" variable is influencing both of them behind the scenes.
  6. Outliers: Some datasets here have outliers which drag up the correlation. In concept, "outlier" just means "way different than the rest of your dataset." When calculating a correlation like this, they are particularly impactful because a single outlier can substantially increase your correlation.

    Because this page is automatically generated, I don't know whether any of the charts displayed on it have outliers. I'm just a footnote. ¯\_(ツ)_/¯
    I intentionally mishandeled outliers, which makes the correlation look extra strong.
  7. Low n: There are not many data points included in some of these charts. You can do analyses with low ns! But you shouldn't data dredge with a low n.
    Even if the p-value is high, we should be suspicious of using so few datapoints in a correlation.


Pro-tip: click on any correlation to see:

Project by Tyler Vigen
emailme@tylervigen.com · about · subscribe


CC BY 4.0