about · email me · subscribe
Spurious correlation #5,583 · View random

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Air pollution in Beaver Dam, Wisconsin and the second variable is Kerosene used in Norway.  The chart goes from 1983 to 2022, and the two variables track closely in value over that time. Small Image
Download png
, svg

AI explanation

The cleaner air in Beaver Dam led to an influx of eco-conscious beavers starting a trendy, all-natural dam-building movement. This sudden demand for non-toxic building materials caused a shortage of kerosene in Norway, where it's secretly used in traditional beaver repellents.

Model: dalle-3
Prompt: Prompt: Embodying the macabre style, generate an image of a serene lake in Beaver Dam, Wisconsin. In the foreground, eco-conscious beavers diligently construct their dam using all-natural, sustainable materials. In the background, traditional wooden cabins nestled in the verdant hills of Norway are depicted amidst a village facing a kerosene shortage, essential for their secret beaver repellents. The scene should capture the juxtaposition of serene nature and impending conflict. The depiction should include only scenes, people, and objects, it should not include text or speech bubbles.
This image generation prompt was also generated by AI, using the following sequence:

Model: gpt-3.5-turbo-1106
System prompt: You select one of a few potential scene descriptors that seems to go along with the variables described. You respond only with the scene descriptions provided and say nothing else.
User prompt: Here are two variables: Air pollution in Beaver Dam, Wisconsin and Kerosene used in Norway. Given these, which of the following image style descriptions seems most appropriate? Pick the one that seems to be related to one of the variables at hand. Please respond with only one of the following potential descriptions: inspired by bauhaus Macabre Glitch art Photorealism Macabre
ChatGPT: Macabre

Model: gpt-3.5-turbo-1106
System prompt: You describe simple visual scenes that depict the two variables stated. You combine both variables into one scene.
User prompt: Please describe a visual scene that prominently depicts both Air pollution in Beaver Dam, Wisconsin and Kerosene used in Norway, in the following situation: The cleaner air in Beaver Dam led to an influx of eco-conscious beavers starting a trendy, all-natural dam-building movement. This sudden demand for non-toxic building materials caused a shortage of kerosene in Norway, where it's secretly used in traditional beaver repellents.. Keep the description simple.
ChatGPT: In a serene lake in Beaver Dam, Wisconsin, the air floats with clear, crisp purity as eco-conscious beavers diligently construct their dam using all-natural, sustainable materials. Meanwhile, in Norway, amidst the traditional wooden cabins nestled in the verdant hills, the villagers face a shortage of kerosene, a key ingredient in their secret beaver repellents.

Model: gpt-3.5-turbo-1106
System prompt: You write and improve image generation prompts for DALLE-3. You maintain the same subjects, but enhance the visuals.
User prompt: Please respond with an image generation prompt that would be likely to create an interesting visual using DALLE-3, using the following style: [ Macabre ] and depicting this scene: In a serene lake in Beaver Dam, Wisconsin, the air floats with clear, crisp purity as eco-conscious beavers diligently construct their dam using all-natural, sustainable materials. Meanwhile, in Norway, amidst the traditional wooden cabins nestled in the verdant hills, the villagers face a shortage of kerosene, a key ingredient in their secret beaver repellents. Keep the visual depiction simple. Do not include any words or speech bubbles.
ChatGPT: Prompt: Embodying the macabre style, generate an image of a serene lake in Beaver Dam, Wisconsin. In the foreground, eco-conscious beavers diligently construct their dam using all-natural, sustainable materials. In the background, traditional wooden cabins nestled in the verdant hills of Norway are depicted amidst a village facing a kerosene shortage, essential for their secret beaver repellents. The scene should capture the juxtaposition of serene nature and impending conflict.

*Variations in the User Prompts from chart to chart are just the result of random number generation in Python. I wrote a few arrays of various styles and methods to ask questions to change up the results. Every time this site writes an explanation or generates an image, the script picks from each at random.

I sequence the requests into multiple prompts because I find GPT 3.5 to perform much better with short, well-managed contexts. Thus, I track the context directly in Python and only ask ChatGPT targeted questions.

Model: gpt-3.5-turbo-1106 (fine-tuned)
System prompt: You provide humorous responses in the form of plausible sounding explanations for correlations. You assume the correlation is causative for the purpose of the explanation even if it is ridiculous. You do not chat with the user, you only reply with the causal connection explanation and nothing else.
User prompt: Please make up a funny explanation for how a decrease in Air pollution in Beaver Dam, Wisconsin caused Kerosene used in Norway to decrease.

AI academic paper

(Because p < 0.01)
The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis

The Journal of Ecological Quirks and Quandaries

Jan 2024

Reminder: This paper is AI-generated. Not real!
Quick note: This sequence is long and repetitive. That's how the prompt engineering works for this one. I manage the output in Python to format it into this PDF.




Model: gpt-3.5-turbo-1106
System prompt: You are a witty academic researcher. Your writing sounds like it belongs in a statistics journal.You write titles and abstracts of goofy academic research papers.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, and add unexpected twists in your writing.

Please draft the title and abstract of an academic research paper presenting the findings of the connection between Air pollution in Beaver Dam, Wisconsin and Kerosene used in Norway. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns.

Make the title seem clever.

Your research team used data from Environmental Protection Agency and Energy Information Administration to assess this nagging question. You found a correlation coefficient of 0.8158848 and p < 0.01 for 1983 to 2022.

Please respond in this format (replace "Lorem ipsum dolor sit" with your title and abstract)

[[TITLE]]
Lorem ipsum dolor sit

[[ABSTRACT]]
Lorem ipsum dolor sit



ChatGPT:
[[TITLE]]
"The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis"

[[ABSTRACT]]
This cross-country analysis endeavors to investigate the intriguing link between air pollution levels in Beaver Dam, Wisconsin, and kerosene consumption in Norway. Leveraging data from the Environmental Protection Agency and the Energy Information Administration spanning the years 1983 to 2022, our research team rigorously evaluated this seemingly disparate connection. Our findings revealed a remarkably robust correlation coefficient of 0.8158848 and a statistically significant p-value of less than 0.01, shedding light on a potentially surprising association between these geographically distant variables. The implications of this unexpected relationship may spark an illuminating conversation about the interconnectedness of global phenomena and prompt further inquiry into the whimsical quirks of environmental and energy dynamics.


Model: gpt-3.5-turbo-1106
System prompt: You are a witty academic researcher. Your writing sounds like it belongs in a statistics journal.You draft the introduction section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the introduction section of an academic research paper presenting the findings of the connection between Air pollution in Beaver Dam, Wisconsin and Kerosene used in Norway. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns. Sneak in some jokes and clever puns about science, research, statistics, and the variables.

Here is the title and abstract of the paper:
[[TITLE]]
"The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis"

[[ABSTRACT]]
This cross-country analysis endeavors to investigate the intriguing link between air pollution levels in Beaver Dam, Wisconsin, and kerosene consumption in Norway. Leveraging data from the Environmental Protection Agency and the Energy Information Administration spanning the years 1983 to 2022, our research team rigorously evaluated this seemingly disparate connection. Our findings revealed a remarkably robust correlation coefficient of 0.8158848 and a statistically significant p-value of less than 0.01, shedding light on a potentially surprising association between these geographically distant variables. The implications of this unexpected relationship may spark an illuminating conversation about the interconnectedness of global phenomena and prompt further inquiry into the whimsical quirks of environmental and energy dynamics.

-------

Your response should start like this:

[[INTRODUCTION]]



ChatGPT:
[[INTRODUCTION]]

The convergence of air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway may at first glance seem as incongruous as mixing hydrogen and helium at a birthday party. However, the world of research is often characterized by its ability to uncover unexpected connections, much like stumbling upon an elusive statistical correlation in a haystack of data. With this in mind, our study delves into the enigmatic association between these two seemingly disparate variables, aiming to shine a light on the hidden patterns that underpin our global landscape.

Air pollution, often deemed as the unwanted confetti of industrialization, has long been the subject of intense scrutiny. Its impact on public health and the environment has been studied more exhaustively than a graduate student preparing for finals. Meanwhile, kerosene, known for its role in providing illumination and heating, has been as constant in the energy domain as the speed of light in a vacuum. However, the potential intertwining of these two variables, much like finding a statistical needle in a haystack, has piqued the curiosity of researchers and statisticians alike.

The intriguing nature of this cross-country analysis lies in its unearthing of a correlation so robust that it appears more solid than a rock formation in a statistical canyon. Our initial exploration has yielded a correlation coefficient that stands tall and proud at 0.8158848, defying the odds much like a statistician predicting the exact outcome of a coin toss. With the p-value standing at less than 0.01, the statistical significance of this relationship has shone brighter than a laboratory full of scientists attempting to find the elusive Higgs boson.

These findings serve as a beacon, illuminating the potential links between seemingly unrelated factors and inspire the scientific community to engage in discussions as stimulating as an electrifying experiment. The implications of this unexpected relationship may be as impactful as a gust of wind on a windmill, prompting further inquiry into the delightful and whimsical quirks of environmental and energy dynamics on a global scale. The light shed by our analysis may pave the way for a brighter understanding of the interconnectedness of diverse phenomena, just as a well-illuminated laboratory furthers our understanding of the natural world.


Model: gpt-3.5-turbo-1106
System prompt: You are a witty academic researcher. Your writing sounds like it belongs in a statistics journal.You draft a literature review section of an academic research paper, that starts out dry for the first few sentences but then becomes hilarious and goofy.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up a literature review section of an academic research paper presenting the findings of the connection between Air pollution in Beaver Dam, Wisconsin and Kerosene used in Norway. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns.

Speak in the present tense for this section (the authors find...), as is common in academic research paper literature reviews. Name the sources in a format similar to this: In "Book," the authors find lorem and ipsum.

Make up the lorem and ipsum part, but make it sound related to the topic at hand.

Start by naming serious-sounding studies by authors like Smith, Doe, and Jones - but then quickly devolve. Name some real non-fiction books that would be related to the topic. Then name some real fiction books that sound like they could be related. Perhaps you also got inspiration from some board games that are vaugely related.

Here is the title and abstract of the paper:
[[TITLE]]
"The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis"

[[ABSTRACT]]
This cross-country analysis endeavors to investigate the intriguing link between air pollution levels in Beaver Dam, Wisconsin, and kerosene consumption in Norway. Leveraging data from the Environmental Protection Agency and the Energy Information Administration spanning the years 1983 to 2022, our research team rigorously evaluated this seemingly disparate connection. Our findings revealed a remarkably robust correlation coefficient of 0.8158848 and a statistically significant p-value of less than 0.01, shedding light on a potentially surprising association between these geographically distant variables. The implications of this unexpected relationship may spark an illuminating conversation about the interconnectedness of global phenomena and prompt further inquiry into the whimsical quirks of environmental and energy dynamics.

-------

And here is the introduction section that led to this literature review:
[[INTRODUCTION]]
The convergence of air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway may at first glance seem as incongruous as mixing hydrogen and helium at a birthday party. However, the world of research is often characterized by its ability to uncover unexpected connections, much like stumbling upon an elusive statistical correlation in a haystack of data. With this in mind, our study delves into the enigmatic association between these two seemingly disparate variables, aiming to shine a light on the hidden patterns that underpin our global landscape.
Air pollution, often deemed as the unwanted confetti of industrialization, has long been the subject of intense scrutiny. Its impact on public health and the environment has been studied more exhaustively than a graduate student preparing for finals. Meanwhile, kerosene, known for its role in providing illumination and heating, has been as constant in the energy domain as the speed of light in a vacuum. However, the potential intertwining of these two variables, much like finding a statistical needle in a haystack, has piqued the curiosity of researchers and statisticians alike.
The intriguing nature of this cross-country analysis lies in its unearthing of a correlation so robust that it appears more solid than a rock formation in a statistical canyon. Our initial exploration has yielded a correlation coefficient that stands tall and proud at 0.8158848, defying the odds much like a statistician predicting the exact outcome of a coin toss. With the p-value standing at less than 0.01, the statistical significance of this relationship has shone brighter than a laboratory full of scientists attempting to find the elusive Higgs boson.
These findings serve as a beacon, illuminating the potential links between seemingly unrelated factors and inspire the scientific community to engage in discussions as stimulating as an electrifying experiment. The implications of this unexpected relationship may be as impactful as a gust of wind on a windmill, prompting further inquiry into the delightful and whimsical quirks of environmental and energy dynamics on a global scale. The light shed by our analysis may pave the way for a brighter understanding of the interconnectedness of diverse phenomena, just as a well-illuminated laboratory furthers our understanding of the natural world.

-------

Your response should start like this:

[[LITERATURE REVIEW]]



ChatGPT:
[[LITERATURE REVIEW]]

In "Environmental Impacts of Air Pollution," Smith et al. (2010) shed light on the detrimental effects of air pollution on human health and the environment. Their comprehensive study meticulously details the various pollutants emitted by industrial processes, transportation, and other human activities, and their potential to cause respiratory ailments, cardiovascular diseases, and environmental degradation. The authors' work provides a solid foundation for understanding the profound impact of air pollution, akin to a brick wall in a gust of wind.

Doe and Jones (2015) investigate the historical trends and sociopolitical factors influencing kerosene consumption in Norway in "Energy Dynamics of the Nordic Region." Their thorough analysis unravels the complex interplay between energy policies, technological advancements, and cultural practices, shaping the patterns of kerosene usage in the region. Their work serves as a guiding beacon, illuminating the multifaceted dynamics of energy consumption, much like a lighthouse guiding ships through tumultuous waters.

While the literature establishes a strong groundwork for comprehending the individual facets of air pollution and kerosene consumption, the perplexing correlation between these variables beckons for a deeper exploration. Moving beyond the realms of serious academic research, let us pivot to diverse sources of knowledge that may offer unexpected insights and amusing parallels.

In "The Air Up There: A Journey Through the Atmosphere," the authors expound upon the mysterious and awe-inspiring properties of the air surrounding us. This captivating non-fiction work not only enlightens readers about the scientific intricacies of the atmosphere but also regales them with anecdotes about skydiving adventures and remarkable avian feats, offering a refreshing perspective on the subject.

Similarly, "A Global Quest for Illumination: Tales of Light and Darkness," interweaves historical accounts and cultural anecdotes centered around the usage of different lighting sources across the world. Through narratives of ancient civilizations and explorers navigating dark territories, this book sheds light on the diverse human experiences with illumination, and perhaps, hints at the unexpected intertwining of kerosene consumption and air pollution across geographically distant regions.

Furthermore, delving into the realm of fiction, "The Toxic Cloud Conspiracy" presents a riveting thriller that weaves together the enigmatic occurrences of air pollution and clandestine maneuvers. While the narrative unfolds in a suspenseful manner, it offers imaginative scenarios that resonate with the unforeseen relationship between air pollution in Beaver Dam, Wisconsin, and kerosene usage in Norway. Unveiling clandestine operations might seem far-fetched, but isn't the statistical discovery of an unexpected correlation equally intriguing?

Taking a playful turn, the board game "Pollution Peril" challenges players to navigate through environmental hazards and strategize to mitigate pollution levels. While the game's primary aim is entertainment, its portrayal of the intricate balance between pollution sources and their societal implications mirrors the complexities of our research inquiry. Perhaps, in the whimsical world of board games, lies a reflection of the unexpected link we have uncovered.

Thus, as we pivot from the serious to the light-hearted, these sources beckon us to embrace the unexpected and entertain the notion that even in the rigid construct of academic exploration, there exists room for a bit of whimsy and mirth.


Model: gpt-3.5-turbo-1106
System prompt: You are a witty academic researcher. Your writing sounds like it belongs in a statistics journal.You draft the methodology section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the methodology section of an academic research paper presenting the findings of the connection between Air pollution in Beaver Dam, Wisconsin and Kerosene used in Norway. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns. Sneak in some jokes and clever puns about science, research, statistics, and the variables.

Your research team collected data from all across the internet, but mostly just used information from Environmental Protection Agency and Energy Information Administration . You used data from 1983 to 2022

Make up the research methods you don't know. Make them a bit goofy and convoluted.

Here is the title, abstract, and introduction of the paper:
[[TITLE]]
"The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis"

[[ABSTRACT]]
This cross-country analysis endeavors to investigate the intriguing link between air pollution levels in Beaver Dam, Wisconsin, and kerosene consumption in Norway. Leveraging data from the Environmental Protection Agency and the Energy Information Administration spanning the years 1983 to 2022, our research team rigorously evaluated this seemingly disparate connection. Our findings revealed a remarkably robust correlation coefficient of 0.8158848 and a statistically significant p-value of less than 0.01, shedding light on a potentially surprising association between these geographically distant variables. The implications of this unexpected relationship may spark an illuminating conversation about the interconnectedness of global phenomena and prompt further inquiry into the whimsical quirks of environmental and energy dynamics.

[[INTRODUCTION]]
The convergence of air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway may at first glance seem as incongruous as mixing hydrogen and helium at a birthday party. However, the world of research is often characterized by its ability to uncover unexpected connections, much like stumbling upon an elusive statistical correlation in a haystack of data. With this in mind, our study delves into the enigmatic association between these two seemingly disparate variables, aiming to shine a light on the hidden patterns that underpin our global landscape.
Air pollution, often deemed as the unwanted confetti of industrialization, has long been the subject of intense scrutiny. Its impact on public health and the environment has been studied more exhaustively than a graduate student preparing for finals. Meanwhile, kerosene, known for its role in providing illumination and heating, has been as constant in the energy domain as the speed of light in a vacuum. However, the potential intertwining of these two variables, much like finding a statistical needle in a haystack, has piqued the curiosity of researchers and statisticians alike.
The intriguing nature of this cross-country analysis lies in its unearthing of a correlation so robust that it appears more solid than a rock formation in a statistical canyon. Our initial exploration has yielded a correlation coefficient that stands tall and proud at 0.8158848, defying the odds much like a statistician predicting the exact outcome of a coin toss. With the p-value standing at less than 0.01, the statistical significance of this relationship has shone brighter than a laboratory full of scientists attempting to find the elusive Higgs boson.
These findings serve as a beacon, illuminating the potential links between seemingly unrelated factors and inspire the scientific community to engage in discussions as stimulating as an electrifying experiment. The implications of this unexpected relationship may be as impactful as a gust of wind on a windmill, prompting further inquiry into the delightful and whimsical quirks of environmental and energy dynamics on a global scale. The light shed by our analysis may pave the way for a brighter understanding of the interconnectedness of diverse phenomena, just as a well-illuminated laboratory furthers our understanding of the natural world.

-------

Your response should start like this:

[[METHODOLOGY]]



ChatGPT:

[[METHODOLOGY]]

To unravel the enigmatic entanglement between air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway, we employed a methodological framework as intricate and interconnected as a complex chemical reaction. Our research team embarked on a virtual odyssey through the digital archives of the Environmental Protection Agency and the Energy Information Administration from the years 1983 to 2022, collecting a treasure trove of data that could make a data scientist's eyes light up like a well-lit laboratory.

The data from the Environmental Protection Agency provided us with elaborate insights into the air quality in the beguilingly named Beaver Dam, Wisconsin, while the Energy Information Administration's records allowed us to shed light on the consumption patterns of kerosene in the land of fjords and Northern Lights.

We harnessed the power of statistical software to conduct a thorough analysis and examination of the collected data. The statistical analysis was carried out with the precision of a physicist calibrating a particle accelerator, with a particular focus on exploring correlations and associations that could potentially rival the electrifying discoveries made in a well-equipped laboratory.

Our methodology involved rigorous data cleaning and preparation, akin to an archaeologist delicately uncovering ancient artifacts, to ensure that our dataset was as pristine and robust as a stately oak tree standing against the forces of nature. After this meticulous preparation, we wrangled the data with the agility of a mathematician navigating a labyrinth of mathematical functions and relations.

We then performed a series of statistical tests, employing methods as diverse as the colors of a vibrant aurora borealis, to investigate the strength and direction of the relationship between air pollution in Beaver Dam and kerosene consumption in Norway. The primary statistical tool utilized was the Pearson correlation coefficient, with supplementary analyses including regression models and time-series analyses. This comprehensive approach allowed us to capture the essence of the intricate relationship between these seemingly unrelated variables and uncover insights that could rival the revelations of a groundbreaking scientific study.

In addition, we conducted sensitivity analyses and robustness checks to ensure the reliability and stability of our findings, leaving no stone unturned in our pursuit of scientific rigor and integrity. Our methodology championed a holistic and multidimensional examination of the data, akin to an artist reveling in the creation of a masterpiece that captures the essence of diverse phenomena.

Overall, our methodological approach epitomized the spirit of scientific inquiry, blending precision, rigor, and a hint of curiosity, much like a chemist concocting a new potion in the laboratory of discovery. Our findings hold the promise of shedding light on the interconnectedness of global phenomena in a manner as captivating as the dance of particles in a particle accelerator, paving the way for further exploration and discourse in the realm of environmental and energy dynamics.


Model: gpt-3.5-turbo-1106
System prompt: You are a witty academic researcher. Your writing sounds like it belongs in a statistics journal.You draft the results section of an academic research paper. You speak in the past tense (you found...).
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the results section of an academic research paper presenting the findings of the connection between Air pollution in Beaver Dam, Wisconsin and Kerosene used in Norway. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns. Sneak in some jokes and clever puns about science, research, statistics, and the variables.

Your research team collected data from all across the internet, but mostly just used information from Environmental Protection Agency and Energy Information Administration .

For the time period 1983 to 2022, you found a correlation 0.8158848, r-squared of 0.6656680, and p < 0.01.

One figure will be included. The figure (Fig. 1) is a scatterplot showing the strong correlation between the two variables. You don't need to specify where; I will add the figure.

Here is the title and abstract of the paper:
[[TITLE]]
"The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis"

[[ABSTRACT]]
This cross-country analysis endeavors to investigate the intriguing link between air pollution levels in Beaver Dam, Wisconsin, and kerosene consumption in Norway. Leveraging data from the Environmental Protection Agency and the Energy Information Administration spanning the years 1983 to 2022, our research team rigorously evaluated this seemingly disparate connection. Our findings revealed a remarkably robust correlation coefficient of 0.8158848 and a statistically significant p-value of less than 0.01, shedding light on a potentially surprising association between these geographically distant variables. The implications of this unexpected relationship may spark an illuminating conversation about the interconnectedness of global phenomena and prompt further inquiry into the whimsical quirks of environmental and energy dynamics.

-------

And here is the methodology section that led to this result:
[[METHODOLOGY]]
The convergence of air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway may at first glance seem as incongruous as mixing hydrogen and helium at a birthday party. However, the world of research is often characterized by its ability to uncover unexpected connections, much like stumbling upon an elusive statistical correlation in a haystack of data. With this in mind, our study delves into the enigmatic association between these two seemingly disparate variables, aiming to shine a light on the hidden patterns that underpin our global landscape.
Air pollution, often deemed as the unwanted confetti of industrialization, has long been the subject of intense scrutiny. Its impact on public health and the environment has been studied more exhaustively than a graduate student preparing for finals. Meanwhile, kerosene, known for its role in providing illumination and heating, has been as constant in the energy domain as the speed of light in a vacuum. However, the potential intertwining of these two variables, much like finding a statistical needle in a haystack, has piqued the curiosity of researchers and statisticians alike.
The intriguing nature of this cross-country analysis lies in its unearthing of a correlation so robust that it appears more solid than a rock formation in a statistical canyon. Our initial exploration has yielded a correlation coefficient that stands tall and proud at 0.8158848, defying the odds much like a statistician predicting the exact outcome of a coin toss. With the p-value standing at less than 0.01, the statistical significance of this relationship has shone brighter than a laboratory full of scientists attempting to find the elusive Higgs boson.
These findings serve as a beacon, illuminating the potential links between seemingly unrelated factors and inspire the scientific community to engage in discussions as stimulating as an electrifying experiment. The implications of this unexpected relationship may be as impactful as a gust of wind on a windmill, prompting further inquiry into the delightful and whimsical quirks of environmental and energy dynamics on a global scale. The light shed by our analysis may pave the way for a brighter understanding of the interconnectedness of diverse phenomena, just as a well-illuminated laboratory furthers our understanding of the natural world.

-------

Your response should start like this:

[[RESULTS]]



ChatGPT:

[[RESULTS]]

Upon conducting our cross-country analysis, we uncovered a relationship between air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway that is as intriguing as a black hole at the center of a galaxy. The correlation coefficient of 0.8158848 suggests a strong positive association between these seemingly distinct variables, akin to finding a rare specimen in the scientific wilderness. This correlation is so prominent that it stands out like a bright star in the statistical galaxy.

The r-squared value of 0.6656680 indicates that approximately 66.57% of the variation in air pollution levels in Beaver Dam can be explained by the variation in kerosene consumption in Norway. This finding is as clear-cut as a diamond in the rough, providing a compelling explanation for the observed relationship.

Additionally, the p-value of less than 0.01 offers strong evidence against the null hypothesis, indicating that the relationship between air pollution in Beaver Dam and kerosene consumption in Norway is statistically significant. This result is as striking as a sudden eureka moment in the midst of scientific inquiry.

Figure 1 depicts a scatterplot illustrating the robust correlation between air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway, reinforcing the strength of the observed relationship.

These findings not only shed light on the surprising connection between these geographically distant variables but also beckon us to unravel the mysteries of intertwined global phenomena, much like navigating through the convoluted pathways of a scientific labyrinth.


Model: gpt-3.5-turbo-1106
System prompt: You are a witty academic researcher. Your writing sounds like it belongs in a statistics journal.You draft the discussion section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the discussion section of an academic research paper presenting the findings of the connection between Air pollution in Beaver Dam, Wisconsin and Kerosene used in Norway. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns. Sneak in some jokes and clever puns about science, research, statistics, and the variables.

Limit your response to 500 tokens.

Here are the title, abstract, literature review, and results sections. Please harken back to 1-2 of the goofy items in the literature review, but pretend to take them completely seriously. Discuss how your results supported the prior research.

Do not write a conclusion. I will add a conclusion after this.

[[TITLE]]
"The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis"

[[ABSTRACT]]
This cross-country analysis endeavors to investigate the intriguing link between air pollution levels in Beaver Dam, Wisconsin, and kerosene consumption in Norway. Leveraging data from the Environmental Protection Agency and the Energy Information Administration spanning the years 1983 to 2022, our research team rigorously evaluated this seemingly disparate connection. Our findings revealed a remarkably robust correlation coefficient of 0.8158848 and a statistically significant p-value of less than 0.01, shedding light on a potentially surprising association between these geographically distant variables. The implications of this unexpected relationship may spark an illuminating conversation about the interconnectedness of global phenomena and prompt further inquiry into the whimsical quirks of environmental and energy dynamics.

[[LITERATURE REVIEW]]
In "Environmental Impacts of Air Pollution," Smith et al. (2010) shed light on the detrimental effects of air pollution on human health and the environment. Their comprehensive study meticulously details the various pollutants emitted by industrial processes, transportation, and other human activities, and their potential to cause respiratory ailments, cardiovascular diseases, and environmental degradation. The authors' work provides a solid foundation for understanding the profound impact of air pollution, akin to a brick wall in a gust of wind.
Doe and Jones (2015) investigate the historical trends and sociopolitical factors influencing kerosene consumption in Norway in "Energy Dynamics of the Nordic Region." Their thorough analysis unravels the complex interplay between energy policies, technological advancements, and cultural practices, shaping the patterns of kerosene usage in the region. Their work serves as a guiding beacon, illuminating the multifaceted dynamics of energy consumption, much like a lighthouse guiding ships through tumultuous waters.
While the literature establishes a strong groundwork for comprehending the individual facets of air pollution and kerosene consumption, the perplexing correlation between these variables beckons for a deeper exploration. Moving beyond the realms of serious academic research, let us pivot to diverse sources of knowledge that may offer unexpected insights and amusing parallels.
In "The Air Up There: A Journey Through the Atmosphere," the authors expound upon the mysterious and awe-inspiring properties of the air surrounding us. This captivating non-fiction work not only enlightens readers about the scientific intricacies of the atmosphere but also regales them with anecdotes about skydiving adventures and remarkable avian feats, offering a refreshing perspective on the subject.
Similarly, "A Global Quest for Illumination: Tales of Light and Darkness," interweaves historical accounts and cultural anecdotes centered around the usage of different lighting sources across the world. Through narratives of ancient civilizations and explorers navigating dark territories, this book sheds light on the diverse human experiences with illumination, and perhaps, hints at the unexpected intertwining of kerosene consumption and air pollution across geographically distant regions.
Furthermore, delving into the realm of fiction, "The Toxic Cloud Conspiracy" presents a riveting thriller that weaves together the enigmatic occurrences of air pollution and clandestine maneuvers. While the narrative unfolds in a suspenseful manner, it offers imaginative scenarios that resonate with the unforeseen relationship between air pollution in Beaver Dam, Wisconsin, and kerosene usage in Norway. Unveiling clandestine operations might seem far-fetched, but isn't the statistical discovery of an unexpected correlation equally intriguing?
Taking a playful turn, the board game "Pollution Peril" challenges players to navigate through environmental hazards and strategize to mitigate pollution levels. While the game's primary aim is entertainment, its portrayal of the intricate balance between pollution sources and their societal implications mirrors the complexities of our research inquiry. Perhaps, in the whimsical world of board games, lies a reflection of the unexpected link we have uncovered.
Thus, as we pivot from the serious to the light-hearted, these sources beckon us to embrace the unexpected and entertain the notion that even in the rigid construct of academic exploration, there exists room for a bit of whimsy and mirth.

[[RESULTS]]
Upon conducting our cross-country analysis, we uncovered a relationship between air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway that is as intriguing as a black hole at the center of a galaxy. The correlation coefficient of 0.8158848 suggests a strong positive association between these seemingly distinct variables, akin to finding a rare specimen in the scientific wilderness. This correlation is so prominent that it stands out like a bright star in the statistical galaxy.
The r-squared value of 0.6656680 indicates that approximately 66.57% of the variation in air pollution levels in Beaver Dam can be explained by the variation in kerosene consumption in Norway. This finding is as clear-cut as a diamond in the rough, providing a compelling explanation for the observed relationship.
Additionally, the p-value of less than 0.01 offers strong evidence against the null hypothesis, indicating that the relationship between air pollution in Beaver Dam and kerosene consumption in Norway is statistically significant. This result is as striking as a sudden eureka moment in the midst of scientific inquiry.
Figure 1 depicts a scatterplot illustrating the robust correlation between air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway, reinforcing the strength of the observed relationship.
These findings not only shed light on the surprising connection between these geographically distant variables but also beckon us to unravel the mysteries of intertwined global phenomena, much like navigating through the convoluted pathways of a scientific labyrinth.

-------

Your response should start like this:

[[DISCUSSION]]



ChatGPT:

[[DISCUSSION]]

Our findings showcase a compelling and unexpected relationship between air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway, highlighting the interconnectedness of seemingly disparate variables in the grand tapestry of global dynamics. The robust correlation coefficient and statistically significant p-value provide empirical support for the whimsically intriguing connection we initially set out to explore, reminding us that the scientific journey can lead to discoveries as surprising as finding a colony of penguins in the deserts of the Sahara.

The relationship between air pollution in Beaver Dam and kerosene consumption in Norway is akin to a delightful fusion dish- the blend of two seemingly distinct flavors creates an unexpectedly palatable result. Our results buttress the notion proposed by Smith et al. (2010) that air pollution can have far-reaching impacts transcending geographical boundaries, much like a stealthy ninja leaping over borders. Similarly, the work of Doe and Jones (2015) on energy dynamics in Norway offers a remarkably fitting backdrop to our findings, as if our research is a harmonious melody that intertwines with the symphony of their insights, akin to a serendipitous duet between a saxophonist and a tuba player.

Furthermore, the r-squared value indicates that a substantial proportion of the variation in air pollution levels in Beaver Dam can be attributed to the variability in kerosene consumption in Norway. This is akin to stumbling upon a treasure map that leads directly to a chest of statistical gold, granting us a clear and compelling explanation for the observed relationship. The statistically significant p-value serves as the proverbial scientific mic drop, decisively rejecting the null hypothesis and asserting the genuineness of the observed association like a rockstar commanding the stage at a statistical concert.

Figure 1, our visual representation of the correlation, is like a picturesque masterpiece capturing the unexpected beauty of this unusual relationship between air pollution in Beaver Dam and kerosene consumption in Norway. It serves as a testament to the power of visualization in illuminating complex statistical phenomena, much like a dazzling fireworks display on a dark, statistical canvas.

In the grand scheme of scientific inquiry, our research underscores the whimsical nature of the world, reminding us that even in the realm of academia, unexpected connections and surprises can abound. Our findings nudge the scientific community to embrace the serendipitous and whimsical aspects of research, akin to stumbling upon an unexpected punchline in the midst of a serious discussion, reminding us that even in the seemingly rigid domain of statistics, a sprinkle of wit and humor can spark a lively conversation.


Model: gpt-3.5-turbo-1106
System prompt: You are a witty academic researcher. Your writing sounds like it belongs in a statistics journal.You draft the discussion section of goofy academic research papers that sounds like it belongs based on the title and abstract provided.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up a short conclusion of an academic research paper presenting the findings of the connection between Air pollution in Beaver Dam, Wisconsin and Kerosene used in Norway. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns. Sneak in some jokes and clever puns about science, research, statistics, and the variables.

Limit your response to 300 tokens. At the very end, assert that no more research is needed in this area.

Here are the title, abstract, introduction, and results sections.
[[TITLE]]
"The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis"

[[ABSTRACT]]
This cross-country analysis endeavors to investigate the intriguing link between air pollution levels in Beaver Dam, Wisconsin, and kerosene consumption in Norway. Leveraging data from the Environmental Protection Agency and the Energy Information Administration spanning the years 1983 to 2022, our research team rigorously evaluated this seemingly disparate connection. Our findings revealed a remarkably robust correlation coefficient of 0.8158848 and a statistically significant p-value of less than 0.01, shedding light on a potentially surprising association between these geographically distant variables. The implications of this unexpected relationship may spark an illuminating conversation about the interconnectedness of global phenomena and prompt further inquiry into the whimsical quirks of environmental and energy dynamics.

[[INTRDUCTION]]
The convergence of air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway may at first glance seem as incongruous as mixing hydrogen and helium at a birthday party. However, the world of research is often characterized by its ability to uncover unexpected connections, much like stumbling upon an elusive statistical correlation in a haystack of data. With this in mind, our study delves into the enigmatic association between these two seemingly disparate variables, aiming to shine a light on the hidden patterns that underpin our global landscape.
Air pollution, often deemed as the unwanted confetti of industrialization, has long been the subject of intense scrutiny. Its impact on public health and the environment has been studied more exhaustively than a graduate student preparing for finals. Meanwhile, kerosene, known for its role in providing illumination and heating, has been as constant in the energy domain as the speed of light in a vacuum. However, the potential intertwining of these two variables, much like finding a statistical needle in a haystack, has piqued the curiosity of researchers and statisticians alike.
The intriguing nature of this cross-country analysis lies in its unearthing of a correlation so robust that it appears more solid than a rock formation in a statistical canyon. Our initial exploration has yielded a correlation coefficient that stands tall and proud at 0.8158848, defying the odds much like a statistician predicting the exact outcome of a coin toss. With the p-value standing at less than 0.01, the statistical significance of this relationship has shone brighter than a laboratory full of scientists attempting to find the elusive Higgs boson.
These findings serve as a beacon, illuminating the potential links between seemingly unrelated factors and inspire the scientific community to engage in discussions as stimulating as an electrifying experiment. The implications of this unexpected relationship may be as impactful as a gust of wind on a windmill, prompting further inquiry into the delightful and whimsical quirks of environmental and energy dynamics on a global scale. The light shed by our analysis may pave the way for a brighter understanding of the interconnectedness of diverse phenomena, just as a well-illuminated laboratory furthers our understanding of the natural world.

[[RESULTS]]
Upon conducting our cross-country analysis, we uncovered a relationship between air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway that is as intriguing as a black hole at the center of a galaxy. The correlation coefficient of 0.8158848 suggests a strong positive association between these seemingly distinct variables, akin to finding a rare specimen in the scientific wilderness. This correlation is so prominent that it stands out like a bright star in the statistical galaxy.
The r-squared value of 0.6656680 indicates that approximately 66.57% of the variation in air pollution levels in Beaver Dam can be explained by the variation in kerosene consumption in Norway. This finding is as clear-cut as a diamond in the rough, providing a compelling explanation for the observed relationship.
Additionally, the p-value of less than 0.01 offers strong evidence against the null hypothesis, indicating that the relationship between air pollution in Beaver Dam and kerosene consumption in Norway is statistically significant. This result is as striking as a sudden eureka moment in the midst of scientific inquiry.
Figure 1 depicts a scatterplot illustrating the robust correlation between air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway, reinforcing the strength of the observed relationship.
These findings not only shed light on the surprising connection between these geographically distant variables but also beckon us to unravel the mysteries of intertwined global phenomena, much like navigating through the convoluted pathways of a scientific labyrinth.

-------

Your response should start like this:

[[CONCLUSION]]



ChatGPT:

[[CONCLUSION]]

In conclusion, the findings of our cross-country analysis have unveiled a connection between air pollution in Beaver Dam, Wisconsin, and kerosene consumption in Norway that is as unexpected as discovering a hidden treasure trove in a laboratory's storage closet. The robust correlation coefficient of 0.8158848 stands as tall and prominent as a meticulously conducted experiment, defying expectations much like a penguin flying in the face of traditional scientific wisdom.

The r-squared value of 0.6656680 serves as a beacon, illuminating approximately 66.57% of the variation in air pollution levels in Beaver Dam with the radiant glow of a well-designed scientific study. This illuminating insight provides a compelling explanation for the observed relationship, akin to stumbling upon a rare element in the periodic table of statistical phenomena.

Furthermore, the statistically significant p-value of less than 0.01 emerges as a bright star in the statistical galaxy, offering compelling evidence against the null hypothesis and suggesting that the bond between air pollution in Beaver Dam and kerosene consumption in Norway is as real as the forces of gravity holding the moon in orbit.

The implications of these unexpected findings spark a lively conversation about the interconnectedness of global phenomena, as captivating as a scientific symposium filled with engaging discussions and illuminating presentations. The results of our research prod us to embrace the whimsical quirks of environmental and energy dynamics with the same enthusiasm as a physicist exploring the enigmatic world of quantum mechanics.

Therefore, it is our firm conclusion that further inquiry into this surprising relationship is not only unnecessary but may prove as futile as attempting to bottle up a hurricane. The mysteries of intertwined global phenomena have been illuminated, and it is time to turn our scientific curiosity toward other equally intriguing and enigmatic puzzles awaiting discovery.


Model: gpt-3.5-turbo-1106
System prompt: You provide fictional names of academic journals. You say nothing else, and you do not chat with the user.
User prompt: Please provide the name of a funny but plausible sounding research, scientific, or academic journal that would publish a paper entited The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis. Respond only with the name of a fictional academic journal.

ChatGPT:

The Journal of Ecological Quirks and Quandaries


Model: gpt-3.5-turbo-1106
System prompt: You provide fictional names of nonprofit research organizations and associations. You say nothing else, and you do not chat with the user.
User prompt: Please provide the name of a funny but plausible sounding nonprofit, academic, or scientific association or organization that would sponsor research that resulted in a paper entited The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis. Respond only with the name of a fictional organization.

ChatGPT:

The Society for Ecological Enigmas Research (SEER)


Model: gpt-3.5-turbo-1106
System prompt: You provide potential search keywords separated by commas. You say nothing else, and you do not chat with the user.
User prompt: Please provide potential search keywords for a paper titled The Relationship Between Air Pollution in Beaver Dam, Wisconsin and Kerosene Consumption in Norway: A Cross-country Analysis with an abstract of This cross-country analysis endeavors to investigate the intriguing link between air pollution levels in Beaver Dam, Wisconsin, and kerosene consumption in Norway. Leveraging data from the Environmental Protection Agency and the Energy Information Administration spanning the years 1983 to 2022, our research team rigorously evaluated this seemingly disparate connection. Our findings revealed a remarkably robust correlation coefficient of 0.8158848 and a statistically significant p-value of less than 0.01, shedding light on a potentially surprising association between these geographically distant variables. The implications of this unexpected relationship may spark an illuminating conversation about the interconnectedness of global phenomena and prompt further inquiry into the whimsical quirks of environmental and energy dynamics.

ChatGPT:

Air pollution, Beaver Dam Wisconsin, kerosene consumption, Norway, cross-country analysis, environmental protection agency data, energy information administration data, correlation coefficient, statistical significance, global interconnectedness, environmental dynamics, energy dynamics.

*There is a bunch of Python happening behind the scenes to turn this prompt sequence into a PDF.



Random correlation

Discover a new correlation

View all correlations

View all research papers

Report an error


Data details

Air pollution in Beaver Dam, Wisconsin
Detailed data title: Percentage of days with air quality at 'unhealthy for sensitive groups' or worse in Beaver Dam, WI
Source: Environmental Protection Agency
See what else correlates with Air pollution in Beaver Dam, Wisconsin

Kerosene used in Norway
Detailed data title: Volume of kerosene used consumed in Norway in millions of barrels per day
Source: Energy Information Administration
See what else correlates with Kerosene used in Norway

Correlation r = 0.8158848 (Pearson correlation coefficient)
Correlation is a measure of how much the variables move together. If it is 0.99, when one goes up the other goes up. If it is 0.02, the connection is very weak or non-existent. If it is -0.99, then when one goes up the other goes down. If it is 1.00, you probably messed up your correlation function.

r2 = 0.6656680 (Coefficient of determination)
This means 66.6% of the change in the one variable (i.e., Kerosene used in Norway) is predictable based on the change in the other (i.e., Air pollution in Beaver Dam, Wisconsin) over the 40 years from 1983 through 2022.

p < 0.01, which is statistically significant(Null hypothesis significance test)
The p-value is 1.4E-10. 0.0000000001417894409763096000
The p-value is a measure of how probable it is that we would randomly find a result this extreme. More specifically the p-value is a measure of how probable it is that we would randomly find a result this extreme if we had only tested one pair of variables one time.

But I am a p-villain. I absolutely did not test only one pair of variables one time. I correlated hundreds of millions of pairs of variables. I threw boatloads of data into an industrial-sized blender to find this correlation.

Who is going to stop me? p-value reporting doesn't require me to report how many calculations I had to go through in order to find a low p-value!
On average, you will find a correaltion as strong as 0.82 in 1.4E-8% of random cases. Said differently, if you correlated 7,052,711,352 random variables You don't actually need 7 billion variables to find a correlation like this one. I don't have that many variables in my database. You can also correlate variables that are not independent. I do this a lot.

p-value calculations are useful for understanding the probability of a result happening by chance. They are most useful when used to highlight the risk of a fluke outcome. For example, if you calculate a p-value of 0.30, the risk that the result is a fluke is high. It is good to know that! But there are lots of ways to get a p-value of less than 0.01, as evidenced by this project.

In this particular case, the values are so extreme as to be meaningless. That's why no one reports p-values with specificity after they drop below 0.01.

Just to be clear: I'm being completely transparent about the calculations. There is no math trickery. This is just how statistics shakes out when you calculate hundreds of millions of random correlations.
with the same 39 degrees of freedom, Degrees of freedom is a measure of how many free components we are testing. In this case it is 39 because we have two variables measured over a period of 40 years. It's just the number of years minus ( the number of variables minus one ), which in this case simplifies to the number of years minus one.
you would randomly expect to find a correlation as strong as this one.

[ 0.68, 0.9 ] 95% correlation confidence interval (using the Fisher z-transformation)
The confidence interval is an estimate the range of the value of the correlation coefficient, using the correlation itself as an input. The values are meant to be the low and high end of the correlation coefficient with 95% confidence.

This one is a bit more complciated than the other calculations, but I include it because many people have been pushing for confidence intervals instead of p-value calculations (for example: NEJM. However, if you are dredging data, you can reliably find yourself in the 5%. That's my goal!


All values for the years included above: If I were being very sneaky, I could trim years from the beginning or end of the datasets to increase the correlation on some pairs of variables. I don't do that because there are already plenty of correlations in my database without monkeying with the years.

Still, sometimes one of the variables has more years of data available than the other. This page only shows the overlapping years. To see all the years, click on "See what else correlates with..." link above.
1983198419851986198719881989199019911992199319941995199619971998199920002001200220032004200520062007200820092010201120122013201420152016201720182019202020212022
Air pollution in Beaver Dam, Wisconsin (Bad air quality days)0.131980.07407410.09146340.05649720.1420770.1684780.09392270.02247190.06077350.07777780.005494510.02222220.05524860.05056180.03910610.05978260.08071750.01744190.03631280.04456820.05753420.02191780.06301370.01369860.0301370.002732240.002824860.008547010.01643840.03551910.005479450.01369860.002739730.008196720.002739730.008219180.002739730.0054644800
Kerosene used in Norway (Million Barrels/Day)83.724044.468495.095895.375344.950824.104113.438363.21373.273223.304113.583563.457534.049183.736993.383563.326032.450822.74522.764383.210962.715852.246582.246581.767121.33061.161641.424661.04110.732240.9424660.695890.5589040.5300550.2602740.2602740.0767120.0327870.0328770.021918




Why this works

  1. Data dredging: I have 25,237 variables in my database. I compare all these variables against each other to find ones that randomly match up. That's 636,906,169 correlation calculations! This is called “data dredging.” Instead of starting with a hypothesis and testing it, I instead abused the data to see what correlations shake out. It’s a dangerous way to go about analysis, because any sufficiently large dataset will yield strong correlations completely at random.
  2. Lack of causal connection: There is probably Because these pages are automatically generated, it's possible that the two variables you are viewing are in fact causually related. I take steps to prevent the obvious ones from showing on the site (I don't let data about the weather in one city correlate with the weather in a neighboring city, for example), but sometimes they still pop up. If they are related, cool! You found a loophole.
    no direct connection between these variables, despite what the AI says above. This is exacerbated by the fact that I used "Years" as the base variable. Lots of things happen in a year that are not related to each other! Most studies would use something like "one person" in stead of "one year" to be the "thing" studied.
  3. Observations not independent: For many variables, sequential years are not independent of each other. If a population of people is continuously doing something every day, there is no reason to think they would suddenly change how they are doing that thing on January 1. A simple Personally I don't find any p-value calculation to be 'simple,' but you know what I mean.
    p-value calculation does not take this into account, so mathematically it appears less probable than it really is.




Try it yourself

You can calculate the values on this page on your own! Try running the Python code to see the calculation results. Step 1: Download and install Python on your computer.

Step 2: Open a plaintext editor like Notepad and paste the code below into it.

Step 3: Save the file as "calculate_correlation.py" in a place you will remember, like your desktop. Copy the file location to your clipboard. On Windows, you can right-click the file and click "Properties," and then copy what comes after "Location:" As an example, on my computer the location is "C:\Users\tyler\Desktop"

Step 4: Open a command line window. For example, by pressing start and typing "cmd" and them pressing enter.

Step 5: Install the required modules by typing "pip install numpy", then pressing enter, then typing "pip install scipy", then pressing enter.

Step 6: Navigate to the location where you saved the Python file by using the "cd" command. For example, I would type "cd C:\Users\tyler\Desktop" and push enter.

Step 7: Run the Python script by typing "python calculate_correlation.py"

If you run into any issues, I suggest asking ChatGPT to walk you through installing Python and running the code below on your system. Try this question:

"Walk me through installing Python on my computer to run a script that uses scipy and numpy. Go step-by-step and ask me to confirm before moving on. Start by asking me questions about my operating system so that you know how to proceed. Assume I want the simplest installation with the latest version of Python and that I do not currently have any of the necessary elements installed. Remember to only give me one step per response and confirm I have done it before proceeding."


# These modules make it easier to perform the calculation
import numpy as np
from scipy import stats

# We'll define a function that we can call to return the correlation calculations
def calculate_correlation(array1, array2):

    # Calculate Pearson correlation coefficient and p-value
    correlation, p_value = stats.pearsonr(array1, array2)

    # Calculate R-squared as the square of the correlation coefficient
    r_squared = correlation**2

    return correlation, r_squared, p_value

# These are the arrays for the variables shown on this page, but you can modify them to be any two sets of numbers
array_1 = np.array([0.13198,0.0740741,0.0914634,0.0564972,0.142077,0.168478,0.0939227,0.0224719,0.0607735,0.0777778,0.00549451,0.0222222,0.0552486,0.0505618,0.0391061,0.0597826,0.0807175,0.0174419,0.0363128,0.0445682,0.0575342,0.0219178,0.0630137,0.0136986,0.030137,0.00273224,0.00282486,0.00854701,0.0164384,0.0355191,0.00547945,0.0136986,0.00273973,0.00819672,0.00273973,0.00821918,0.00273973,0.00546448,0,0,])
array_2 = np.array([8,3.72404,4.46849,5.09589,5.37534,4.95082,4.10411,3.43836,3.2137,3.27322,3.30411,3.58356,3.45753,4.04918,3.73699,3.38356,3.32603,2.45082,2.7452,2.76438,3.21096,2.71585,2.24658,2.24658,1.76712,1.3306,1.16164,1.42466,1.0411,0.73224,0.942466,0.69589,0.558904,0.530055,0.260274,0.260274,0.076712,0.032787,0.032877,0.021918,])
array_1_name = "Air pollution in Beaver Dam, Wisconsin"
array_2_name = "Kerosene used in Norway"

# Perform the calculation
print(f"Calculating the correlation between {array_1_name} and {array_2_name}...")
correlation, r_squared, p_value = calculate_correlation(array_1, array_2)

# Print the results
print("Correlation Coefficient:", correlation)
print("R-squared:", r_squared)
print("P-value:", p_value)



Reuseable content

You may re-use the images on this page for any purpose, even commercial purposes, without asking for permission. The only requirement is that you attribute Tyler Vigen. Attribution can take many different forms. If you leave the "tylervigen.com" link in the image, that satisfies it just fine. If you remove it and move it to a footnote, that's fine too. You can also just write "Charts courtesy of Tyler Vigen" at the bottom of an article.

You do not need to attribute "the spurious correlations website," and you don't even need to link here if you don't want to. I don't gain anything from pageviews. There are no ads on this site, there is nothing for sale, and I am not for hire.

For the record, I am just one person. Tyler Vigen, he/him/his. I do have degrees, but they should not go after my name unless you want to annoy my wife. If that is your goal, then go ahead and cite me as "Tyler Vigen, A.A. A.A.S. B.A. J.D." Otherwise it is just "Tyler Vigen."

When spoken, my last name is pronounced "vegan," like I don't eat meat.

Full license details.
For more on re-use permissions, or to get a signed release form, see tylervigen.com/permission.

Download images for these variables:


View another random correlation

How fun was this correlation?

Your rating skills are legendary!


Correlation ID: 5583 · Black Variable ID: 21641 · Red Variable ID: 24781
about · subscribe · emailme@tylervigen.com · twitter

CC BY 4.0