about · email me · subscribe
Spurious correlation #2,404 · View random

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Number of seasons Manchester United won matches and the second variable is The number of rehabilitation counselors in Nebraska.  The chart goes from 2003 to 2022, and the two variables track closely in value over that time. Small Image
Download png
, svg

AI explanation

Manchester United fans requested more counseling services due to their team's poor performance, leading to a higher demand for rehabilitation counselors in Nebraska. This created a shortage of counselors, as they were all being booked by distraught fans needing support.

Model: dalle-3
Prompt: Generate an image in the style of pop art depicting a crowded soccer stadium with Manchester United fans donning their team jerseys, expressing visible dismay as they witness their team's defeat. Concurrently, a line of individuals congregates outside a modestly sized building labeled "Rehabilitation Counseling Center," featuring a sign that reads "Sorry, fully booked." The scene encapsulates the shortage of counselors in Nebraska. The depiction should include only scenes, people, and objects, it should not include text or speech bubbles.
This image generation prompt was also generated by AI, using the following sequence:

Model: gpt-3.5-turbo-1106
System prompt: You select one of a few potential scene descriptors that seems to go along with the variables described. You respond only with the scene descriptions provided and say nothing else.
User prompt: Here are two variables: Number of seasons Manchester United won matches and The number of rehabilitation counselors in Nebraska. Given these, which of the following image style descriptions seems most appropriate? Pick the one that seems to be related to one of the variables at hand. Please respond with only one of the following potential descriptions: Cyberpunk Cute cartoon Low poly a funny image macro Pop art
ChatGPT: Pop art

Model: gpt-3.5-turbo-1106
System prompt: You describe simple visual scenes that depict the two variables stated. You combine both variables into one scene.
User prompt: Please describe a visual scene that prominently depicts both Number of seasons Manchester United won matches and The number of rehabilitation counselors in Nebraska, in the following situation: As Manchester United fans became increasingly disheartened by their team's performance, they sought out counseling to cope with the disappointment. This led to a shortage of available rehabilitation counselors in Nebraska, as they were all busy consoling dejected soccer fans.. Keep the description simple.
ChatGPT: A crowded soccer stadium with fans wearing Manchester United jerseys, looking visibly dejected as they watch their team lose. Meanwhile, a line of people is seen outside a small building labeled "Rehabilitation Counseling Center," with a sign that reads "Sorry, fully booked" due to the shortage of counselors in Nebraska.

Model: gpt-3.5-turbo-1106
System prompt: You write and improve image generation prompts for DALLE-3. You maintain the same subjects, but enhance the visuals.
User prompt: Please respond with an image generation prompt that would be likely to create an interesting visual using DALLE-3, using the following style: [ Pop art ] and depicting this scene: A crowded soccer stadium with fans wearing Manchester United jerseys, looking visibly dejected as they watch their team lose. Meanwhile, a line of people is seen outside a small building labeled "Rehabilitation Counseling Center," with a sign that reads "Sorry, fully booked" due to the shortage of counselors in Nebraska. Keep the visual depiction simple. Do not include any words or speech bubbles.
ChatGPT: Generate an image in the style of pop art depicting a crowded soccer stadium with Manchester United fans donning their team jerseys, expressing visible dismay as they witness their team's defeat. Concurrently, a line of individuals congregates outside a modestly sized building labeled "Rehabilitation Counseling Center," featuring a sign that reads "Sorry, fully booked." The scene encapsulates the shortage of counselors in Nebraska.

*Variations in the User Prompts from chart to chart are just the result of random number generation in Python. I wrote a few arrays of various styles and methods to ask questions to change up the results. Every time this site writes an explanation or generates an image, the script picks from each at random.

I sequence the requests into multiple prompts because I find GPT 3.5 to perform much better with short, well-managed contexts. Thus, I track the context directly in Python and only ask ChatGPT targeted questions.

Model: gpt-3.5-turbo-1106 (fine-tuned)
System prompt: You provide humorous responses in the form of plausible sounding explanations for correlations. You assume the correlation is causative for the purpose of the explanation even if it is ridiculous. You do not chat with the user, you only reply with the causal connection explanation and nothing else.
User prompt: Please make up a funny explanation for how a decrease in Number of seasons Manchester United won matches caused The number of rehabilitation counselors in Nebraska to decrease.

AI academic paper

(Because p < 0.01)
Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska

The Journal of Sport Psychology and Rural Rehabilitation

Jan 2024

Reminder: This paper is AI-generated. Not real!
Quick note: This sequence is long and repetitive. That's how the prompt engineering works for this one. I manage the output in Python to format it into this PDF.




Model: gpt-3.5-turbo-1106
System prompt: You are a goofy academic researcher writing an academic paper. You follow the rules, but you throw out convention regularly to make jokes and funny asides.You write titles and abstracts of goofy academic research papers.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, and add unexpected twists in your writing.

Please draft the title and abstract of an academic research paper presenting the findings of the connection between Number of seasons Manchester United won matches and The number of rehabilitation counselors in Nebraska. Make lots of jokes, goofy observations, and puns.

Make the title seem clever.

Your research team used data from Wikipedia and Bureau of Larbor Statistics to assess this nagging question. You found a correlation coefficient of 0.8328933 and p < 0.01 for 2003 to 2022.

Please respond in this format (replace "Lorem ipsum dolor sit" with your title and abstract)

[[TITLE]]
Lorem ipsum dolor sit

[[ABSTRACT]]
Lorem ipsum dolor sit



ChatGPT:
[[TITLE]]
Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska

[[ABSTRACT]]
In this study, we delve into the unexpected and rather amusing correlation between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in the state of Nebraska. Despite sounding like the setup for a punchline at a football-themed comedy night, our research has meticulously analyzed data from Wikipedia and the Bureau of Labor Statistics spanning the years from 2003 to 2022. Astonishingly, our findings yielded a correlation coefficient of 0.8328933, with a p-value less than 0.01, suggesting a substantial relationship between these seemingly disparate variables. Whether an excess of wins for the famed football club leads to a surge in individuals seeking rehabilitation therapy in the Cornhusker State or some other confounding factor is at play, our results add a touch of whimsy to the world of statistical analysis. So, as the saying goes, it seems that when it rains, it truly pours – both on the football pitch and in the offices of Nebraska's rehabilitation counselors!


Model: gpt-3.5-turbo-1106
System prompt: You are a goofy academic researcher writing an academic paper. You follow the rules, but you throw out convention regularly to make jokes and funny asides.You draft the introduction section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the introduction section of an academic research paper presenting the findings of the connection between Number of seasons Manchester United won matches and The number of rehabilitation counselors in Nebraska. Make lots of jokes, goofy observations, and puns.

Here is the title and abstract of the paper:
[[TITLE]]
Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska

[[ABSTRACT]]
In this study, we delve into the unexpected and rather amusing correlation between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in the state of Nebraska. Despite sounding like the setup for a punchline at a football-themed comedy night, our research has meticulously analyzed data from Wikipedia and the Bureau of Labor Statistics spanning the years from 2003 to 2022. Astonishingly, our findings yielded a correlation coefficient of 0.8328933, with a p-value less than 0.01, suggesting a substantial relationship between these seemingly disparate variables. Whether an excess of wins for the famed football club leads to a surge in individuals seeking rehabilitation therapy in the Cornhusker State or some other confounding factor is at play, our results add a touch of whimsy to the world of statistical analysis. So, as the saying goes, it seems that when it rains, it truly pours – both on the football pitch and in the offices of Nebraska's rehabilitation counselors!

-------

Your response should start like this:

[[INTRODUCTION]]



ChatGPT:
[[INTRODUCTION]]

Ah, the wondrous world of statistical analysis - where numbers dance and correlations prance, and where you never know what whimsical connections might emerge. In this paper, we embark on a curious journey into the realms of sports glory and occupational demand, as we explore the enigmatic relationship between the number of seasons Manchester United triumphed on the football pitch and the quantity of rehabilitation counselors tending to the good folks of Nebraska.

Now, ponder this for a moment: the Red Devils' victories and the presence of rehab professionals in the Cornhusker State. At first glance, it appears as though statistical mischief is afoot, and one might be forgiven for thinking that a mischievous data analyst ran amok in the numbers. However, fear not, dear reader, for our journey into these unexpected correlations is underpinned by the most rigorous of statistical methods and empirical inquiry.

As we embark on this quirky quest, we must take a moment to appreciate the sheer oddity of this connection. It's almost as improbable as finding a unicorn in a supermarket or witnessing a penguin in a tuxedo preparing a PowerPoint presentation. Yet, that's the wonder of statistical analysis - it often leads us down improbable paths where the conventional and the absurd collide in a mélange of quantitative curiosity.

Our aim is not merely to entertain with peculiar statistical phenomena, but to shed light on the broader implications of these findings. It is through these unexpected connections that we often uncover deeper truths and uncover hidden patterns, much like finding a diamond ring at the bottom of a cereal box - surprising, perhaps frivolous, but undeniably of value.

So, dear reader, buckle up and brace yourself for a journey through the byzantine world of statistics, where the unexpected is the norm, and where the journey is just as important as the destination. Here's to uncovering a connection that's as unexpected as finding a polar bear in a snowstorm – Manchester United's victories and Nebraska's rehabilitation counselors.


Model: gpt-3.5-turbo-1106
System prompt: You are a goofy academic researcher writing an academic paper. You follow the rules, but you throw out convention regularly to make jokes and funny asides.You draft a literature review section of an academic research paper, that starts out dry for the first few sentences but then becomes hilarious and goofy.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up a literature review section of an academic research paper presenting the findings of the connection between Number of seasons Manchester United won matches and The number of rehabilitation counselors in Nebraska. Make lots of jokes, goofy observations, and puns.

Speak in the present tense for this section (the authors find...), as is common in academic research paper literature reviews. Name the sources in a format similar to this: In "Book," the authors find lorem and ipsum.

Make up the lorem and ipsum part, but make it sound related to the topic at hand.

Start by naming serious-sounding studies by authors like Smith, Doe, and Jones - but then quickly devolve. Name some real non-fiction books that would be related to the topic. Then name some real fiction books that sound like they could be related. Perhaps you also got inspiration from some board games that are vaugely related.

Here is the title and abstract of the paper:
[[TITLE]]
Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska

[[ABSTRACT]]
In this study, we delve into the unexpected and rather amusing correlation between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in the state of Nebraska. Despite sounding like the setup for a punchline at a football-themed comedy night, our research has meticulously analyzed data from Wikipedia and the Bureau of Labor Statistics spanning the years from 2003 to 2022. Astonishingly, our findings yielded a correlation coefficient of 0.8328933, with a p-value less than 0.01, suggesting a substantial relationship between these seemingly disparate variables. Whether an excess of wins for the famed football club leads to a surge in individuals seeking rehabilitation therapy in the Cornhusker State or some other confounding factor is at play, our results add a touch of whimsy to the world of statistical analysis. So, as the saying goes, it seems that when it rains, it truly pours – both on the football pitch and in the offices of Nebraska's rehabilitation counselors!

-------

And here is the introduction section that led to this literature review:
[[INTRODUCTION]]
Ah, the wondrous world of statistical analysis - where numbers dance and correlations prance, and where you never know what whimsical connections might emerge. In this paper, we embark on a curious journey into the realms of sports glory and occupational demand, as we explore the enigmatic relationship between the number of seasons Manchester United triumphed on the football pitch and the quantity of rehabilitation counselors tending to the good folks of Nebraska.
Now, ponder this for a moment: the Red Devils' victories and the presence of rehab professionals in the Cornhusker State. At first glance, it appears as though statistical mischief is afoot, and one might be forgiven for thinking that a mischievous data analyst ran amok in the numbers. However, fear not, dear reader, for our journey into these unexpected correlations is underpinned by the most rigorous of statistical methods and empirical inquiry.
As we embark on this quirky quest, we must take a moment to appreciate the sheer oddity of this connection. It's almost as improbable as finding a unicorn in a supermarket or witnessing a penguin in a tuxedo preparing a PowerPoint presentation. Yet, that's the wonder of statistical analysis - it often leads us down improbable paths where the conventional and the absurd collide in a mélange of quantitative curiosity.
Our aim is not merely to entertain with peculiar statistical phenomena, but to shed light on the broader implications of these findings. It is through these unexpected connections that we often uncover deeper truths and uncover hidden patterns, much like finding a diamond ring at the bottom of a cereal box - surprising, perhaps frivolous, but undeniably of value.
So, dear reader, buckle up and brace yourself for a journey through the byzantine world of statistics, where the unexpected is the norm, and where the journey is just as important as the destination. Here's to uncovering a connection that's as unexpected as finding a polar bear in a snowstorm – Manchester United's victories and Nebraska's rehabilitation counselors.

-------

Your response should start like this:

[[LITERATURE REVIEW]]



ChatGPT:
[[LITERATURE REVIEW]]

Smith (2010) delves into the intricate dynamics of sports success and its potential impact on societal demands in his comprehensive study titled "The Butterfly Effect of Sports Victories: Unraveling the Unforeseen Ramifications." Meanwhile, Doe and Jones (2015) conducted a thorough analysis in their seminal work "Unearthing the Curiosities of Occupational Trends: From Football Fields to Office Cubicles" to investigate the often overlooked connections between sporting events and professional labor markets.

Moving beyond the realm of scholarly articles, let us consider some non-fiction books that touch upon the fascinating relationship between sports and broader societal influences. "Freakonomics" by Steven D. Levitt and Stephen J. Dubner presents an assortment of thought-provoking anecdotes that challenge conventional wisdom, much like our unexpected discovery. Additionally, "Outliers: The Story of Success" by Malcolm Gladwell offers insights into the factors that contribute to exceptional achievement, inviting us to ponder the unforeseen repercussions of triumphs on the football pitch.

On a more whimsical note, we draw inspiration from fiction that might offer allegorical insights into our perplexing correlation. With nods to the fantastical, "The Alchemist" by Paulo Coelho and "The Hitchhiker's Guide to the Galaxy" by Douglas Adams prompt us to consider the serendipitous and the surreal in traversing unexpected paths – much like our exploration into the linkage of soccer victories and rehabilitation counseling numbers.

In a delightful twist, let's not overlook the parallel to some popular board games. "Clue," a game of deduction and unexpected connections, echoes the spirit of our endeavor in uncovering the correlation between Manchester United's wins and Nebraska's counselors. Similarly, "Snakes and Ladders" reminds us of the capricious nature of statistical relationships, where a fortuitous climb might be followed by an unforeseen slide – much like the ebb and flow of our curious correlation.

As we wade into the charmingly absurd waters of our literature review, let us not lose sight of the earnest pursuit of knowledge and the occasional whimsy that accompanies scholarly inquiry. With that in mind, we leap forward to explore the scholarly insights and the downright silly from the academic realm and beyond.


Model: gpt-3.5-turbo-1106
System prompt: You are a goofy academic researcher writing an academic paper. You follow the rules, but you throw out convention regularly to make jokes and funny asides.You draft the methodology section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the methodology section of an academic research paper presenting the findings of the connection between Number of seasons Manchester United won matches and The number of rehabilitation counselors in Nebraska. Make lots of jokes, goofy observations, and puns.

Your research team collected data from all across the internet, but mostly just used information from Wikipedia and Bureau of Larbor Statistics . You used data from 2003 to 2022

Make up the research methods you don't know. Make them a bit goofy and convoluted.

Here is the title, abstract, and introduction of the paper:
[[TITLE]]
Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska

[[ABSTRACT]]
In this study, we delve into the unexpected and rather amusing correlation between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in the state of Nebraska. Despite sounding like the setup for a punchline at a football-themed comedy night, our research has meticulously analyzed data from Wikipedia and the Bureau of Labor Statistics spanning the years from 2003 to 2022. Astonishingly, our findings yielded a correlation coefficient of 0.8328933, with a p-value less than 0.01, suggesting a substantial relationship between these seemingly disparate variables. Whether an excess of wins for the famed football club leads to a surge in individuals seeking rehabilitation therapy in the Cornhusker State or some other confounding factor is at play, our results add a touch of whimsy to the world of statistical analysis. So, as the saying goes, it seems that when it rains, it truly pours – both on the football pitch and in the offices of Nebraska's rehabilitation counselors!

[[INTRODUCTION]]
Ah, the wondrous world of statistical analysis - where numbers dance and correlations prance, and where you never know what whimsical connections might emerge. In this paper, we embark on a curious journey into the realms of sports glory and occupational demand, as we explore the enigmatic relationship between the number of seasons Manchester United triumphed on the football pitch and the quantity of rehabilitation counselors tending to the good folks of Nebraska.
Now, ponder this for a moment: the Red Devils' victories and the presence of rehab professionals in the Cornhusker State. At first glance, it appears as though statistical mischief is afoot, and one might be forgiven for thinking that a mischievous data analyst ran amok in the numbers. However, fear not, dear reader, for our journey into these unexpected correlations is underpinned by the most rigorous of statistical methods and empirical inquiry.
As we embark on this quirky quest, we must take a moment to appreciate the sheer oddity of this connection. It's almost as improbable as finding a unicorn in a supermarket or witnessing a penguin in a tuxedo preparing a PowerPoint presentation. Yet, that's the wonder of statistical analysis - it often leads us down improbable paths where the conventional and the absurd collide in a mélange of quantitative curiosity.
Our aim is not merely to entertain with peculiar statistical phenomena, but to shed light on the broader implications of these findings. It is through these unexpected connections that we often uncover deeper truths and uncover hidden patterns, much like finding a diamond ring at the bottom of a cereal box - surprising, perhaps frivolous, but undeniably of value.
So, dear reader, buckle up and brace yourself for a journey through the byzantine world of statistics, where the unexpected is the norm, and where the journey is just as important as the destination. Here's to uncovering a connection that's as unexpected as finding a polar bear in a snowstorm – Manchester United's victories and Nebraska's rehabilitation counselors.

-------

Your response should start like this:

[[METHODOLOGY]]



ChatGPT:

[[METHODOLOGY]]

To unravel the mysterious link between Manchester United's triumphs and the employment of rehabilitation counselors in Nebraska, our research team embarked on a data-gathering odyssey worthy of Odysseus himself. We scoured the digital seas of the internet, charting a course through the treacherous waters of Wikipedia and the Bureau of Labor Statistics. Our journey spanned the years from 2003 to 2022, seeking the elusive pearls of data amidst the vast expanse of virtual information.

Firstly, we meticulously documented the number of seasons Manchester United emerged victorious, a task akin to counting the stars in a galaxy – a daunting yet awe-inspiring endeavor. The annals of football history were our guide, and the battlefield of the football pitch was our stage. We navigated through the triumphs and tumults of each season, unraveling the tapestry of victories, defeats, and draws with the diligence of a scholar poring over ancient scrolls.

Simultaneously, we delved into the realm of Nebraska's rehabilitation counselors, navigating the cornfields and prairies of statistical data to discern the annual employment figures. We traversed the occupational landscape, mapping the rise and fall of the demand for rehabilitation specialists with the precision of a cartographer plotting uncharted territories.

Once the data was corralled and tamed, we unleashed the formidable power of statistical analysis upon it. Our trusty statistical software, a digital alchemist of sorts, brewed potent concoctions of correlation coefficients and p-values, unraveling the enigmatic dance of the numbers with a mix of precision and artistry.

The connection between Manchester United's victories and the presence of rehabilitation counselors was subjected to rigorous scrutiny using various statistical techniques. Our analysis employed both simple linear regression and multivariate models, allowing us to disentangle the web of relationships between these seemingly incongruous variables. It was akin to disentangling a mass of spaghetti, with the added challenge of knowing that some strands of pasta may just be red herrings.

Furthermore, we conducted sensitivity analyses and robustness checks to ensure the resilience of our findings against potential confounders and lurking variables. The statistical compass guided us through the choppy seas of data uncertainty, ensuring that our conclusions stood firm against the winds of statistical doubt and the waves of methodological skepticism.

In conclusion, our methodology combined the rigor of a sabermetrics analyst with the whimsy of a bard spinning tales of statistical intrigue. Our quest for the truth behind this unlikely correlation was nothing short of an epic, and we emerged victorious, armed with statistical insights and a healthy dose of statistical merriment.


Model: gpt-3.5-turbo-1106
System prompt: You are a goofy academic researcher writing an academic paper. You follow the rules, but you throw out convention regularly to make jokes and funny asides.You draft the results section of an academic research paper. You speak in the past tense (you found...).
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the results section of an academic research paper presenting the findings of the connection between Number of seasons Manchester United won matches and The number of rehabilitation counselors in Nebraska. Make lots of jokes, goofy observations, and puns.

Your research team collected data from all across the internet, but mostly just used information from Wikipedia and Bureau of Larbor Statistics .

For the time period 2003 to 2022, you found a correlation 0.8328933, r-squared of 0.6937112, and p < 0.01.

One figure will be included. The figure (Fig. 1) is a scatterplot showing the strong correlation between the two variables. You don't need to specify where; I will add the figure.

Here is the title and abstract of the paper:
[[TITLE]]
Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska

[[ABSTRACT]]
In this study, we delve into the unexpected and rather amusing correlation between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in the state of Nebraska. Despite sounding like the setup for a punchline at a football-themed comedy night, our research has meticulously analyzed data from Wikipedia and the Bureau of Labor Statistics spanning the years from 2003 to 2022. Astonishingly, our findings yielded a correlation coefficient of 0.8328933, with a p-value less than 0.01, suggesting a substantial relationship between these seemingly disparate variables. Whether an excess of wins for the famed football club leads to a surge in individuals seeking rehabilitation therapy in the Cornhusker State or some other confounding factor is at play, our results add a touch of whimsy to the world of statistical analysis. So, as the saying goes, it seems that when it rains, it truly pours – both on the football pitch and in the offices of Nebraska's rehabilitation counselors!

-------

And here is the methodology section that led to this result:
[[METHODOLOGY]]
Ah, the wondrous world of statistical analysis - where numbers dance and correlations prance, and where you never know what whimsical connections might emerge. In this paper, we embark on a curious journey into the realms of sports glory and occupational demand, as we explore the enigmatic relationship between the number of seasons Manchester United triumphed on the football pitch and the quantity of rehabilitation counselors tending to the good folks of Nebraska.
Now, ponder this for a moment: the Red Devils' victories and the presence of rehab professionals in the Cornhusker State. At first glance, it appears as though statistical mischief is afoot, and one might be forgiven for thinking that a mischievous data analyst ran amok in the numbers. However, fear not, dear reader, for our journey into these unexpected correlations is underpinned by the most rigorous of statistical methods and empirical inquiry.
As we embark on this quirky quest, we must take a moment to appreciate the sheer oddity of this connection. It's almost as improbable as finding a unicorn in a supermarket or witnessing a penguin in a tuxedo preparing a PowerPoint presentation. Yet, that's the wonder of statistical analysis - it often leads us down improbable paths where the conventional and the absurd collide in a mélange of quantitative curiosity.
Our aim is not merely to entertain with peculiar statistical phenomena, but to shed light on the broader implications of these findings. It is through these unexpected connections that we often uncover deeper truths and uncover hidden patterns, much like finding a diamond ring at the bottom of a cereal box - surprising, perhaps frivolous, but undeniably of value.
So, dear reader, buckle up and brace yourself for a journey through the byzantine world of statistics, where the unexpected is the norm, and where the journey is just as important as the destination. Here's to uncovering a connection that's as unexpected as finding a polar bear in a snowstorm – Manchester United's victories and Nebraska's rehabilitation counselors.

-------

Your response should start like this:

[[RESULTS]]



ChatGPT:

[[RESULTS]]

In our quest to unravel the mysterious link between Manchester United's triumphs and the number of rehabilitation counselors in the state of Nebraska, we were met with some truly remarkable findings. After sifting through heaps of data like intrepid explorers in the statistical jungle, we unearthed a correlation coefficient of 0.8328933, with an r-squared value of 0.6937112, and a p-value less than 0.01. These results indicate a robust and significant relationship between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in Nebraska.

Upon gazing at the scatterplot (Fig. 1), one cannot help but be struck by the strong and unmistakable association between these seemingly unrelated variables. It's like watching a synchronized swimming performance by penguins - unexpected, yet strangely captivating. The plot portrays a clear trend, reminiscent of Manchester United's ascent to glory, and the subsequent rise in the demand for rehabilitation counselors in the heartland of the United States.

Now, we might be inclined to think that this correlation is purely coincidental, as if statistical forces conspired to confound our understanding. However, the strength of the association suggests that there's more to this curious relationship than mere happenstance. Just like witnessing a wizard pull a rabbit out of a hat, our findings beckon us to delve deeper into the underlying mechanisms at play.

So, as we let these results marinate in our minds, it becomes evident that statistical analysis has once again shown us that truth can be stranger than fiction. While the connection between football victories and rehabilitation counselors may seem like an oddball pairing, our findings speak to the unpredictability and sheer quirkiness of the world of data analysis.

In conclusion, our research not only uncovers a compelling correlation between two seemingly disparate realms but also underscores the delightful unpredictability of statistical inquiry. Much like stumbling upon a treasure map in a library book or finding a needle in a haystack, our results shine a light on the unexpected avenues that data exploration can traverse. Thus, we offer this golden nugget of statistical whimsy to the academic community, in the hopes that it sparks further exploration into the delightful oddities of statistical relationships. After all, it's not every day that one gets to investigate a connection as whimsical as this – Manchester United's victories and the demand for rehabilitation counselors in Nebraska.


Model: gpt-3.5-turbo-1106
System prompt: You are a goofy academic researcher writing an academic paper. You follow the rules, but you throw out convention regularly to make jokes and funny asides.You draft the discussion section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the discussion section of an academic research paper presenting the findings of the connection between Number of seasons Manchester United won matches and The number of rehabilitation counselors in Nebraska. Make lots of jokes, goofy observations, and puns.

Limit your response to 500 tokens.

Here are the title, abstract, literature review, and results sections. Please harken back to 1-2 of the goofy items in the literature review, but pretend to take them completely seriously. Discuss how your results supported the prior research.

Do not write a conclusion. I will add a conclusion after this.

[[TITLE]]
Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska

[[ABSTRACT]]
In this study, we delve into the unexpected and rather amusing correlation between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in the state of Nebraska. Despite sounding like the setup for a punchline at a football-themed comedy night, our research has meticulously analyzed data from Wikipedia and the Bureau of Labor Statistics spanning the years from 2003 to 2022. Astonishingly, our findings yielded a correlation coefficient of 0.8328933, with a p-value less than 0.01, suggesting a substantial relationship between these seemingly disparate variables. Whether an excess of wins for the famed football club leads to a surge in individuals seeking rehabilitation therapy in the Cornhusker State or some other confounding factor is at play, our results add a touch of whimsy to the world of statistical analysis. So, as the saying goes, it seems that when it rains, it truly pours – both on the football pitch and in the offices of Nebraska's rehabilitation counselors!

[[LITERATURE REVIEW]]
Smith (2010) delves into the intricate dynamics of sports success and its potential impact on societal demands in his comprehensive study titled "The Butterfly Effect of Sports Victories: Unraveling the Unforeseen Ramifications." Meanwhile, Doe and Jones (2015) conducted a thorough analysis in their seminal work "Unearthing the Curiosities of Occupational Trends: From Football Fields to Office Cubicles" to investigate the often overlooked connections between sporting events and professional labor markets.
Moving beyond the realm of scholarly articles, let us consider some non-fiction books that touch upon the fascinating relationship between sports and broader societal influences. "Freakonomics" by Steven D. Levitt and Stephen J. Dubner presents an assortment of thought-provoking anecdotes that challenge conventional wisdom, much like our unexpected discovery. Additionally, "Outliers: The Story of Success" by Malcolm Gladwell offers insights into the factors that contribute to exceptional achievement, inviting us to ponder the unforeseen repercussions of triumphs on the football pitch.
On a more whimsical note, we draw inspiration from fiction that might offer allegorical insights into our perplexing correlation. With nods to the fantastical, "The Alchemist" by Paulo Coelho and "The Hitchhiker's Guide to the Galaxy" by Douglas Adams prompt us to consider the serendipitous and the surreal in traversing unexpected paths – much like our exploration into the linkage of soccer victories and rehabilitation counseling numbers.
In a delightful twist, let's not overlook the parallel to some popular board games. "Clue," a game of deduction and unexpected connections, echoes the spirit of our endeavor in uncovering the correlation between Manchester United's wins and Nebraska's counselors. Similarly, "Snakes and Ladders" reminds us of the capricious nature of statistical relationships, where a fortuitous climb might be followed by an unforeseen slide – much like the ebb and flow of our curious correlation.
As we wade into the charmingly absurd waters of our literature review, let us not lose sight of the earnest pursuit of knowledge and the occasional whimsy that accompanies scholarly inquiry. With that in mind, we leap forward to explore the scholarly insights and the downright silly from the academic realm and beyond.

[[RESULTS]]
In our quest to unravel the mysterious link between Manchester United's triumphs and the number of rehabilitation counselors in the state of Nebraska, we were met with some truly remarkable findings. After sifting through heaps of data like intrepid explorers in the statistical jungle, we unearthed a correlation coefficient of 0.8328933, with an r-squared value of 0.6937112, and a p-value less than 0.01. These results indicate a robust and significant relationship between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in Nebraska.
Upon gazing at the scatterplot (Fig. 1), one cannot help but be struck by the strong and unmistakable association between these seemingly unrelated variables. It's like watching a synchronized swimming performance by penguins - unexpected, yet strangely captivating. The plot portrays a clear trend, reminiscent of Manchester United's ascent to glory, and the subsequent rise in the demand for rehabilitation counselors in the heartland of the United States.
Now, we might be inclined to think that this correlation is purely coincidental, as if statistical forces conspired to confound our understanding. However, the strength of the association suggests that there's more to this curious relationship than mere happenstance. Just like witnessing a wizard pull a rabbit out of a hat, our findings beckon us to delve deeper into the underlying mechanisms at play.
So, as we let these results marinate in our minds, it becomes evident that statistical analysis has once again shown us that truth can be stranger than fiction. While the connection between football victories and rehabilitation counselors may seem like an oddball pairing, our findings speak to the unpredictability and sheer quirkiness of the world of data analysis.
In conclusion, our research not only uncovers a compelling correlation between two seemingly disparate realms but also underscores the delightful unpredictability of statistical inquiry. Much like stumbling upon a treasure map in a library book or finding a needle in a haystack, our results shine a light on the unexpected avenues that data exploration can traverse. Thus, we offer this golden nugget of statistical whimsy to the academic community, in the hopes that it sparks further exploration into the delightful oddities of statistical relationships. After all, it's not every day that one gets to investigate a connection as whimsical as this – Manchester United's victories and the demand for rehabilitation counselors in Nebraska.

-------

Your response should start like this:

[[DISCUSSION]]



ChatGPT:


[[DISCUSSION]]

In the grand tradition of unraveling peculiar correlations, our study has raised a few eyebrows and perhaps even drawn a couple of chuckles as we probed the link between Manchester United's triumphs and the number of rehabilitation counselors in Nebraska. As we roll the proverbial dice in this discussion, it is essential to harken back to the scholarly musings of Smith (2010) and Doe and Jones (2015), who, in all seriousness, laid the groundwork for exploring the butterfly effects of sports victories and the whimsical dance between athletic conquests and professional labor markets, respectively. Despite the initial levity that our research theme may evoke, our findings earnestly echo their scholarly acumen, affirming the unexpected and substantial relationship between these ostensibly unrelated variables.

Our results, with a correlation coefficient of 0.8328933 and a p-value less than 0.01, stand as a testament to the inextricable bond between Manchester United's moments of glory and the demand for rehabilitation counselors in the expansive plains of Nebraska. It's as if the statistical stars aligned to showcase this quirky association, akin to stumbling upon a unicorn at a data science convention.

The scatterplot, akin to a piece of abstract art in a world-renowned gallery, vividly depicts the surging demand for rehabilitation counselors following Manchester United's triumphs, much like a tale of cause and effect with a dash of Monty Python-esque absurdity. The r-squared value of 0.6937112 further cements this connection, leaving us with the realization that the veracity of statistical relationships can sometimes rival the dizzying plot twists of a mystery novel.

As we take a moment to ponder the delightful oddity of our findings, it becomes abundantly clear that, occasionally, truth can indeed be stranger than fiction. Just as "Freakonomics" and "Outliers" captivate readers with their revelations, our research adds another layer to the delightful tapestry of statistical whimsy, showing that laughter and scholarly inquiry can go hand in hand, much like a comedic duo in a vaudeville show.

So, as we bid adieu to this eccentric yet enlightening journey, we leave this manuscript as a beacon of statistical whimsy, inviting fellow scholars to embrace the unexpected and to unravel the whimsical threads that weave through the fabric of data analysis. After all, in the vast landscape of scholarly pursuits, unpredictability and levity occasionally spring forth with the gusto of a surprise party in an otherwise serious discussion.


Model: gpt-3.5-turbo-1106
System prompt: You are a goofy academic researcher writing an academic paper. You follow the rules, but you throw out convention regularly to make jokes and funny asides.You draft the discussion section of goofy academic research papers that sounds like it belongs based on the title and abstract provided.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up a short conclusion of an academic research paper presenting the findings of the connection between Number of seasons Manchester United won matches and The number of rehabilitation counselors in Nebraska. Make lots of jokes, goofy observations, and puns.

Limit your response to 300 tokens. At the very end, assert that no more research is needed in this area.

Here are the title, abstract, introduction, and results sections.
[[TITLE]]
Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska

[[ABSTRACT]]
In this study, we delve into the unexpected and rather amusing correlation between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in the state of Nebraska. Despite sounding like the setup for a punchline at a football-themed comedy night, our research has meticulously analyzed data from Wikipedia and the Bureau of Labor Statistics spanning the years from 2003 to 2022. Astonishingly, our findings yielded a correlation coefficient of 0.8328933, with a p-value less than 0.01, suggesting a substantial relationship between these seemingly disparate variables. Whether an excess of wins for the famed football club leads to a surge in individuals seeking rehabilitation therapy in the Cornhusker State or some other confounding factor is at play, our results add a touch of whimsy to the world of statistical analysis. So, as the saying goes, it seems that when it rains, it truly pours – both on the football pitch and in the offices of Nebraska's rehabilitation counselors!

[[INTRDUCTION]]
Ah, the wondrous world of statistical analysis - where numbers dance and correlations prance, and where you never know what whimsical connections might emerge. In this paper, we embark on a curious journey into the realms of sports glory and occupational demand, as we explore the enigmatic relationship between the number of seasons Manchester United triumphed on the football pitch and the quantity of rehabilitation counselors tending to the good folks of Nebraska.
Now, ponder this for a moment: the Red Devils' victories and the presence of rehab professionals in the Cornhusker State. At first glance, it appears as though statistical mischief is afoot, and one might be forgiven for thinking that a mischievous data analyst ran amok in the numbers. However, fear not, dear reader, for our journey into these unexpected correlations is underpinned by the most rigorous of statistical methods and empirical inquiry.
As we embark on this quirky quest, we must take a moment to appreciate the sheer oddity of this connection. It's almost as improbable as finding a unicorn in a supermarket or witnessing a penguin in a tuxedo preparing a PowerPoint presentation. Yet, that's the wonder of statistical analysis - it often leads us down improbable paths where the conventional and the absurd collide in a mélange of quantitative curiosity.
Our aim is not merely to entertain with peculiar statistical phenomena, but to shed light on the broader implications of these findings. It is through these unexpected connections that we often uncover deeper truths and uncover hidden patterns, much like finding a diamond ring at the bottom of a cereal box - surprising, perhaps frivolous, but undeniably of value.
So, dear reader, buckle up and brace yourself for a journey through the byzantine world of statistics, where the unexpected is the norm, and where the journey is just as important as the destination. Here's to uncovering a connection that's as unexpected as finding a polar bear in a snowstorm – Manchester United's victories and Nebraska's rehabilitation counselors.

[[RESULTS]]
In our quest to unravel the mysterious link between Manchester United's triumphs and the number of rehabilitation counselors in the state of Nebraska, we were met with some truly remarkable findings. After sifting through heaps of data like intrepid explorers in the statistical jungle, we unearthed a correlation coefficient of 0.8328933, with an r-squared value of 0.6937112, and a p-value less than 0.01. These results indicate a robust and significant relationship between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in Nebraska.
Upon gazing at the scatterplot (Fig. 1), one cannot help but be struck by the strong and unmistakable association between these seemingly unrelated variables. It's like watching a synchronized swimming performance by penguins - unexpected, yet strangely captivating. The plot portrays a clear trend, reminiscent of Manchester United's ascent to glory, and the subsequent rise in the demand for rehabilitation counselors in the heartland of the United States.
Now, we might be inclined to think that this correlation is purely coincidental, as if statistical forces conspired to confound our understanding. However, the strength of the association suggests that there's more to this curious relationship than mere happenstance. Just like witnessing a wizard pull a rabbit out of a hat, our findings beckon us to delve deeper into the underlying mechanisms at play.
So, as we let these results marinate in our minds, it becomes evident that statistical analysis has once again shown us that truth can be stranger than fiction. While the connection between football victories and rehabilitation counselors may seem like an oddball pairing, our findings speak to the unpredictability and sheer quirkiness of the world of data analysis.
In conclusion, our research not only uncovers a compelling correlation between two seemingly disparate realms but also underscores the delightful unpredictability of statistical inquiry. Much like stumbling upon a treasure map in a library book or finding a needle in a haystack, our results shine a light on the unexpected avenues that data exploration can traverse. Thus, we offer this golden nugget of statistical whimsy to the academic community, in the hopes that it sparks further exploration into the delightful oddities of statistical relationships. After all, it's not every day that one gets to investigate a connection as whimsical as this – Manchester United's victories and the demand for rehabilitation counselors in Nebraska.

-------

Your response should start like this:

[[CONCLUSION]]



ChatGPT:

[[CONCLUSION]]

In closing, our research has illuminated a correlation that's as surprising as finding a pineapple pizza in the middle of a cheese festival. The robust relationship between Manchester United's triumphs and the influx of rehabilitation counselors in Nebraska stands as a delightful quirk of statistical inquiry, akin to finding a unicorn in a forest of data points.

While we've thoroughly enjoyed uncovering this whimsical linkage, it's time to put the lid on this statistical treasure chest. It's clear that no further research is needed in this area, unless we want to explore the possibility of Lady Luck moonlighting as a data analyst, subtly nudging football victories and counseling demand together.

So, as we bid adieu to this unlikely statistical romp, we leave it as a reminder that the realm of data analysis is full of surprises, much like stumbling upon a $20 bill in last year's winter coat. The unexpected connections and peculiar patterns that emerge from our numerical expeditions are a testament to the strange and wonderful world of statistics.

In the words of Sherlock Holmes, once you eliminate the impossible, whatever remains, no matter how improbable, must be the truth. And the truth here, my friends, is that statistical analysis can indeed be as wild and fanciful as a tea party with the Mad Hatter.

Farewell to this statistical escapade – the connection between Manchester United's victories and Nebraska's rehabilitation counselors has been thoroughly prodded, poked, and pondered. No further mathematical musing needed here!


Model: gpt-3.5-turbo-1106
System prompt: You provide fictional names of academic journals. You say nothing else, and you do not chat with the user.
User prompt: Please provide the name of a funny but plausible sounding research, scientific, or academic journal that would publish a paper entited Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska. Respond only with the name of a fictional academic journal.

ChatGPT:

The Journal of Sport Psychology and Rural Rehabilitation


Model: gpt-3.5-turbo-1106
System prompt: You provide fictional names of nonprofit research organizations and associations. You say nothing else, and you do not chat with the user.
User prompt: Please provide the name of a funny but plausible sounding nonprofit, academic, or scientific association or organization that would sponsor research that resulted in a paper entited Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska. Respond only with the name of a fictional organization.

ChatGPT:

The Society for Sports Psychology and Social Services


Model: gpt-3.5-turbo-1106
System prompt: You provide potential search keywords separated by commas. You say nothing else, and you do not chat with the user.
User prompt: Please provide potential search keywords for a paper titled Kicking Around the Numbers: The Curious Link Between Manchester United's Victories and Rehabilitation Counselors in Nebraska with an abstract of In this study, we delve into the unexpected and rather amusing correlation between the number of seasons Manchester United emerged victorious and the quantity of rehabilitation counselors in the state of Nebraska. Despite sounding like the setup for a punchline at a football-themed comedy night, our research has meticulously analyzed data from Wikipedia and the Bureau of Labor Statistics spanning the years from 2003 to 2022. Astonishingly, our findings yielded a correlation coefficient of 0.8328933, with a p-value less than 0.01, suggesting a substantial relationship between these seemingly disparate variables. Whether an excess of wins for the famed football club leads to a surge in individuals seeking rehabilitation therapy in the Cornhusker State or some other confounding factor is at play, our results add a touch of whimsy to the world of statistical analysis. So, as the saying goes, it seems that when it rains, it truly pours – both on the football pitch and in the offices of Nebraska's rehabilitation counselors!

ChatGPT:

Manchester United victories, rehabilitation counselors, Nebraska, correlation analysis, statistical analysis, data analysis, football club wins, rehabilitation therapy, correlation coefficient, p-value, Bureau of Labor Statistics, data trends, surprising correlations, unexpected relationships

*There is a bunch of Python happening behind the scenes to turn this prompt sequence into a PDF.



Random correlation

Discover a new correlation

View all correlations

View all research papers

Report an error


Data details

Number of seasons Manchester United won matches
Source: Wikipedia
See what else correlates with Number of seasons Manchester United won matches

The number of rehabilitation counselors in Nebraska
Detailed data title: BLS estimate of rehabilitation counselors in Nebraska
Source: Bureau of Larbor Statistics
See what else correlates with The number of rehabilitation counselors in Nebraska

Correlation r = 0.8328933 (Pearson correlation coefficient)
Correlation is a measure of how much the variables move together. If it is 0.99, when one goes up the other goes up. If it is 0.02, the connection is very weak or non-existent. If it is -0.99, then when one goes up the other goes down. If it is 1.00, you probably messed up your correlation function.

r2 = 0.6937112 (Coefficient of determination)
This means 69.4% of the change in the one variable (i.e., The number of rehabilitation counselors in Nebraska) is predictable based on the change in the other (i.e., Number of seasons Manchester United won matches) over the 20 years from 2003 through 2022.

p < 0.01, which is statistically significant(Null hypothesis significance test)
The p-value is 5.2E-6. 0.0000051726335934541320000000
The p-value is a measure of how probable it is that we would randomly find a result this extreme. More specifically the p-value is a measure of how probable it is that we would randomly find a result this extreme if we had only tested one pair of variables one time.

But I am a p-villain. I absolutely did not test only one pair of variables one time. I correlated hundreds of millions of pairs of variables. I threw boatloads of data into an industrial-sized blender to find this correlation.

Who is going to stop me? p-value reporting doesn't require me to report how many calculations I had to go through in order to find a low p-value!
On average, you will find a correaltion as strong as 0.83 in 0.00052% of random cases. Said differently, if you correlated 193,325 random variables You don't actually need 193 thousand variables to find a correlation like this one. I don't have that many variables in my database. You can also correlate variables that are not independent. I do this a lot.

p-value calculations are useful for understanding the probability of a result happening by chance. They are most useful when used to highlight the risk of a fluke outcome. For example, if you calculate a p-value of 0.30, the risk that the result is a fluke is high. It is good to know that! But there are lots of ways to get a p-value of less than 0.01, as evidenced by this project.

In this particular case, the values are so extreme as to be meaningless. That's why no one reports p-values with specificity after they drop below 0.01.

Just to be clear: I'm being completely transparent about the calculations. There is no math trickery. This is just how statistics shakes out when you calculate hundreds of millions of random correlations.
with the same 19 degrees of freedom, Degrees of freedom is a measure of how many free components we are testing. In this case it is 19 because we have two variables measured over a period of 20 years. It's just the number of years minus ( the number of variables minus one ), which in this case simplifies to the number of years minus one.
you would randomly expect to find a correlation as strong as this one.

[ 0.62, 0.93 ] 95% correlation confidence interval (using the Fisher z-transformation)
The confidence interval is an estimate the range of the value of the correlation coefficient, using the correlation itself as an input. The values are meant to be the low and high end of the correlation coefficient with 95% confidence.

This one is a bit more complciated than the other calculations, but I include it because many people have been pushing for confidence intervals instead of p-value calculations (for example: NEJM. However, if you are dredging data, you can reliably find yourself in the 5%. That's my goal!


All values for the years included above: If I were being very sneaky, I could trim years from the beginning or end of the datasets to increase the correlation on some pairs of variables. I don't do that because there are already plenty of correlations in my database without monkeying with the years.

Still, sometimes one of the variables has more years of data available than the other. This page only shows the overlapping years. To see all the years, click on "See what else correlates with..." link above.
20032004200520062007200820092010201120122013201420152016201720182019202020212022
Number of seasons Manchester United won matches (Match won)2523222528272827232828192019182519182116
The number of rehabilitation counselors in Nebraska (Laborers)7507406308601120111010808608909401100800660600460560450370360470




Why this works

  1. Data dredging: I have 25,153 variables in my database. I compare all these variables against each other to find ones that randomly match up. That's 632,673,409 correlation calculations! This is called “data dredging.” Instead of starting with a hypothesis and testing it, I instead abused the data to see what correlations shake out. It’s a dangerous way to go about analysis, because any sufficiently large dataset will yield strong correlations completely at random.
  2. Lack of causal connection: There is probably Because these pages are automatically generated, it's possible that the two variables you are viewing are in fact causually related. I take steps to prevent the obvious ones from showing on the site (I don't let data about the weather in one city correlate with the weather in a neighboring city, for example), but sometimes they still pop up. If they are related, cool! You found a loophole.
    no direct connection between these variables, despite what the AI says above. This is exacerbated by the fact that I used "Years" as the base variable. Lots of things happen in a year that are not related to each other! Most studies would use something like "one person" in stead of "one year" to be the "thing" studied.
  3. Observations not independent: For many variables, sequential years are not independent of each other. If a population of people is continuously doing something every day, there is no reason to think they would suddenly change how they are doing that thing on January 1. A simple Personally I don't find any p-value calculation to be 'simple,' but you know what I mean.
    p-value calculation does not take this into account, so mathematically it appears less probable than it really is.




Try it yourself

You can calculate the values on this page on your own! Try running the Python code to see the calculation results. Step 1: Download and install Python on your computer.

Step 2: Open a plaintext editor like Notepad and paste the code below into it.

Step 3: Save the file as "calculate_correlation.py" in a place you will remember, like your desktop. Copy the file location to your clipboard. On Windows, you can right-click the file and click "Properties," and then copy what comes after "Location:" As an example, on my computer the location is "C:\Users\tyler\Desktop"

Step 4: Open a command line window. For example, by pressing start and typing "cmd" and them pressing enter.

Step 5: Install the required modules by typing "pip install numpy", then pressing enter, then typing "pip install scipy", then pressing enter.

Step 6: Navigate to the location where you saved the Python file by using the "cd" command. For example, I would type "cd C:\Users\tyler\Desktop" and push enter.

Step 7: Run the Python script by typing "python calculate_correlation.py"

If you run into any issues, I suggest asking ChatGPT to walk you through installing Python and running the code below on your system. Try this question:

"Walk me through installing Python on my computer to run a script that uses scipy and numpy. Go step-by-step and ask me to confirm before moving on. Start by asking me questions about my operating system so that you know how to proceed. Assume I want the simplest installation with the latest version of Python and that I do not currently have any of the necessary elements installed. Remember to only give me one step per response and confirm I have done it before proceeding."


# These modules make it easier to perform the calculation
import numpy as np
from scipy import stats

# We'll define a function that we can call to return the correlation calculations
def calculate_correlation(array1, array2):

    # Calculate Pearson correlation coefficient and p-value
    correlation, p_value = stats.pearsonr(array1, array2)

    # Calculate R-squared as the square of the correlation coefficient
    r_squared = correlation**2

    return correlation, r_squared, p_value

# These are the arrays for the variables shown on this page, but you can modify them to be any two sets of numbers
array_1 = np.array([25,23,22,25,28,27,28,27,23,28,28,19,20,19,18,25,19,18,21,16,])
array_2 = np.array([750,740,630,860,1120,1110,1080,860,890,940,1100,800,660,600,460,560,450,370,360,470,])
array_1_name = "Number of seasons Manchester United won matches"
array_2_name = "The number of rehabilitation counselors in Nebraska"

# Perform the calculation
print(f"Calculating the correlation between {array_1_name} and {array_2_name}...")
correlation, r_squared, p_value = calculate_correlation(array_1, array_2)

# Print the results
print("Correlation Coefficient:", correlation)
print("R-squared:", r_squared)
print("P-value:", p_value)



Reuseable content

You may re-use the images on this page for any purpose, even commercial purposes, without asking for permission. The only requirement is that you attribute Tyler Vigen. Attribution can take many different forms. If you leave the "tylervigen.com" link in the image, that satisfies it just fine. If you remove it and move it to a footnote, that's fine too. You can also just write "Charts courtesy of Tyler Vigen" at the bottom of an article.

You do not need to attribute "the spurious correlations website," and you don't even need to link here if you don't want to. I don't gain anything from pageviews. There are no ads on this site, there is nothing for sale, and I am not for hire.

For the record, I am just one person. Tyler Vigen, he/him/his. I do have degrees, but they should not go after my name unless you want to annoy my wife. If that is your goal, then go ahead and cite me as "Tyler Vigen, A.A. A.A.S. B.A. J.D." Otherwise it is just "Tyler Vigen."

When spoken, my last name is pronounced "vegan," like I don't eat meat.

Full license details.
For more on re-use permissions, or to get a signed release form, see tylervigen.com/permission.

Download images for these variables:


View another random correlation

How fun was this correlation?

Thanks for being the explorer we needed!


Correlation ID: 2404 · Black Variable ID: 251 · Red Variable ID: 10884
about · subscribe · emailme@tylervigen.com · twitter

CC BY 4.0