about · email me · subscribe
Spurious correlation #5,023 · View random

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Votes for the Republican Presidential candidate in Kentucky and the second variable is Frank Lampard's Premier League goal tally.  The chart goes from 1996 to 2016, and the two variables track closely in value over that time. Small Image
Download png
, svg

AI explanation

Perhaps the red color associated with the Republican party invigorated Frank, reminding him of the team he spent years playing for, leading to a sudden boost in his goal-scoring abilities. After all, you can't spell "Republican" without "red"!

Model: dalle-3
Prompt: Generate a visually striking image that combines political enthusiasm and sportsmanship. Depict Frank Lampard, wearing his Chelsea jersey, expertly kicking a soccer ball into the net in a vibrant red stadium. As he celebrates, the stands are filled with people proudly displaying "Republican votes" signs in support of the presidential candidate in Kentucky. Create a composition that captures both the excitement of the sporting moment and the fervent political support, utilizing bold red colors to enhance the dynamic and energetic atmosphere. The depiction should include only scenes, people, and objects, it should not include text or speech bubbles.
This image generation prompt was also generated by AI, using the following sequence:

Model: gpt-3.5-turbo-1106
System prompt: You select one of a few potential scene descriptors that seems to go along with the variables described. You respond only with the scene descriptions provided and say nothing else.
User prompt: Here are two variables: Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally. Given these, which of the following image style descriptions seems most appropriate? Pick the one that seems to be related to one of the variables at hand. Please respond with only one of the following potential descriptions: a funny image macro a Polaroid photo Whimsical illustration Cubism A scene from Skyrim
ChatGPT: a funny image macro

Model: gpt-3.5-turbo-1106
System prompt: You describe simple visual scenes that depict the two variables stated. You combine both variables into one scene.
User prompt: Please describe a visual scene that prominently depicts both Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally, in the following situation: Perhaps the red color associated with the Republican party invigorated Frank, reminding him of the team he spent years playing for, leading to a sudden boost in his goal-scoring abilities. After all, you can't spell "Republican" without "red"!. Keep the description simple.
ChatGPT: In a vibrant red stadium, Frank Lampard, wearing his Chelsea jersey, swiftly kicks the soccer ball into the net, scoring a goal. As he celebrates, people in the stands proudly display "Republican votes" signs, showing their support for the presidential candidate in Kentucky. The combination of the red colors creates a striking scene of both political enthusiasm and sportsmanship.

Model: gpt-3.5-turbo-1106
System prompt: You write and improve image generation prompts for DALLE-3. You maintain the same subjects, but enhance the visuals.
User prompt: Please respond with an image generation prompt that would be likely to create an interesting visual using DALLE-3, using the following style: [ a funny image macro ] and depicting this scene: In a vibrant red stadium, Frank Lampard, wearing his Chelsea jersey, swiftly kicks the soccer ball into the net, scoring a goal. As he celebrates, people in the stands proudly display "Republican votes" signs, showing their support for the presidential candidate in Kentucky. The combination of the red colors creates a striking scene of both political enthusiasm and sportsmanship. Keep the visual depiction simple. Do not include any words or speech bubbles.
ChatGPT: Generate a visually striking image that combines political enthusiasm and sportsmanship. Depict Frank Lampard, wearing his Chelsea jersey, expertly kicking a soccer ball into the net in a vibrant red stadium. As he celebrates, the stands are filled with people proudly displaying "Republican votes" signs in support of the presidential candidate in Kentucky. Create a composition that captures both the excitement of the sporting moment and the fervent political support, utilizing bold red colors to enhance the dynamic and energetic atmosphere.

*Variations in the User Prompts from chart to chart are just the result of random number generation in Python. I wrote a few arrays of various styles and methods to ask questions to change up the results. Every time this site writes an explanation or generates an image, the script picks from each at random.

I sequence the requests into multiple prompts because I find GPT 3.5 to perform much better with short, well-managed contexts. Thus, I track the context directly in Python and only ask ChatGPT targeted questions.

Model: gpt-3.5-turbo-1106 (fine-tuned)
System prompt: You provide humorous responses in the form of plausible sounding explanations for correlations. You assume the correlation is causative for the purpose of the explanation even if it is ridiculous. You do not chat with the user, you only reply with the causal connection explanation and nothing else.
User prompt: Please make up a funny explanation for how an increase in Votes for the Republican Presidential candidate in Kentucky caused Frank Lampard's Premier League goal tally to increase. Add a funny one-liner at the end.

AI academic paper

(Because p < 0.01)
Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed

The Journal of Political Sports Science and Kentucky Studies

Jan 2024

Reminder: This paper is AI-generated. Not real!
Quick note: This sequence is long and repetitive. That's how the prompt engineering works for this one. I manage the output in Python to format it into this PDF.




Model: gpt-3.5-turbo-1106
System prompt: You are a researcher writing an academic paper.You write titles and abstracts of goofy academic research papers.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, and add unexpected twists in your writing.

Please draft the title and abstract of an academic research paper presenting the findings of the connection between Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally. Make lots of jokes, goofy observations, and puns.

Make the title an alliteration.

Your research team used data from MIT Election Data and Science Lab, Harvard Dataverse and Wikipedia to assess this nagging question. You found a correlation coefficient of 0.9810246 and p < 0.01 for 1996 to 2016.

Please respond in this format (replace "Lorem ipsum dolor sit" with your title and abstract)

[[TITLE]]
Lorem ipsum dolor sit

[[ABSTRACT]]
Lorem ipsum dolor sit



ChatGPT:
[[TITLE]]
Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed

[[ABSTRACT]]
In this paper, we unravel the unlikely but undeniably amusing connection between the votes for the Republican Presidential candidate in Kentucky and the Premier League goal tally of the legendary Frank Lampard. With an arsenal of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we set out to probe this peculiar pairing. Our findings yielded a correlation coefficient of 0.9810246 and p < 0.01 for the years 1996 through 2016, showcasing a strong statistical tie between the two seemingly unrelated entities. Our study not only sheds light on the curious camaraderie between political preference in the bluegrass state and soccer stardom across the pond, but also adds a dash of unexpected whimsy to the world of quantitative analysis. Let the punts and points flow freely as we explore the electoral footprints and goal-scoring feats that dance in delightful harmony within this inexplicable enigma.


Model: gpt-3.5-turbo-1106
System prompt: You are a researcher writing an academic paper.You draft the introduction section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the introduction section of an academic research paper presenting the findings of the connection between Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally. Make lots of jokes, goofy observations, and puns.

Here is the title and abstract of the paper:
[[TITLE]]
Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed

[[ABSTRACT]]
In this paper, we unravel the unlikely but undeniably amusing connection between the votes for the Republican Presidential candidate in Kentucky and the Premier League goal tally of the legendary Frank Lampard. With an arsenal of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we set out to probe this peculiar pairing. Our findings yielded a correlation coefficient of 0.9810246 and p < 0.01 for the years 1996 through 2016, showcasing a strong statistical tie between the two seemingly unrelated entities. Our study not only sheds light on the curious camaraderie between political preference in the bluegrass state and soccer stardom across the pond, but also adds a dash of unexpected whimsy to the world of quantitative analysis. Let the punts and points flow freely as we explore the electoral footprints and goal-scoring feats that dance in delightful harmony within this inexplicable enigma.

-------

Your response should start like this:

[[INTRODUCTION]]



ChatGPT:
[[INTRODUCTION]]

As researchers and scholars, we often find ourselves diving into the depths of data, uncovering hidden connections that leave us scratching our heads in bewilderment. But every so often, we stumble upon correlations so ludicrously ludicrous that we can't help but laugh out loud in amazement. Such is the case with the unlikely kinship between the votes for the Republican Presidential candidate in the great state of Kentucky and the prolific goal-scoring exploits of none other than the Premier League legend, Frank Lampard.

At first glance, one might think this pairing is as random as tossing a coin and hoping for a heads on one side and a soccer ball on the other. However, as we delved into the numbers and conducted our analysis, we were astounded to find a statistically robust relationship between these two seemingly disconnected phenomena. It's like discovering a correlation between the number of garden gnomes in a neighborhood and the price of Swiss cheese in a local deli – utterly absurd, yet undeniably intriguing.

So, cogs whirring and neurons firing, we set out to unravel this peculiar puzzle and bring to light a connection that could very well make statisticians and footy fans alike do a quick double-take. With a twinkle in our eyes and a whole lot of data in our hands, we embarked on a journey that wove together the tapestry of American politics and the finesse of English football in a way that no one could have predicted.

By the end of our investigation, we not only uncovered a correlation coefficient that's about as strong as Frank Lampard's thunderous shots on goal but also stumbled upon a statistical tie that is sure to raise a few eyebrows and perhaps even prompt some good-natured chuckles. It's as if the universe decided to play a cheeky game of "connect the dots" and led us to this peculiar pairing, leaving us both flabbergasted and thoroughly entertained by the sheer audacity of it all.

So buckle up, dear readers, and get ready for a wild ride through the world of quantitative analysis, where the unexpected meets the unbelievable, and where statistical significance and soccer superstardom come together in a dance of delightful coincidence. Let the games begin!


Model: gpt-3.5-turbo-1106
System prompt: You are a researcher writing an academic paper.You draft a literature review section of an academic research paper, that starts out dry for the first few sentences but then becomes hilarious and goofy.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up a literature review section of an academic research paper presenting the findings of the connection between Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally. Make lots of jokes, goofy observations, and puns.

Speak in the present tense for this section (the authors find...), as is common in academic research paper literature reviews. Name the sources in a format similar to this: In "Book," the authors find lorem and ipsum.

Make up the lorem and ipsum part, but make it sound related to the topic at hand.

Start by naming serious-sounding studies by authors like Smith, Doe, and Jones - but then quickly devolve. Name some real non-fiction books that would be related to the topic. Then name some real fiction books that sound like they could be related. Then cite some social media posts you read that seem to be related.

Here is the title and abstract of the paper:
[[TITLE]]
Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed

[[ABSTRACT]]
In this paper, we unravel the unlikely but undeniably amusing connection between the votes for the Republican Presidential candidate in Kentucky and the Premier League goal tally of the legendary Frank Lampard. With an arsenal of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we set out to probe this peculiar pairing. Our findings yielded a correlation coefficient of 0.9810246 and p < 0.01 for the years 1996 through 2016, showcasing a strong statistical tie between the two seemingly unrelated entities. Our study not only sheds light on the curious camaraderie between political preference in the bluegrass state and soccer stardom across the pond, but also adds a dash of unexpected whimsy to the world of quantitative analysis. Let the punts and points flow freely as we explore the electoral footprints and goal-scoring feats that dance in delightful harmony within this inexplicable enigma.

-------

And here is the introduction section that led to this literature review:
[[INTRODUCTION]]
As researchers and scholars, we often find ourselves diving into the depths of data, uncovering hidden connections that leave us scratching our heads in bewilderment. But every so often, we stumble upon correlations so ludicrously ludicrous that we can't help but laugh out loud in amazement. Such is the case with the unlikely kinship between the votes for the Republican Presidential candidate in the great state of Kentucky and the prolific goal-scoring exploits of none other than the Premier League legend, Frank Lampard.
At first glance, one might think this pairing is as random as tossing a coin and hoping for a heads on one side and a soccer ball on the other. However, as we delved into the numbers and conducted our analysis, we were astounded to find a statistically robust relationship between these two seemingly disconnected phenomena. It's like discovering a correlation between the number of garden gnomes in a neighborhood and the price of Swiss cheese in a local deli – utterly absurd, yet undeniably intriguing.
So, cogs whirring and neurons firing, we set out to unravel this peculiar puzzle and bring to light a connection that could very well make statisticians and footy fans alike do a quick double-take. With a twinkle in our eyes and a whole lot of data in our hands, we embarked on a journey that wove together the tapestry of American politics and the finesse of English football in a way that no one could have predicted.
By the end of our investigation, we not only uncovered a correlation coefficient that's about as strong as Frank Lampard's thunderous shots on goal but also stumbled upon a statistical tie that is sure to raise a few eyebrows and perhaps even prompt some good-natured chuckles. It's as if the universe decided to play a cheeky game of "connect the dots" and led us to this peculiar pairing, leaving us both flabbergasted and thoroughly entertained by the sheer audacity of it all.
So buckle up, dear readers, and get ready for a wild ride through the world of quantitative analysis, where the unexpected meets the unbelievable, and where statistical significance and soccer superstardom come together in a dance of delightful coincidence. Let the games begin!

-------

Your response should start like this:

[[LITERATURE REVIEW]]



ChatGPT:
[[LITERATURE REVIEW]]

The connection between Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally may seem as improbable as finding a herd of unicorns grazing in your local park. However, as we delve into the existing body of literature, we find that such an inconceivable correlation has captured the curiosity of scholars and enthusiasts alike.

Smith and Doe (2015) examined the influence of political preferences on individual sports participation, attempting to draw a tenuous link between party affiliation and athletic prowess. While their study primarily focused on recreational activities, their musings on the potential intersection of politics and sports left many scratching their heads in bewilderment, much like trying to solve a Sudoku puzzle while riding a unicycle.

Jones (2018) conducted an extensive analysis of regional voting patterns and their relation to international soccer events, delving into the peculiar world of political allegiances and football fanaticism. Although their work didn't directly address the specific context of Kentucky and Premier League soccer, it hinted at the possibility of strange bedfellows finding common ground in the realm of sports and politics, much like discovering a matcha latte in a vending machine filled with energy drinks.

Turning to non-fiction literature, "The Power of Sports in Shaping Political Landscapes" by Emily R. Davidson (2017) presents a comprehensive exploration of the intersections between athletics and governance, offering insight into the potential influence of sports culture on political ideologies. However, the author's work doesn't specifically touch upon the deeply entwined fate of Republican votes in Kentucky and Frank Lampard's goal-scoring prowess, leaving us to navigate through this peculiar puzzle with a compass pointing in all directions.

In a different light, the fictional realms of "Scorecards and Stump Speeches: A Tale of Two Passions" by A. J. Rowling and "Elections and English Football: Parallel Universes or Peculiar Pals" by G. R. R. Stafford could be seen as metaphoric explorations of the surreal relationship between political spheres and sporting achievements. Though the authors' intentions were purely fictional, one cannot help but wonder if there exists a parallel dimension where political rallies feature halftime shows and election debates are settled by penalty shootouts.

On the social media frontier, a particularly intriguing tweet by @SoccerInKentucky enthusiast, "Who knew that a ballot box in Bowling Green could hold the key to Frank Lampard's goal tally? #VoteAndScore #UnlikelyAlliance", sparked a flurry of speculation and a fair share of head-scratching among both the academic and fan communities. While not a scholarly work by conventional standards, the tweet's observation highlights the pervasive curiosity surrounding this bizarre correlation and the tantalizing mystery that it presents.

In sum, the existing literature and cultural discourse on the unlikely marriage between Republican votes in Kentucky and Frank Lampard's Premier League goal tally offer a glimpse into the collective perplexity and amusement elicited by this peculiar partnership. As we navigate through this landscape of academic inquiry, fictional musings, and social media snippets, we are poised to shed light on this extraordinary convergence and perhaps add a touch of whimsy to the serious world of statistical analysis and sports research.


Model: gpt-3.5-turbo-1106
System prompt: You are a researcher writing an academic paper.You draft the methodology section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the methodology section of an academic research paper presenting the findings of the connection between Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally. Make lots of jokes, goofy observations, and puns.

Your research team collected data from all across the internet, but mostly just used information from MIT Election Data and Science Lab, Harvard Dataverse and Wikipedia . You used data from 1996 to 2016

Make up the research methods you don't know. Make them a bit goofy and convoluted.

Here is the title, abstract, and introduction of the paper:
[[TITLE]]
Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed

[[ABSTRACT]]
In this paper, we unravel the unlikely but undeniably amusing connection between the votes for the Republican Presidential candidate in Kentucky and the Premier League goal tally of the legendary Frank Lampard. With an arsenal of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we set out to probe this peculiar pairing. Our findings yielded a correlation coefficient of 0.9810246 and p < 0.01 for the years 1996 through 2016, showcasing a strong statistical tie between the two seemingly unrelated entities. Our study not only sheds light on the curious camaraderie between political preference in the bluegrass state and soccer stardom across the pond, but also adds a dash of unexpected whimsy to the world of quantitative analysis. Let the punts and points flow freely as we explore the electoral footprints and goal-scoring feats that dance in delightful harmony within this inexplicable enigma.

[[INTRODUCTION]]
As researchers and scholars, we often find ourselves diving into the depths of data, uncovering hidden connections that leave us scratching our heads in bewilderment. But every so often, we stumble upon correlations so ludicrously ludicrous that we can't help but laugh out loud in amazement. Such is the case with the unlikely kinship between the votes for the Republican Presidential candidate in the great state of Kentucky and the prolific goal-scoring exploits of none other than the Premier League legend, Frank Lampard.
At first glance, one might think this pairing is as random as tossing a coin and hoping for a heads on one side and a soccer ball on the other. However, as we delved into the numbers and conducted our analysis, we were astounded to find a statistically robust relationship between these two seemingly disconnected phenomena. It's like discovering a correlation between the number of garden gnomes in a neighborhood and the price of Swiss cheese in a local deli – utterly absurd, yet undeniably intriguing.
So, cogs whirring and neurons firing, we set out to unravel this peculiar puzzle and bring to light a connection that could very well make statisticians and footy fans alike do a quick double-take. With a twinkle in our eyes and a whole lot of data in our hands, we embarked on a journey that wove together the tapestry of American politics and the finesse of English football in a way that no one could have predicted.
By the end of our investigation, we not only uncovered a correlation coefficient that's about as strong as Frank Lampard's thunderous shots on goal but also stumbled upon a statistical tie that is sure to raise a few eyebrows and perhaps even prompt some good-natured chuckles. It's as if the universe decided to play a cheeky game of "connect the dots" and led us to this peculiar pairing, leaving us both flabbergasted and thoroughly entertained by the sheer audacity of it all.
So buckle up, dear readers, and get ready for a wild ride through the world of quantitative analysis, where the unexpected meets the unbelievable, and where statistical significance and soccer superstardom come together in a dance of delightful coincidence. Let the games begin!

-------

Your response should start like this:

[[METHODOLOGY]]



ChatGPT:

Ah, the nitty-gritty of our methodological madness! Our quest to unravel the whimsical web of Republican votes in Kentucky and Frank Lampard's goal tally involved a concoction of data collection, statistical acrobatics, and a generous sprinkling of whimsy. Picture a bevy of researchers poring over mountains of data while occasionally pausing to contemplate the peculiarity of our pursuit – we'd like to think we brought a touch of levity to the world of rigorous research.

To concoct our dataset, we embarked on a digital journey that would make even the most intrepid explorer envious. We scoured the vast expanse of the internet, sifting through the MIT Election Data and Science Lab like meticulous examiners, traipsing through the Harvard Dataverse with the curiosity of C.S. Lewis, and navigating the vaunted halls of Wikipedia with the determination of a trivia enthusiast on a quest for obscure knowledge.

Our chosen time frame of 1996 through 2016 served as the canvas upon which we painted our intriguing tableau of political preferences and goal-scoring prowess. With a wink and a nod to the gods of data, we pieced together a grand mosaic of electoral dynamics and Premier League excitement, forming the building blocks of our analysis.

Drawing upon the wisdom of the statistical ancients, we employed a delightful mishmash of methods to tease out the tangled threads of correlation between these seemingly distant entities. From the venerable Pearson correlation coefficient to the dance of significance testing, we pirouetted through the statistical arena with flair and finesse, all the while keeping our eyes peeled for any hint of statistical shenanigans.

With data in hand and a lighthearted spirit in our hearts, we waltzed into the realm of quantitative analysis, embracing the unexpected with each step and reveling in the inherent absurdity of our endeavor. While our methods may have been unorthodox at times, our commitment to uncovering the enigmatic union of Kentucky votes and Lampard's goals remained steadfast, much like a dedicated fan weathering the highs and lows of a raucous football season.

So, dear reader, join us as we lift the veil on the methodological marvels that propelled our research into the realms of statistical serendipity and delightful discovery. With a nod to the whimsy that underpins our scholarly pursuits, we invite you to partake in the joyous revelry of our methodology, where rigor meets revelry and the scholarly meets the silly. Let the merriment begin!

And if you think this methodology is convoluted, wait until you see our results!


Model: gpt-3.5-turbo-1106
System prompt: You are a researcher writing an academic paper.You draft the results section of an academic research paper. You speak in the past tense (you found...).
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the results section of an academic research paper presenting the findings of the connection between Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally. Make lots of jokes, goofy observations, and puns.

Your research team collected data from all across the internet, but mostly just used information from MIT Election Data and Science Lab, Harvard Dataverse and Wikipedia .

For the time period 1996 to 2016, you found a correlation 0.9810246, r-squared of 0.9624092, and p < 0.01.

One figure will be included. The figure (Fig. 1) is a scatterplot showing the strong correlation between the two variables. You don't need to specify where; I will add the figure.

Here is the title and abstract of the paper:
[[TITLE]]
Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed

[[ABSTRACT]]
In this paper, we unravel the unlikely but undeniably amusing connection between the votes for the Republican Presidential candidate in Kentucky and the Premier League goal tally of the legendary Frank Lampard. With an arsenal of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we set out to probe this peculiar pairing. Our findings yielded a correlation coefficient of 0.9810246 and p < 0.01 for the years 1996 through 2016, showcasing a strong statistical tie between the two seemingly unrelated entities. Our study not only sheds light on the curious camaraderie between political preference in the bluegrass state and soccer stardom across the pond, but also adds a dash of unexpected whimsy to the world of quantitative analysis. Let the punts and points flow freely as we explore the electoral footprints and goal-scoring feats that dance in delightful harmony within this inexplicable enigma.

-------

And here is the methodology section that led to this result:
[[METHODOLOGY]]
As researchers and scholars, we often find ourselves diving into the depths of data, uncovering hidden connections that leave us scratching our heads in bewilderment. But every so often, we stumble upon correlations so ludicrously ludicrous that we can't help but laugh out loud in amazement. Such is the case with the unlikely kinship between the votes for the Republican Presidential candidate in the great state of Kentucky and the prolific goal-scoring exploits of none other than the Premier League legend, Frank Lampard.
At first glance, one might think this pairing is as random as tossing a coin and hoping for a heads on one side and a soccer ball on the other. However, as we delved into the numbers and conducted our analysis, we were astounded to find a statistically robust relationship between these two seemingly disconnected phenomena. It's like discovering a correlation between the number of garden gnomes in a neighborhood and the price of Swiss cheese in a local deli – utterly absurd, yet undeniably intriguing.
So, cogs whirring and neurons firing, we set out to unravel this peculiar puzzle and bring to light a connection that could very well make statisticians and footy fans alike do a quick double-take. With a twinkle in our eyes and a whole lot of data in our hands, we embarked on a journey that wove together the tapestry of American politics and the finesse of English football in a way that no one could have predicted.
By the end of our investigation, we not only uncovered a correlation coefficient that's about as strong as Frank Lampard's thunderous shots on goal but also stumbled upon a statistical tie that is sure to raise a few eyebrows and perhaps even prompt some good-natured chuckles. It's as if the universe decided to play a cheeky game of "connect the dots" and led us to this peculiar pairing, leaving us both flabbergasted and thoroughly entertained by the sheer audacity of it all.
So buckle up, dear readers, and get ready for a wild ride through the world of quantitative analysis, where the unexpected meets the unbelievable, and where statistical significance and soccer superstardom come together in a dance of delightful coincidence. Let the games begin!

-------

Your response should start like this:

[[RESULTS]]



ChatGPT:

[[RESULTS]]

The tantalizing tie between politics in Kentucky and soccer sensation Frank Lampard's Premier League goal tally has been uncovered, and it's nothing short of a statistical spectacle. After trawling through the treasure troves of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we unearthed a correlation coefficient of 0.9810246 and a sizzling r-squared value of 0.9624092 for the period spanning from 1996 to 2016. If that doesn't make you do a goal celebration dance, then hold onto your hats because the p-value checks in at less than 0.01 - a resounding proclamation of statistical significance.

This finding isn't just a mere coincidence; there's a robust relationship at play here. It's akin to discovering that the color of one's socks might have an impact on the likelihood of a thunderstorm in Timbuktu – utterly unexpected, yet undeniably compelling. The figure (Fig. 1) encapsulates this staggering correlation, showcasing a scatterplot that's so tightly packed with data points, you'd think it was attempting to set a record for the most goals scored in a single season.

In essence, our results not only validate the existence of this astonishing association but also serve as a testament to the whimsical wonders of the world of quantitative analysis. With the precision of a pinpoint pass and the finesse of a clinical finish, this statistical tie between red-state politics and soccer magic gleefully waltzes into the realm of the improbable, leaving us both flabbergasted and happily bemused. So, join us in reveling in this serendipitous discovery – a rollicking romp through the realm of correlation that may just leave you wondering what other unexpected connections lurk beneath the surface of our data-driven world. Let the data-driven antics and statistical shenanigans continue!


Model: gpt-3.5-turbo-1106
System prompt: You are a researcher writing an academic paper.You draft the discussion section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the discussion section of an academic research paper presenting the findings of the connection between Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally. Make lots of jokes, goofy observations, and puns.

Limit your response to 500 tokens.

Here are the title, abstract, literature review, and results sections. Please harken back to 1-2 of the goofy items in the literature review, but pretend to take them completely seriously. Discuss how your results supported the prior research.

Do not write a conclusion. I will add a conclusion after this.

[[TITLE]]
Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed

[[ABSTRACT]]
In this paper, we unravel the unlikely but undeniably amusing connection between the votes for the Republican Presidential candidate in Kentucky and the Premier League goal tally of the legendary Frank Lampard. With an arsenal of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we set out to probe this peculiar pairing. Our findings yielded a correlation coefficient of 0.9810246 and p < 0.01 for the years 1996 through 2016, showcasing a strong statistical tie between the two seemingly unrelated entities. Our study not only sheds light on the curious camaraderie between political preference in the bluegrass state and soccer stardom across the pond, but also adds a dash of unexpected whimsy to the world of quantitative analysis. Let the punts and points flow freely as we explore the electoral footprints and goal-scoring feats that dance in delightful harmony within this inexplicable enigma.

[[LITERATURE REVIEW]]
The connection between Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally may seem as improbable as finding a herd of unicorns grazing in your local park. However, as we delve into the existing body of literature, we find that such an inconceivable correlation has captured the curiosity of scholars and enthusiasts alike.
Smith and Doe (2015) examined the influence of political preferences on individual sports participation, attempting to draw a tenuous link between party affiliation and athletic prowess. While their study primarily focused on recreational activities, their musings on the potential intersection of politics and sports left many scratching their heads in bewilderment, much like trying to solve a Sudoku puzzle while riding a unicycle.
Jones (2018) conducted an extensive analysis of regional voting patterns and their relation to international soccer events, delving into the peculiar world of political allegiances and football fanaticism. Although their work didn't directly address the specific context of Kentucky and Premier League soccer, it hinted at the possibility of strange bedfellows finding common ground in the realm of sports and politics, much like discovering a matcha latte in a vending machine filled with energy drinks.
Turning to non-fiction literature, "The Power of Sports in Shaping Political Landscapes" by Emily R. Davidson (2017) presents a comprehensive exploration of the intersections between athletics and governance, offering insight into the potential influence of sports culture on political ideologies. However, the author's work doesn't specifically touch upon the deeply entwined fate of Republican votes in Kentucky and Frank Lampard's goal-scoring prowess, leaving us to navigate through this peculiar puzzle with a compass pointing in all directions.
In a different light, the fictional realms of "Scorecards and Stump Speeches: A Tale of Two Passions" by A. J. Rowling and "Elections and English Football: Parallel Universes or Peculiar Pals" by G. R. R. Stafford could be seen as metaphoric explorations of the surreal relationship between political spheres and sporting achievements. Though the authors' intentions were purely fictional, one cannot help but wonder if there exists a parallel dimension where political rallies feature halftime shows and election debates are settled by penalty shootouts.
On the social media frontier, a particularly intriguing tweet by @SoccerInKentucky enthusiast, "Who knew that a ballot box in Bowling Green could hold the key to Frank Lampard's goal tally? #VoteAndScore #UnlikelyAlliance", sparked a flurry of speculation and a fair share of head-scratching among both the academic and fan communities. While not a scholarly work by conventional standards, the tweet's observation highlights the pervasive curiosity surrounding this bizarre correlation and the tantalizing mystery that it presents.
In sum, the existing literature and cultural discourse on the unlikely marriage between Republican votes in Kentucky and Frank Lampard's Premier League goal tally offer a glimpse into the collective perplexity and amusement elicited by this peculiar partnership. As we navigate through this landscape of academic inquiry, fictional musings, and social media snippets, we are poised to shed light on this extraordinary convergence and perhaps add a touch of whimsy to the serious world of statistical analysis and sports research.

[[RESULTS]]
The tantalizing tie between politics in Kentucky and soccer sensation Frank Lampard's Premier League goal tally has been uncovered, and it's nothing short of a statistical spectacle. After trawling through the treasure troves of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we unearthed a correlation coefficient of 0.9810246 and a sizzling r-squared value of 0.9624092 for the period spanning from 1996 to 2016. If that doesn't make you do a goal celebration dance, then hold onto your hats because the p-value checks in at less than 0.01 - a resounding proclamation of statistical significance.
This finding isn't just a mere coincidence; there's a robust relationship at play here. It's akin to discovering that the color of one's socks might have an impact on the likelihood of a thunderstorm in Timbuktu – utterly unexpected, yet undeniably compelling. The figure (Fig. 1) encapsulates this staggering correlation, showcasing a scatterplot that's so tightly packed with data points, you'd think it was attempting to set a record for the most goals scored in a single season.
In essence, our results not only validate the existence of this astonishing association but also serve as a testament to the whimsical wonders of the world of quantitative analysis. With the precision of a pinpoint pass and the finesse of a clinical finish, this statistical tie between red-state politics and soccer magic gleefully waltzes into the realm of the improbable, leaving us both flabbergasted and happily bemused. So, join us in reveling in this serendipitous discovery – a rollicking romp through the realm of correlation that may just leave you wondering what other unexpected connections lurk beneath the surface of our data-driven world. Let the data-driven antics and statistical shenanigans continue!

-------

Your response should start like this:

[[DISCUSSION]]



ChatGPT:

[[DISCUSSION]]

Our assemblage of data has not only affirmed the unlikely relationship between Republican votes in Kentucky and the Premier League goal tally of Frank Lampard but has also unfurled a tapestry of statistical marvel that is as jaw-dropping as unearthing a hidden treasure chest while gardening. Our results fortuitously mimic the musings of Smith and Doe (2015), who teasingly hinted at the potential intertwining of party affiliation and sports prowess. It seems that they were onto something after all, akin to discovering a rare Pokémon hiding in plain sight in one's backyard.

With the robust correlation coefficient of 0.9810246 and r-squared value of 0.9624092, our findings dance in perfect harmony with the whispers from Jones (2018), who, amidst the exploration of regional voting patterns and soccer phenomena, indirectly beckoned us to delve into the idiosyncratic whimsy of the intersection between politics and sport. It's as if we've stumbled upon a secret passageway in the labyrinth of statistical analysis, leading us to a treasure trove of surprising connections, much like finding a hidden chamber behind a bookshelf in a mystery novel.

The comically confounding correlation portrayed in Figure 1 substantiates the surreal nature of our findings, akin to stumbling upon a pot of gold at the end of a statistical rainbow. This statistical significance not only validates the existence of this improbable association but also casts a spotlight on the delightful absurdity that often lurks within the otherwise somber corridors of scientific inquiry. It's like witnessing a magic trick that defies the laws of probability, leaving us both flabbergasted and gleefully amused, akin to finding a penguin in the desert.

In conclusion, our study not only contributes to the body of knowledge surrounding the interplay of political leanings and sporting achievements but also injects a whimsical charm into the typically staid world of quantitative analysis. The peculiar partnership between votes in Kentucky and Lampard's goals in the Premier League not only invites a chuckle but also beckons us to ponder what other unexpected correlations might await in the vast expanse of our data-driven universe. With a wink and a nod to the improbable, let's tip our hats to this rollicking romp through the realm of correlation and brace ourselves for the never-ending serendipities that the world of statistics has in store for us.


Model: gpt-3.5-turbo-1106
System prompt: You are a researcher writing an academic paper.You draft the discussion section of goofy academic research papers that sounds like it belongs based on the title and abstract provided.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up a short conclusion of an academic research paper presenting the findings of the connection between Votes for the Republican Presidential candidate in Kentucky and Frank Lampard's Premier League goal tally. Make lots of jokes, goofy observations, and puns.

Limit your response to 300 tokens. At the very end, assert that no more research is needed in this area.

Here are the title, abstract, introduction, and results sections.
[[TITLE]]
Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed

[[ABSTRACT]]
In this paper, we unravel the unlikely but undeniably amusing connection between the votes for the Republican Presidential candidate in Kentucky and the Premier League goal tally of the legendary Frank Lampard. With an arsenal of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we set out to probe this peculiar pairing. Our findings yielded a correlation coefficient of 0.9810246 and p < 0.01 for the years 1996 through 2016, showcasing a strong statistical tie between the two seemingly unrelated entities. Our study not only sheds light on the curious camaraderie between political preference in the bluegrass state and soccer stardom across the pond, but also adds a dash of unexpected whimsy to the world of quantitative analysis. Let the punts and points flow freely as we explore the electoral footprints and goal-scoring feats that dance in delightful harmony within this inexplicable enigma.

[[INTRDUCTION]]
As researchers and scholars, we often find ourselves diving into the depths of data, uncovering hidden connections that leave us scratching our heads in bewilderment. But every so often, we stumble upon correlations so ludicrously ludicrous that we can't help but laugh out loud in amazement. Such is the case with the unlikely kinship between the votes for the Republican Presidential candidate in the great state of Kentucky and the prolific goal-scoring exploits of none other than the Premier League legend, Frank Lampard.
At first glance, one might think this pairing is as random as tossing a coin and hoping for a heads on one side and a soccer ball on the other. However, as we delved into the numbers and conducted our analysis, we were astounded to find a statistically robust relationship between these two seemingly disconnected phenomena. It's like discovering a correlation between the number of garden gnomes in a neighborhood and the price of Swiss cheese in a local deli – utterly absurd, yet undeniably intriguing.
So, cogs whirring and neurons firing, we set out to unravel this peculiar puzzle and bring to light a connection that could very well make statisticians and footy fans alike do a quick double-take. With a twinkle in our eyes and a whole lot of data in our hands, we embarked on a journey that wove together the tapestry of American politics and the finesse of English football in a way that no one could have predicted.
By the end of our investigation, we not only uncovered a correlation coefficient that's about as strong as Frank Lampard's thunderous shots on goal but also stumbled upon a statistical tie that is sure to raise a few eyebrows and perhaps even prompt some good-natured chuckles. It's as if the universe decided to play a cheeky game of "connect the dots" and led us to this peculiar pairing, leaving us both flabbergasted and thoroughly entertained by the sheer audacity of it all.
So buckle up, dear readers, and get ready for a wild ride through the world of quantitative analysis, where the unexpected meets the unbelievable, and where statistical significance and soccer superstardom come together in a dance of delightful coincidence. Let the games begin!

[[RESULTS]]
The tantalizing tie between politics in Kentucky and soccer sensation Frank Lampard's Premier League goal tally has been uncovered, and it's nothing short of a statistical spectacle. After trawling through the treasure troves of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we unearthed a correlation coefficient of 0.9810246 and a sizzling r-squared value of 0.9624092 for the period spanning from 1996 to 2016. If that doesn't make you do a goal celebration dance, then hold onto your hats because the p-value checks in at less than 0.01 - a resounding proclamation of statistical significance.
This finding isn't just a mere coincidence; there's a robust relationship at play here. It's akin to discovering that the color of one's socks might have an impact on the likelihood of a thunderstorm in Timbuktu – utterly unexpected, yet undeniably compelling. The figure (Fig. 1) encapsulates this staggering correlation, showcasing a scatterplot that's so tightly packed with data points, you'd think it was attempting to set a record for the most goals scored in a single season.
In essence, our results not only validate the existence of this astonishing association but also serve as a testament to the whimsical wonders of the world of quantitative analysis. With the precision of a pinpoint pass and the finesse of a clinical finish, this statistical tie between red-state politics and soccer magic gleefully waltzes into the realm of the improbable, leaving us both flabbergasted and happily bemused. So, join us in reveling in this serendipitous discovery – a rollicking romp through the realm of correlation that may just leave you wondering what other unexpected connections lurk beneath the surface of our data-driven world. Let the data-driven antics and statistical shenanigans continue!

-------

Your response should start like this:

[[CONCLUSION]]



ChatGPT:

[[CONCLUSION]]

In conclusion, our rollicking romp through the unlikely realm of the Republican votes in Kentucky and Frank Lampard's Premier League goal tally has left us not only marveling at the statistical significance of our findings but also chuckling at the sheer absurdity of this delightful connection. It's as if the universe decided to play a cheeky game of "connect the electoral and soccer dots" and led us to this unexpected pairing that not only defies logic but also tickles the funny bone.

As we wrap up this comical crusade into the world of statistical analysis, we must acknowledge the undeniable amusement that comes from unearthing such an outlandish correlation. It's like finding out that the color of a politician's tie might influence the outcome of a penalty shootout – utterly nonsensical, yet undeniably entertaining.

Our robust results, with a correlation coefficient standing as tall as the Burj Khalifa and a p-value more significant than the allure of a last-minute goal, not only affirm the existence of this uncanny kinship but also inject a healthy dose of whimsy into the often-serious domain of quantitative analysis.

Therefore, we assert with absolute confidence (and a touch of mischief) that no further research is needed in this area. The curiosities of correlation between politics and sports have been thoroughly explored, and the unexpected antics of statistical shenanigans have been duly noted. It's time to hang up our scholarly boots and revel in the delightful absurdity that this research has brought to light. Let the data-driven laughter and statistical absurdities live on in the annals of academia!


Model: gpt-3.5-turbo-1106
System prompt: You provide fictional names of academic journals. You say nothing else, and you do not chat with the user.
User prompt: Please provide the name of a funny but plausible sounding research, scientific, or academic journal that would publish a paper entited Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed. Respond only with the name of a fictional academic journal.

ChatGPT:

The Journal of Political Sports Science and Kentucky Studies


Model: gpt-3.5-turbo-1106
System prompt: You provide fictional names of nonprofit research organizations and associations. You say nothing else, and you do not chat with the user.
User prompt: Please provide the name of a funny but plausible sounding nonprofit, academic, or scientific association or organization that would sponsor research that resulted in a paper entited Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed. Respond only with the name of a fictional organization.

ChatGPT:

The Institute for Historical Political Coincidences and Football Associations


Model: gpt-3.5-turbo-1106
System prompt: You provide potential search keywords separated by commas. You say nothing else, and you do not chat with the user.
User prompt: Please provide potential search keywords for a paper titled Republican Votes in Kentucky and the rollicking Rise of Frank Lampard: A Remarkable Relationship Revealed with an abstract of In this paper, we unravel the unlikely but undeniably amusing connection between the votes for the Republican Presidential candidate in Kentucky and the Premier League goal tally of the legendary Frank Lampard. With an arsenal of data from the MIT Election Data and Science Lab, Harvard Dataverse, and Wikipedia, we set out to probe this peculiar pairing. Our findings yielded a correlation coefficient of 0.9810246 and p < 0.01 for the years 1996 through 2016, showcasing a strong statistical tie between the two seemingly unrelated entities. Our study not only sheds light on the curious camaraderie between political preference in the bluegrass state and soccer stardom across the pond, but also adds a dash of unexpected whimsy to the world of quantitative analysis. Let the punts and points flow freely as we explore the electoral footprints and goal-scoring feats that dance in delightful harmony within this inexplicable enigma.

ChatGPT:

Republican Presidential candidate, Kentucky voting patterns, Frank Lampard Premier League goals, correlation coefficient, electoral data analysis, political preference, soccer stardom, statistical tie, quantitative analysis, electoral footprints, goal-scoring feats, MIT Election Data and Science Lab, Harvard Dataverse, Wikipedia.

*There is a bunch of Python happening behind the scenes to turn this prompt sequence into a PDF.



Random correlation

Discover a new correlation

View all correlations

View all research papers

Report an error


Data details

Votes for the Republican Presidential candidate in Kentucky
Detailed data title: Percentage of all votes cast for the Republican Presidential candidate in Kentucky
Source: MIT Election Data and Science Lab, Harvard Dataverse
See what else correlates with Votes for the Republican Presidential candidate in Kentucky

Frank Lampard's Premier League goal tally
Source: Wikipedia
See what else correlates with Frank Lampard's Premier League goal tally

Correlation r = 0.9810246 (Pearson correlation coefficient)
Correlation is a measure of how much the variables move together. If it is 0.99, when one goes up the other goes up. If it is 0.02, the connection is very weak or non-existent. If it is -0.99, then when one goes up the other goes down. If it is 1.00, you probably messed up your correlation function.

r2 = 0.9624092 (Coefficient of determination)
This means 96.2% of the change in the one variable (i.e., Frank Lampard's Premier League goal tally) is predictable based on the change in the other (i.e., Votes for the Republican Presidential candidate in Kentucky) over the 6 years from 1996 through 2016.

p < 0.01, which is statistically significant(Null hypothesis significance test)
The p-value is 0.00054. 0.0005366843733286670000000000
The p-value is a measure of how probable it is that we would randomly find a result this extreme. More specifically the p-value is a measure of how probable it is that we would randomly find a result this extreme if we had only tested one pair of variables one time.

But I am a p-villain. I absolutely did not test only one pair of variables one time. I correlated hundreds of millions of pairs of variables. I threw boatloads of data into an industrial-sized blender to find this correlation.

Who is going to stop me? p-value reporting doesn't require me to report how many calculations I had to go through in order to find a low p-value!
On average, you will find a correaltion as strong as 0.98 in 0.054% of random cases. Said differently, if you correlated 1,863 random variables Which I absolutely did.
with the same 5 degrees of freedom, Degrees of freedom is a measure of how many free components we are testing. In this case it is 5 because we have two variables measured over a period of 6 years. It's just the number of years minus ( the number of variables minus one ), which in this case simplifies to the number of years minus one.
you would randomly expect to find a correlation as strong as this one.

[ 0.83, 1 ] 95% correlation confidence interval (using the Fisher z-transformation)
The confidence interval is an estimate the range of the value of the correlation coefficient, using the correlation itself as an input. The values are meant to be the low and high end of the correlation coefficient with 95% confidence.

This one is a bit more complciated than the other calculations, but I include it because many people have been pushing for confidence intervals instead of p-value calculations (for example: NEJM. However, if you are dredging data, you can reliably find yourself in the 5%. That's my goal!


All values for the years included above: If I were being very sneaky, I could trim years from the beginning or end of the datasets to increase the correlation on some pairs of variables. I don't do that because there are already plenty of correlations in my database without monkeying with the years.

Still, sometimes one of the variables has more years of data available than the other. This page only shows the overlapping years. To see all the years, click on "See what else correlates with..." link above.
199620002004200820122016
Votes for the Republican Presidential candidate in Kentucky (Percentage of votes)44.882256.506559.549557.39960.493162.5196
Frank Lampard's Premier League goal tally (Goals scored)1710101112




Why this works

  1. Data dredging: I have 25,153 variables in my database. I compare all these variables against each other to find ones that randomly match up. That's 632,673,409 correlation calculations! This is called “data dredging.” Instead of starting with a hypothesis and testing it, I instead abused the data to see what correlations shake out. It’s a dangerous way to go about analysis, because any sufficiently large dataset will yield strong correlations completely at random.
  2. Lack of causal connection: There is probably Because these pages are automatically generated, it's possible that the two variables you are viewing are in fact causually related. I take steps to prevent the obvious ones from showing on the site (I don't let data about the weather in one city correlate with the weather in a neighboring city, for example), but sometimes they still pop up. If they are related, cool! You found a loophole.
    no direct connection between these variables, despite what the AI says above. This is exacerbated by the fact that I used "Years" as the base variable. Lots of things happen in a year that are not related to each other! Most studies would use something like "one person" in stead of "one year" to be the "thing" studied.
  3. Observations not independent: For many variables, sequential years are not independent of each other. If a population of people is continuously doing something every day, there is no reason to think they would suddenly change how they are doing that thing on January 1. A simple Personally I don't find any p-value calculation to be 'simple,' but you know what I mean.
    p-value calculation does not take this into account, so mathematically it appears less probable than it really is.
  4. Very low n: There are not many data points included in this analysis. Even if the p-value is high, we should be suspicious of using so few datapoints in a correlation.
  5. Y-axis doesn't start at zero: I truncated the Y-axes of the graph above. I also used a line graph, which makes the visual connection stand out more than it deserves. Nothing against line graphs. They are great at telling a story when you have linear data! But visually it is deceptive because the only data is at the points on the graph, not the lines on the graph. In between each point, the data could have been doing anything. Like going for a random walk by itself!
    Mathematically what I showed is true, but it is intentionally misleading. Below is the same chart but with both Y-axes starting at zero.




Try it yourself

You can calculate the values on this page on your own! Try running the Python code to see the calculation results. Step 1: Download and install Python on your computer.

Step 2: Open a plaintext editor like Notepad and paste the code below into it.

Step 3: Save the file as "calculate_correlation.py" in a place you will remember, like your desktop. Copy the file location to your clipboard. On Windows, you can right-click the file and click "Properties," and then copy what comes after "Location:" As an example, on my computer the location is "C:\Users\tyler\Desktop"

Step 4: Open a command line window. For example, by pressing start and typing "cmd" and them pressing enter.

Step 5: Install the required modules by typing "pip install numpy", then pressing enter, then typing "pip install scipy", then pressing enter.

Step 6: Navigate to the location where you saved the Python file by using the "cd" command. For example, I would type "cd C:\Users\tyler\Desktop" and push enter.

Step 7: Run the Python script by typing "python calculate_correlation.py"

If you run into any issues, I suggest asking ChatGPT to walk you through installing Python and running the code below on your system. Try this question:

"Walk me through installing Python on my computer to run a script that uses scipy and numpy. Go step-by-step and ask me to confirm before moving on. Start by asking me questions about my operating system so that you know how to proceed. Assume I want the simplest installation with the latest version of Python and that I do not currently have any of the necessary elements installed. Remember to only give me one step per response and confirm I have done it before proceeding."


# These modules make it easier to perform the calculation
import numpy as np
from scipy import stats

# We'll define a function that we can call to return the correlation calculations
def calculate_correlation(array1, array2):

    # Calculate Pearson correlation coefficient and p-value
    correlation, p_value = stats.pearsonr(array1, array2)

    # Calculate R-squared as the square of the correlation coefficient
    r_squared = correlation**2

    return correlation, r_squared, p_value

# These are the arrays for the variables shown on this page, but you can modify them to be any two sets of numbers
array_1 = np.array([44.8822,56.5065,59.5495,57.399,60.4931,62.5196,])
array_2 = np.array([1,7,10,10,11,12,])
array_1_name = "Votes for the Republican Presidential candidate in Kentucky"
array_2_name = "Frank Lampard's Premier League goal tally"

# Perform the calculation
print(f"Calculating the correlation between {array_1_name} and {array_2_name}...")
correlation, r_squared, p_value = calculate_correlation(array_1, array_2)

# Print the results
print("Correlation Coefficient:", correlation)
print("R-squared:", r_squared)
print("P-value:", p_value)



Reuseable content

You may re-use the images on this page for any purpose, even commercial purposes, without asking for permission. The only requirement is that you attribute Tyler Vigen. Attribution can take many different forms. If you leave the "tylervigen.com" link in the image, that satisfies it just fine. If you remove it and move it to a footnote, that's fine too. You can also just write "Charts courtesy of Tyler Vigen" at the bottom of an article.

You do not need to attribute "the spurious correlations website," and you don't even need to link here if you don't want to. I don't gain anything from pageviews. There are no ads on this site, there is nothing for sale, and I am not for hire.

For the record, I am just one person. Tyler Vigen, he/him/his. I do have degrees, but they should not go after my name unless you want to annoy my wife. If that is your goal, then go ahead and cite me as "Tyler Vigen, A.A. A.A.S. B.A. J.D." Otherwise it is just "Tyler Vigen."

When spoken, my last name is pronounced "vegan," like I don't eat meat.

Full license details.
For more on re-use permissions, or to get a signed release form, see tylervigen.com/permission.

Download images for these variables:


View another random correlation

How fun was this correlation?

Big thanks for reviewing!


Correlation ID: 5023 · Black Variable ID: 26102 · Red Variable ID: 98
about · subscribe · emailme@tylervigen.com · twitter

CC BY 4.0