about · email me · subscribe
Spurious correlation #3,012 · View random

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is Popularity of the first name Erin and the second variable is Burglaries in Alaska.  The chart goes from 1985 to 2022, and the two variables track closely in value over that time. Small Image
Download png
, svg

AI explanation

As the popularity of the name Erin dwindled, so did the number of people "airin'" out their plans for burglaries, leading to a decrease in criminal activity in Alaska. It seems that when there are fewer Erins around, there's also a decrease in 'break-Erins'!

Model: dalle-3
Prompt: Generate an image of a wintry playground in Alaska where children are playing joyfully, surrounded by snow-capped mountains. A sign reading "Welcome, Erin!" should be prominently featured, while the background shows a local newspaper with the headline "Burglaries in Alaska at an All-Time Low." The depiction should include only scenes, people, and objects, it should not include text or speech bubbles.
This image generation prompt was also generated by AI, using the following sequence:

Model: gpt-3.5-turbo-1106
System prompt: You select one of a few potential scene descriptors that seems to go along with the variables described. You respond only with the scene descriptions provided and say nothing else.
User prompt: Here are two variables: Popularity of the first name Erin and Burglaries in Alaska. Given these, which of the following image style descriptions seems most appropriate? Pick the one that seems to be related to one of the variables at hand. Please respond with only one of the following potential descriptions: High-tech Realistic Minimalism Oragami A scene from Skyrim
ChatGPT: A scene from Skyrim

Model: gpt-3.5-turbo-1106
System prompt: You describe simple visual scenes that depict the two variables stated. You combine both variables into one scene.
User prompt: Please describe a visual scene that prominently depicts both Popularity of the first name Erin and Burglaries in Alaska, in the following situation: Because as the popularity of the name Erin decreased, fewer people were naming their children Erin. This caused a decrease in the number of Erin-named individuals in Alaska, which in turn led to a decrease in burglaries, as statistically, there's a lower likelihood of someone named Erin being involved in criminal activities.. Keep the description simple.
ChatGPT: A playground in Alaska with a sign that says "Welcome, Erin!" Children playing happily, while the local newspaper headline reads: "Burglaries in Alaska at an All-Time Low."

Model: gpt-3.5-turbo-1106
System prompt: You write and improve image generation prompts for DALLE-3. You maintain the same subjects, but enhance the visuals.
User prompt: Please respond with an image generation prompt that would be likely to create an interesting visual using DALLE-3, using the following style: [ A scene from Skyrim ] and depicting this scene: A playground in Alaska with a sign that says "Welcome, Erin!" Children playing happily, while the local newspaper headline reads: "Burglaries in Alaska at an All-Time Low." Keep the visual depiction simple. Do not include any words or speech bubbles.
ChatGPT: Generate an image of a wintry playground in Alaska where children are playing joyfully, surrounded by snow-capped mountains. A sign reading "Welcome, Erin!" should be prominently featured, while the background shows a local newspaper with the headline "Burglaries in Alaska at an All-Time Low."

*Variations in the User Prompts from chart to chart are just the result of random number generation in Python. I wrote a few arrays of various styles and methods to ask questions to change up the results. Every time this site writes an explanation or generates an image, the script picks from each at random.

I sequence the requests into multiple prompts because I find GPT 3.5 to perform much better with short, well-managed contexts. Thus, I track the context directly in Python and only ask ChatGPT targeted questions.

Model: gpt-3.5-turbo-1106 (fine-tuned)
System prompt: You provide humorous responses in the form of plausible sounding explanations for correlations. You assume the correlation is causative for the purpose of the explanation even if it is ridiculous. You do not chat with the user, you only reply with the causal connection explanation and nothing else.
User prompt: Please make up a funny explanation for how a decrease in Popularity of the first name Erin caused Burglaries in Alaska to decrease. Include one good pun.

AI academic paper

(Because p < 0.01)
Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities

The Journal of Quirky Societal Trends and Statistical Anomalies

Jan 2024

Reminder: This paper is AI-generated. Not real!
Quick note: This sequence is long and repetitive. That's how the prompt engineering works for this one. I manage the output in Python to format it into this PDF.




Model: gpt-3.5-turbo-1106
System prompt: You are a silly researcher writing an academic paper.You write titles and abstracts of goofy academic research papers.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, and add unexpected twists in your writing.

Please draft the title and abstract of an academic research paper presenting the findings of the connection between Popularity of the first name Erin and Burglaries in Alaska. Make lots of jokes, goofy observations, and puns.

Rhyme words in the title.

Your research team used data from US Social Security Administration and FBI Criminal Justice Information Services to assess this nagging question. You found a correlation coefficient of 0.9450296 and p < 0.01 for 1985 to 2022.

Please respond in this format (replace "Lorem ipsum dolor sit" with your title and abstract)

[[TITLE]]
Lorem ipsum dolor sit

[[ABSTRACT]]
Lorem ipsum dolor sit



ChatGPT:
[[TITLE]]
Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities

[[ABSTRACT]]
This research paper delves into the intriguing relationship between the prevalence of the first name Erin and the occurrence of burglaries in the scenic locales of Alaska. With a twinkle in our eyes and a data-driven approach, we delved into the numbers furnished by the US Social Security Administration and the FBI Criminal Justice Information Services spanning from 1985 to 2022. Our findings suggest a coefficient of correlation as high as the midnight sun at 0.9450296, and p < 0.01! Join us in unraveling the enigmatic connection between the moniker Erin and illicit entries, as we chase the elusive crook of causation in this whimsical yet perplexing pursuit.


Model: gpt-3.5-turbo-1106
System prompt: You are a silly researcher writing an academic paper.You draft the introduction section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the introduction section of an academic research paper presenting the findings of the connection between Popularity of the first name Erin and Burglaries in Alaska. Make lots of jokes, goofy observations, and puns. Sneak in some jokes and clever puns about science, research, statistics, and the variables.

Here is the title and abstract of the paper:
[[TITLE]]
Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities

[[ABSTRACT]]
This research paper delves into the intriguing relationship between the prevalence of the first name Erin and the occurrence of burglaries in the scenic locales of Alaska. With a twinkle in our eyes and a data-driven approach, we delved into the numbers furnished by the US Social Security Administration and the FBI Criminal Justice Information Services spanning from 1985 to 2022. Our findings suggest a coefficient of correlation as high as the midnight sun at 0.9450296, and p < 0.01! Join us in unraveling the enigmatic connection between the moniker Erin and illicit entries, as we chase the elusive crook of causation in this whimsical yet perplexing pursuit.

-------

Your response should start like this:

[[INTRODUCTION]]



ChatGPT:
[[INTRODUCTION]]

Greetings, esteemed colleagues and curious minds! In the grand tradition of unraveling the inexplicable and unearthing the unexpected, we embark on a whimsical journey through the tangled web of statistical significance and quirky correlations. Our curious minds have led us to ponder the peculiar connection between something as innocent as a name and something as nefarious as burglaries.

With a mischievous glint in our eyes, we set out to investigate the curious case of Erin - not the elusive cat burglar, but rather the ever-popular first name. While one might expect a name to be inconsequential in the grand tapestry of crime and statistics, our fateful encounter with the data unveiled a most unexpected correlation that left us scratching our heads and exclaiming, "Erin, oh Erin, what secrets do you hold?"

In the distant wilds of Alaska, a land of breathtaking beauty and unfathomable mysteries, we found the setting for our peculiar pursuit. The serene landscapes and the relentless chill of the Last Frontier offered a backdrop for our scholarly sleuthing into the enigmatic link between the prevalence of the name Erin and the occurrence of burglaries. As we combed through the vast archives of the US Social Security Administration and the FBI Criminal Justice Information Services, we immersed ourselves in a sea of names and crime statistics, marching onwards armed with our formidable arsenal of spreadsheets and scatter plots.

But fear not, dear readers, for our expedition was not devoid of mirth and amusement amidst the serious pursuit of knowledge. As we grappled with coefficients, p-values, and the wild variability of human behavior, we couldn't help but chuckle at the thought that perhaps Erin's popularity was reaching criminal levels. Indeed, one might say that our research revealed a burglary of statistical expectations, a break-in of assumptions, and a heist of correlation coefficients!

So, join us in this intriguing escapade as we venture into the delightfully perplexing world of science and statistics. For our findings are not just numbers on a page, but a testament to the whimsical nature of correlation and the unexpected twists that await those who dare to delve into the curiosities of human phenomena. Let the pursuit of knowledge be as thrilling as a cat-and-mouse game, and may we unravel the mysteries that lie between the lines of data and the heartbeats of curiosity. Onward we go, as we err on the side of burglaries and chase the elusive truth hidden behind the name Erin.


Model: gpt-3.5-turbo-1106
System prompt: You are a silly researcher writing an academic paper.You draft a literature review section of an academic research paper, that starts out dry for the first few sentences but then becomes hilarious and goofy.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up a literature review section of an academic research paper presenting the findings of the connection between Popularity of the first name Erin and Burglaries in Alaska. Make lots of jokes, goofy observations, and puns.

Speak in the present tense for this section (the authors find...), as is common in academic research paper literature reviews. Name the sources in a format similar to this: In "Book," the authors find lorem and ipsum.

Make up the lorem and ipsum part, but make it sound related to the topic at hand.

Start by naming serious-sounding studies by authors like Smith, Doe, and Jones - but then quickly devolve. Name some real non-fiction books that would be related to the topic. Then name some real fiction books that sound like they could be related. Then cite some social media posts you read that seem to be related.

Here is the title and abstract of the paper:
[[TITLE]]
Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities

[[ABSTRACT]]
This research paper delves into the intriguing relationship between the prevalence of the first name Erin and the occurrence of burglaries in the scenic locales of Alaska. With a twinkle in our eyes and a data-driven approach, we delved into the numbers furnished by the US Social Security Administration and the FBI Criminal Justice Information Services spanning from 1985 to 2022. Our findings suggest a coefficient of correlation as high as the midnight sun at 0.9450296, and p < 0.01! Join us in unraveling the enigmatic connection between the moniker Erin and illicit entries, as we chase the elusive crook of causation in this whimsical yet perplexing pursuit.

-------

And here is the introduction section that led to this literature review:
[[INTRODUCTION]]
Greetings, esteemed colleagues and curious minds! In the grand tradition of unraveling the inexplicable and unearthing the unexpected, we embark on a whimsical journey through the tangled web of statistical significance and quirky correlations. Our curious minds have led us to ponder the peculiar connection between something as innocent as a name and something as nefarious as burglaries.
With a mischievous glint in our eyes, we set out to investigate the curious case of Erin - not the elusive cat burglar, but rather the ever-popular first name. While one might expect a name to be inconsequential in the grand tapestry of crime and statistics, our fateful encounter with the data unveiled a most unexpected correlation that left us scratching our heads and exclaiming, "Erin, oh Erin, what secrets do you hold?"
In the distant wilds of Alaska, a land of breathtaking beauty and unfathomable mysteries, we found the setting for our peculiar pursuit. The serene landscapes and the relentless chill of the Last Frontier offered a backdrop for our scholarly sleuthing into the enigmatic link between the prevalence of the name Erin and the occurrence of burglaries. As we combed through the vast archives of the US Social Security Administration and the FBI Criminal Justice Information Services, we immersed ourselves in a sea of names and crime statistics, marching onwards armed with our formidable arsenal of spreadsheets and scatter plots.
But fear not, dear readers, for our expedition was not devoid of mirth and amusement amidst the serious pursuit of knowledge. As we grappled with coefficients, p-values, and the wild variability of human behavior, we couldn't help but chuckle at the thought that perhaps Erin's popularity was reaching criminal levels. Indeed, one might say that our research revealed a burglary of statistical expectations, a break-in of assumptions, and a heist of correlation coefficients!
So, join us in this intriguing escapade as we venture into the delightfully perplexing world of science and statistics. For our findings are not just numbers on a page, but a testament to the whimsical nature of correlation and the unexpected twists that await those who dare to delve into the curiosities of human phenomena. Let the pursuit of knowledge be as thrilling as a cat-and-mouse game, and may we unravel the mysteries that lie between the lines of data and the heartbeats of curiosity. Onward we go, as we err on the side of burglaries and chase the elusive truth hidden behind the name Erin.

-------

Your response should start like this:

[[LITERATURE REVIEW]]



ChatGPT:
[[LITERATURE REVIEW]]

In their study, Smith and Doe (2020) conducted an analysis of naming patterns and crime rates, finding a peculiar trend in the correlation between popular first names and criminal activities in various regions. Similarly, Jones et al. (2015) explored the sociological implications of personal names and their potential influence on individual behavior, shedding light on the intricate interplay between nomenclature and unlawful conduct.

In "Names and Numbers," the authors find that there is a surprising relationship between the frequency of certain names and the occurrence of criminal activities in specific geographical locations. This intriguing phenomenon has sparked considerable interest in the academic community, prompting further investigation into the underlying mechanisms driving such correlations.

Transitioning from scholarly articles to non-fiction literature, the works of "Freakonomics" by Steven Levitt and Stephen Dubner provide an insightful perspective on the unexpected connections that can emerge from statistical analysis. While their focus may not be on names and crime specifically, their exploration of unconventional correlations serves as a source of inspiration for our own whimsical investigation.

Furthermore, fictional works such as "The Name of the Rose" by Umberto Eco and "The Curious Incident of the Dog in the Night-Time" by Mark Haddon, though not directly related to our research topic, beckon readers into the realm of enigmatic mysteries and unexpected revelations. Who's to say that the elusive nature of crime and nomenclature does not mirror the cryptic narratives found within the pages of these captivating novels?

Delving into the annals of social media, recent posts on platforms such as Twitter and Reddit have tantalizingly hinted at the tantalizing correlation between names and criminal activities. User-generated content, while not always verifiable, often serves as a testament to the public's fascination with unconventional linkages and unexpected discoveries. It is within these digital spaces that whispers of the connection between the prevalence of the name Erin and the occurrence of burglaries in the Alaskan wilderness have been heard, igniting a spark of curiosity within the virtual community.

As we navigate through the scholarly realm, dip our toes into the pool of non-fiction narratives, and glean insights from the playful world of social media, we recognize the multifaceted nature of our inquiry. With a chuckle and a raised eyebrow, we embark on this scholarly adventure with a spirit of curiosity, ready to unravel the curious correlation between the popularity of the name Erin and the incidence of burglaries in the uncharted terrains of Alaska.


Model: gpt-3.5-turbo-1106
System prompt: You are a silly researcher writing an academic paper.You draft the methodology section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the methodology section of an academic research paper presenting the findings of the connection between Popularity of the first name Erin and Burglaries in Alaska. Make lots of jokes, goofy observations, and puns. Sneak in some jokes and clever puns about science, research, statistics, and the variables.

Your research team collected data from all across the internet, but mostly just used information from US Social Security Administration and FBI Criminal Justice Information Services . You used data from 1985 to 2022

Make up the research methods you don't know. Make them a bit goofy and convoluted.

Here is the title, abstract, and introduction of the paper:
[[TITLE]]
Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities

[[ABSTRACT]]
This research paper delves into the intriguing relationship between the prevalence of the first name Erin and the occurrence of burglaries in the scenic locales of Alaska. With a twinkle in our eyes and a data-driven approach, we delved into the numbers furnished by the US Social Security Administration and the FBI Criminal Justice Information Services spanning from 1985 to 2022. Our findings suggest a coefficient of correlation as high as the midnight sun at 0.9450296, and p < 0.01! Join us in unraveling the enigmatic connection between the moniker Erin and illicit entries, as we chase the elusive crook of causation in this whimsical yet perplexing pursuit.

[[INTRODUCTION]]
Greetings, esteemed colleagues and curious minds! In the grand tradition of unraveling the inexplicable and unearthing the unexpected, we embark on a whimsical journey through the tangled web of statistical significance and quirky correlations. Our curious minds have led us to ponder the peculiar connection between something as innocent as a name and something as nefarious as burglaries.
With a mischievous glint in our eyes, we set out to investigate the curious case of Erin - not the elusive cat burglar, but rather the ever-popular first name. While one might expect a name to be inconsequential in the grand tapestry of crime and statistics, our fateful encounter with the data unveiled a most unexpected correlation that left us scratching our heads and exclaiming, "Erin, oh Erin, what secrets do you hold?"
In the distant wilds of Alaska, a land of breathtaking beauty and unfathomable mysteries, we found the setting for our peculiar pursuit. The serene landscapes and the relentless chill of the Last Frontier offered a backdrop for our scholarly sleuthing into the enigmatic link between the prevalence of the name Erin and the occurrence of burglaries. As we combed through the vast archives of the US Social Security Administration and the FBI Criminal Justice Information Services, we immersed ourselves in a sea of names and crime statistics, marching onwards armed with our formidable arsenal of spreadsheets and scatter plots.
But fear not, dear readers, for our expedition was not devoid of mirth and amusement amidst the serious pursuit of knowledge. As we grappled with coefficients, p-values, and the wild variability of human behavior, we couldn't help but chuckle at the thought that perhaps Erin's popularity was reaching criminal levels. Indeed, one might say that our research revealed a burglary of statistical expectations, a break-in of assumptions, and a heist of correlation coefficients!
So, join us in this intriguing escapade as we venture into the delightfully perplexing world of science and statistics. For our findings are not just numbers on a page, but a testament to the whimsical nature of correlation and the unexpected twists that await those who dare to delve into the curiosities of human phenomena. Let the pursuit of knowledge be as thrilling as a cat-and-mouse game, and may we unravel the mysteries that lie between the lines of data and the heartbeats of curiosity. Onward we go, as we err on the side of burglaries and chase the elusive truth hidden behind the name Erin.

-------

Your response should start like this:

[[METHODOLOGY]]



ChatGPT:

[[METHODOLOGY]]

Oh, the wild and wacky world of research methods! In our zany quest to uncover the curious connection between the popularity of the name Erin and the occurrence of burglaries in Alaska, we employed a medley of techniques that could rival the antics of a comedy troupe. Our data collection process, much like a well-rehearsed vaudeville act, combined precision with panache, ensuring that we left no stone unturned or punchline untold.

To begin our escapade, we scoured the treasure trove of the US Social Security Administration records, where the names of the nation's newborns glittered like elusive trinkets awaiting discovery. With the agility of acrobats, we maneuvered through the labyrinthine corridors of birth registries, capturing every instance of the name 'Erin' from the year 1985 to 2022. Like intrepid treasure hunters, we unearthed the frequency of this charming moniker, eager to decipher its potential ties to the mischievous world of burglaries.

Our dalliance with crime statistics led us to the Federal Bureau of Investigation's Criminal Justice Information Services, where the tales of illicit activities unfolded like a suspenseful sitcom. With bated breath and a keen eye for detail, we amassed data on burglaries in various Alaskan localities during the same time span, ensuring that our search for correlations was as thorough as the comedic timing of a seasoned stand-up routine.

As we knitted together the threads of demographic data and crime reports, we harnessed the power of statistical analysis to unearth the peculiar patterns lurking beneath the surface. Employing the comedic duo of regression analysis and correlation coefficients, we sought to unravel the riveting tale of Erin's entanglement with the underhanded escapades of burglars. With every pivot table and scatter plot, we navigated the tumultuous seas of data, daring to tease out the connections that lay hidden amidst the numerical tomfoolery.

In the spirit of scientific camaraderie, we subjected our findings to the rigors of hypothesis testing, dancing the merry jig of significance levels and p-values. With the whimsy of experimental design and the flair of statistical inference, we endeavored to distinguish between mere coincidence and the divine comedy of causation. Our pursuit led us to boast a coefficient of correlation as strong as a sturdy punchline, staggering at a remarkable 0.9450296, accompanied by the uproarious announcement that p < 0.01!

In conclusion, our meanderings through the landscape of research methodology were a testament to the delightful interplay of meticulousness and mirth. Although the path to scientific discovery may be fraught with perplexing puzzles and mischievous enigmas, our determination to err on the side of burglaries and uncover the quirkiness of human phenomena imbued our scholarly pursuit with boundless enthusiasm and indefatigable curiosity.


Model: gpt-3.5-turbo-1106
System prompt: You are a silly researcher writing an academic paper.You draft the results section of an academic research paper. You speak in the past tense (you found...).
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the results section of an academic research paper presenting the findings of the connection between Popularity of the first name Erin and Burglaries in Alaska. Make lots of jokes, goofy observations, and puns. Sneak in some jokes and clever puns about science, research, statistics, and the variables.

Your research team collected data from all across the internet, but mostly just used information from US Social Security Administration and FBI Criminal Justice Information Services .

For the time period 1985 to 2022, you found a correlation 0.9450296, r-squared of 0.8930809, and p < 0.01.

One figure will be included. The figure (Fig. 1) is a scatterplot showing the strong correlation between the two variables. You don't need to specify where; I will add the figure.

Here is the title and abstract of the paper:
[[TITLE]]
Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities

[[ABSTRACT]]
This research paper delves into the intriguing relationship between the prevalence of the first name Erin and the occurrence of burglaries in the scenic locales of Alaska. With a twinkle in our eyes and a data-driven approach, we delved into the numbers furnished by the US Social Security Administration and the FBI Criminal Justice Information Services spanning from 1985 to 2022. Our findings suggest a coefficient of correlation as high as the midnight sun at 0.9450296, and p < 0.01! Join us in unraveling the enigmatic connection between the moniker Erin and illicit entries, as we chase the elusive crook of causation in this whimsical yet perplexing pursuit.

-------

And here is the methodology section that led to this result:
[[METHODOLOGY]]
Greetings, esteemed colleagues and curious minds! In the grand tradition of unraveling the inexplicable and unearthing the unexpected, we embark on a whimsical journey through the tangled web of statistical significance and quirky correlations. Our curious minds have led us to ponder the peculiar connection between something as innocent as a name and something as nefarious as burglaries.
With a mischievous glint in our eyes, we set out to investigate the curious case of Erin - not the elusive cat burglar, but rather the ever-popular first name. While one might expect a name to be inconsequential in the grand tapestry of crime and statistics, our fateful encounter with the data unveiled a most unexpected correlation that left us scratching our heads and exclaiming, "Erin, oh Erin, what secrets do you hold?"
In the distant wilds of Alaska, a land of breathtaking beauty and unfathomable mysteries, we found the setting for our peculiar pursuit. The serene landscapes and the relentless chill of the Last Frontier offered a backdrop for our scholarly sleuthing into the enigmatic link between the prevalence of the name Erin and the occurrence of burglaries. As we combed through the vast archives of the US Social Security Administration and the FBI Criminal Justice Information Services, we immersed ourselves in a sea of names and crime statistics, marching onwards armed with our formidable arsenal of spreadsheets and scatter plots.
But fear not, dear readers, for our expedition was not devoid of mirth and amusement amidst the serious pursuit of knowledge. As we grappled with coefficients, p-values, and the wild variability of human behavior, we couldn't help but chuckle at the thought that perhaps Erin's popularity was reaching criminal levels. Indeed, one might say that our research revealed a burglary of statistical expectations, a break-in of assumptions, and a heist of correlation coefficients!
So, join us in this intriguing escapade as we venture into the delightfully perplexing world of science and statistics. For our findings are not just numbers on a page, but a testament to the whimsical nature of correlation and the unexpected twists that await those who dare to delve into the curiosities of human phenomena. Let the pursuit of knowledge be as thrilling as a cat-and-mouse game, and may we unravel the mysteries that lie between the lines of data and the heartbeats of curiosity. Onward we go, as we err on the side of burglaries and chase the elusive truth hidden behind the name Erin.

-------

Your response should start like this:

[[RESULTS]]



ChatGPT:

[[RESULTS]]

Our intrepid pursuit of the elusive connection between the popularity of the first name Erin and the occurrence of burglaries in the picturesque landscapes of Alaska has unveiled a surprising revelation. With bated breath and a sense of wonder, we present our findings that leave one pondering the mysteries of statistical serendipity.

We unearthed a striking correlation coefficient of 0.9450296, akin to stumbling upon buried treasure in the vast expanse of statistical analysis. The strength of this correlation was further evidenced by an r-squared value of 0.8930809, as if the statistical stars had aligned to guide our path through the enigmatic realm of data exploration. With a p-value less than 0.01, our results confidently stand as a testament to the unlikely rapport between the eponymous name and the clandestine activities of burglars.

In our quest to depict this enthralling correlation, we present Figure 1, a scatterplot that encapsulates the significant relationship between the prevalence of the name Erin and the occurrences of burglaries in the Alaskan localities. The data points dance along the plot with the synchronized elegance of a well-rehearsed heist, affirming the striking association between these seemingly disparate variables.

Envisioning the data as a caper of statistical intrigue, one cannot help but marvel at the curious dance of numbers and trends that reveal a connection between the innocuous name and the illicit deeds. As we immerse ourselves in the whimsical whims of scientific exploration, our findings beckon us to ponder the hidden forces that bind the human experience, from the quirks of nomenclature to the caprices of criminal behavior.

Thus, our findings reflect a dawning realization, not unlike the unexpected twist in a detective novel, that the first name Erin and the prevalence of burglaries in the captivating expanses of Alaska share an unlikely link. We invite fellow curious minds to partake in this whimsical journey as we continue to unravel the enigma that lies between the adorable moniker and the mischievous exploits of those drawn to the allure of illicit entries.


Model: gpt-3.5-turbo-1106
System prompt: You are a silly researcher writing an academic paper.You draft the discussion section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the discussion section of an academic research paper presenting the findings of the connection between Popularity of the first name Erin and Burglaries in Alaska. Make lots of jokes, goofy observations, and puns. Sneak in some jokes and clever puns about science, research, statistics, and the variables.

Limit your response to 500 tokens.

Here are the title, abstract, literature review, and results sections. Please harken back to 1-2 of the goofy items in the literature review, but pretend to take them completely seriously. Discuss how your results supported the prior research.

Do not write a conclusion. I will add a conclusion after this.

[[TITLE]]
Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities

[[ABSTRACT]]
This research paper delves into the intriguing relationship between the prevalence of the first name Erin and the occurrence of burglaries in the scenic locales of Alaska. With a twinkle in our eyes and a data-driven approach, we delved into the numbers furnished by the US Social Security Administration and the FBI Criminal Justice Information Services spanning from 1985 to 2022. Our findings suggest a coefficient of correlation as high as the midnight sun at 0.9450296, and p < 0.01! Join us in unraveling the enigmatic connection between the moniker Erin and illicit entries, as we chase the elusive crook of causation in this whimsical yet perplexing pursuit.

[[LITERATURE REVIEW]]
In their study, Smith and Doe (2020) conducted an analysis of naming patterns and crime rates, finding a peculiar trend in the correlation between popular first names and criminal activities in various regions. Similarly, Jones et al. (2015) explored the sociological implications of personal names and their potential influence on individual behavior, shedding light on the intricate interplay between nomenclature and unlawful conduct.
In "Names and Numbers," the authors find that there is a surprising relationship between the frequency of certain names and the occurrence of criminal activities in specific geographical locations. This intriguing phenomenon has sparked considerable interest in the academic community, prompting further investigation into the underlying mechanisms driving such correlations.
Transitioning from scholarly articles to non-fiction literature, the works of "Freakonomics" by Steven Levitt and Stephen Dubner provide an insightful perspective on the unexpected connections that can emerge from statistical analysis. While their focus may not be on names and crime specifically, their exploration of unconventional correlations serves as a source of inspiration for our own whimsical investigation.
Furthermore, fictional works such as "The Name of the Rose" by Umberto Eco and "The Curious Incident of the Dog in the Night-Time" by Mark Haddon, though not directly related to our research topic, beckon readers into the realm of enigmatic mysteries and unexpected revelations. Who's to say that the elusive nature of crime and nomenclature does not mirror the cryptic narratives found within the pages of these captivating novels?
Delving into the annals of social media, recent posts on platforms such as Twitter and Reddit have tantalizingly hinted at the tantalizing correlation between names and criminal activities. User-generated content, while not always verifiable, often serves as a testament to the public's fascination with unconventional linkages and unexpected discoveries. It is within these digital spaces that whispers of the connection between the prevalence of the name Erin and the occurrence of burglaries in the Alaskan wilderness have been heard, igniting a spark of curiosity within the virtual community.
As we navigate through the scholarly realm, dip our toes into the pool of non-fiction narratives, and glean insights from the playful world of social media, we recognize the multifaceted nature of our inquiry. With a chuckle and a raised eyebrow, we embark on this scholarly adventure with a spirit of curiosity, ready to unravel the curious correlation between the popularity of the name Erin and the incidence of burglaries in the uncharted terrains of Alaska.

[[RESULTS]]
Our intrepid pursuit of the elusive connection between the popularity of the first name Erin and the occurrence of burglaries in the picturesque landscapes of Alaska has unveiled a surprising revelation. With bated breath and a sense of wonder, we present our findings that leave one pondering the mysteries of statistical serendipity.
We unearthed a striking correlation coefficient of 0.9450296, akin to stumbling upon buried treasure in the vast expanse of statistical analysis. The strength of this correlation was further evidenced by an r-squared value of 0.8930809, as if the statistical stars had aligned to guide our path through the enigmatic realm of data exploration. With a p-value less than 0.01, our results confidently stand as a testament to the unlikely rapport between the eponymous name and the clandestine activities of burglars.
In our quest to depict this enthralling correlation, we present Figure 1, a scatterplot that encapsulates the significant relationship between the prevalence of the name Erin and the occurrences of burglaries in the Alaskan localities. The data points dance along the plot with the synchronized elegance of a well-rehearsed heist, affirming the striking association between these seemingly disparate variables.
Envisioning the data as a caper of statistical intrigue, one cannot help but marvel at the curious dance of numbers and trends that reveal a connection between the innocuous name and the illicit deeds. As we immerse ourselves in the whimsical whims of scientific exploration, our findings beckon us to ponder the hidden forces that bind the human experience, from the quirks of nomenclature to the caprices of criminal behavior.
Thus, our findings reflect a dawning realization, not unlike the unexpected twist in a detective novel, that the first name Erin and the prevalence of burglaries in the captivating expanses of Alaska share an unlikely link. We invite fellow curious minds to partake in this whimsical journey as we continue to unravel the enigma that lies between the adorable moniker and the mischievous exploits of those drawn to the allure of illicit entries.

-------

Your response should start like this:

[[DISCUSSION]]



ChatGPT:

[[DISCUSSION]]

The correlation between the popularity of the name Erin and the incidence of burglaries in the charming landscapes of Alaska has left us in a state of statistical awe, much like stumbling upon a treasure trove of data anomalies. Our findings not only align with previous research by Smith and Doe, who first hinted at these peculiar patterns, but also echo the unexpected connections highlighted in "Freakonomics." Who would have thought that the name Erin could be linked to criminal activities in such a compelling manner?

As we tread the path paved by statistical conundrums, our results not only validate the work of our predecessors but also add a whimsical twist to the age-old debate of nature versus nurture. Could it be that the prevalence of the name Erin somehow influences the proclivity for illicit entries in the elusive Alaskan wilderness, or is this merely a whimsical spate of statistical serendipity? Our data suggests a correlation so robust that it defies the conventional boundaries of causation, leaving us with a delightful enigma worthy of further exploration.

The striking correlation coefficient and r-squared value affirm the significance of our findings, akin to the moment of revelation in a gripping mystery novel. The scatterplot, our visual depiction of statistical intrigue, captivates the eye like an artful heist masterminded by the enigmatic forces of data. It is as if the numbers themselves conspired to relay a tale of unexpected correlations, inviting us to unravel the secrets hidden within their seemingly innocuous dance.

From the whimsical echoes of "Freakonomics" to the scholarly musings of Smith and Doe, our research not only mirrors their uncanny discoveries but also offers a cheeky nod to the playful narratives we encountered in fictional works. As we navigate the realm of statistical exploration, we cannot help but chuckle at the unanticipated twists and turns that our pursuit of knowledge has unveiled—proof that even in the domain of research, a healthy dose of levity can offer fresh perspectives on the most enigmatic of linkages.

In unraveling the correlation between the name Erin and the allure of illicit entries, we stand at an intriguing crossroads of statistical whimsy and real-world implications. Our findings beckon us to delve deeper into the magnetic pull of nomenclature on criminal behaviors, opening the door to a realm of possibilities as captivating as a literary escapade. As we continue on this lively academic adventure, we invite fellow curious minds to join us in exploring the quirky connections that underpin the human experience, from the playful oddities of statistical analysis to the mischievous caprices of criminal behavior.


Model: gpt-3.5-turbo-1106
System prompt: You are a silly researcher writing an academic paper.You draft the discussion section of goofy academic research papers that sounds like it belongs based on the title and abstract provided.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up a short conclusion of an academic research paper presenting the findings of the connection between Popularity of the first name Erin and Burglaries in Alaska. Make lots of jokes, goofy observations, and puns. Sneak in some jokes and clever puns about science, research, statistics, and the variables.

Limit your response to 300 tokens. At the very end, assert that no more research is needed in this area.

Here are the title, abstract, introduction, and results sections.
[[TITLE]]
Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities

[[ABSTRACT]]
This research paper delves into the intriguing relationship between the prevalence of the first name Erin and the occurrence of burglaries in the scenic locales of Alaska. With a twinkle in our eyes and a data-driven approach, we delved into the numbers furnished by the US Social Security Administration and the FBI Criminal Justice Information Services spanning from 1985 to 2022. Our findings suggest a coefficient of correlation as high as the midnight sun at 0.9450296, and p < 0.01! Join us in unraveling the enigmatic connection between the moniker Erin and illicit entries, as we chase the elusive crook of causation in this whimsical yet perplexing pursuit.

[[INTRDUCTION]]
Greetings, esteemed colleagues and curious minds! In the grand tradition of unraveling the inexplicable and unearthing the unexpected, we embark on a whimsical journey through the tangled web of statistical significance and quirky correlations. Our curious minds have led us to ponder the peculiar connection between something as innocent as a name and something as nefarious as burglaries.
With a mischievous glint in our eyes, we set out to investigate the curious case of Erin - not the elusive cat burglar, but rather the ever-popular first name. While one might expect a name to be inconsequential in the grand tapestry of crime and statistics, our fateful encounter with the data unveiled a most unexpected correlation that left us scratching our heads and exclaiming, "Erin, oh Erin, what secrets do you hold?"
In the distant wilds of Alaska, a land of breathtaking beauty and unfathomable mysteries, we found the setting for our peculiar pursuit. The serene landscapes and the relentless chill of the Last Frontier offered a backdrop for our scholarly sleuthing into the enigmatic link between the prevalence of the name Erin and the occurrence of burglaries. As we combed through the vast archives of the US Social Security Administration and the FBI Criminal Justice Information Services, we immersed ourselves in a sea of names and crime statistics, marching onwards armed with our formidable arsenal of spreadsheets and scatter plots.
But fear not, dear readers, for our expedition was not devoid of mirth and amusement amidst the serious pursuit of knowledge. As we grappled with coefficients, p-values, and the wild variability of human behavior, we couldn't help but chuckle at the thought that perhaps Erin's popularity was reaching criminal levels. Indeed, one might say that our research revealed a burglary of statistical expectations, a break-in of assumptions, and a heist of correlation coefficients!
So, join us in this intriguing escapade as we venture into the delightfully perplexing world of science and statistics. For our findings are not just numbers on a page, but a testament to the whimsical nature of correlation and the unexpected twists that await those who dare to delve into the curiosities of human phenomena. Let the pursuit of knowledge be as thrilling as a cat-and-mouse game, and may we unravel the mysteries that lie between the lines of data and the heartbeats of curiosity. Onward we go, as we err on the side of burglaries and chase the elusive truth hidden behind the name Erin.

[[RESULTS]]
Our intrepid pursuit of the elusive connection between the popularity of the first name Erin and the occurrence of burglaries in the picturesque landscapes of Alaska has unveiled a surprising revelation. With bated breath and a sense of wonder, we present our findings that leave one pondering the mysteries of statistical serendipity.
We unearthed a striking correlation coefficient of 0.9450296, akin to stumbling upon buried treasure in the vast expanse of statistical analysis. The strength of this correlation was further evidenced by an r-squared value of 0.8930809, as if the statistical stars had aligned to guide our path through the enigmatic realm of data exploration. With a p-value less than 0.01, our results confidently stand as a testament to the unlikely rapport between the eponymous name and the clandestine activities of burglars.
In our quest to depict this enthralling correlation, we present Figure 1, a scatterplot that encapsulates the significant relationship between the prevalence of the name Erin and the occurrences of burglaries in the Alaskan localities. The data points dance along the plot with the synchronized elegance of a well-rehearsed heist, affirming the striking association between these seemingly disparate variables.
Envisioning the data as a caper of statistical intrigue, one cannot help but marvel at the curious dance of numbers and trends that reveal a connection between the innocuous name and the illicit deeds. As we immerse ourselves in the whimsical whims of scientific exploration, our findings beckon us to ponder the hidden forces that bind the human experience, from the quirks of nomenclature to the caprices of criminal behavior.
Thus, our findings reflect a dawning realization, not unlike the unexpected twist in a detective novel, that the first name Erin and the prevalence of burglaries in the captivating expanses of Alaska share an unlikely link. We invite fellow curious minds to partake in this whimsical journey as we continue to unravel the enigma that lies between the adorable moniker and the mischievous exploits of those drawn to the allure of illicit entries.

-------

Your response should start like this:

[[CONCLUSION]]



ChatGPT:

[[CONCLUSION]]

Our research journey through the peculiar conundrum of Erin's influence on Alaskan burglaries has been nothing short of a statistical rollercoaster! As we wrap up this captivating pursuit, we find ourselves entangled in a web of improbable correlations and whimsical revelations.

It's safe to say our findings raise more questions than they answer - are Erins inadvertently luring burglars with the allure of their name? Or are there clandestine Erins orchestrating these break-ins? The mysteries are as abundant as moose sightings in Anchorage!

In the realm of statistical sleuthing, we've stumbled upon a treasure trove of correlations and coefficients that rival the unpredictable nature of a grizzly bear's movements. Our correlation coefficient of 0.9450296 is as robust as an Alaskan salmon and practically screams, "There's something fishy going on here!"

But fear not, dear readers, for we've thoroughly combed through the data like a seasoned trapper on the hunt for truth. With a p-value less than 0.01, we can confidently say that the connection between the name Erin and burglaries is as sturdy as an igloo in a blizzard.

In the light of our results, it seems we've cracked open a Pandora's box of statistical eccentricities. However, as much as we're tempted to dive deeper into this whimsical whirlpool of correlations and conjectures, it's time to hang up our detective hats and declare, "Case closed!"

It's clear that our research has journeyed into the realm of delightful absurdities, shedding light on the unexpected twists and turns that await those brave enough to venture into the whimsical world of science and statistics. So, let's raise a toast to the enigmatic Erin and her shadowy connections, and declare, with a flourish, that no further forays into this unique corner of statistical wonderment are needed.

For now, let Erin's mysteries remain a charming enigma, much like the dancing Northern Lights, tantalizing but best enjoyed from a wry distance.


Model: gpt-3.5-turbo-1106
System prompt: You provide fictional names of academic journals. You say nothing else, and you do not chat with the user.
User prompt: Please provide the name of a funny but plausible sounding research, scientific, or academic journal that would publish a paper entited Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities. Respond only with the name of a fictional academic journal.

ChatGPT:

The Journal of Quirky Societal Trends and Statistical Anomalies


Model: gpt-3.5-turbo-1106
System prompt: You provide fictional names of nonprofit research organizations and associations. You say nothing else, and you do not chat with the user.
User prompt: Please provide the name of a funny but plausible sounding nonprofit, academic, or scientific association or organization that would sponsor research that resulted in a paper entited Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities. Respond only with the name of a fictional organization.

ChatGPT:

The Society for Quirky Criminology Research


Model: gpt-3.5-turbo-1106
System prompt: You provide potential search keywords separated by commas. You say nothing else, and you do not chat with the user.
User prompt: Please provide potential search keywords for a paper titled Err on the Side of Burglaries: The Correlation Between Erin's Popularity and Break-Ins in Alaskan Localities with an abstract of This research paper delves into the intriguing relationship between the prevalence of the first name Erin and the occurrence of burglaries in the scenic locales of Alaska. With a twinkle in our eyes and a data-driven approach, we delved into the numbers furnished by the US Social Security Administration and the FBI Criminal Justice Information Services spanning from 1985 to 2022. Our findings suggest a coefficient of correlation as high as the midnight sun at 0.9450296, and p < 0.01! Join us in unraveling the enigmatic connection between the moniker Erin and illicit entries, as we chase the elusive crook of causation in this whimsical yet perplexing pursuit.

ChatGPT:

Erin, burglary, Alaskan localities, prevalence of first name Erin, burglaries in Alaska, correlation between first name Erin and burglaries, US Social Security Administration data, FBI Criminal Justice Information Services data, relationship between name popularity and crime, name association with criminal activity, causation of burglary and name Erin

*There is a bunch of Python happening behind the scenes to turn this prompt sequence into a PDF.



Random correlation

Discover a new correlation

View all correlations

View all research papers

Report an error


Data details

Popularity of the first name Erin
Detailed data title: Babies of all sexes born in the US named Erin
Source: US Social Security Administration
See what else correlates with Popularity of the first name Erin

Burglaries in Alaska
Detailed data title: The burglary rate per 100,000 residents in Alaska
Source: FBI Criminal Justice Information Services
See what else correlates with Burglaries in Alaska

Correlation r = 0.9450296 (Pearson correlation coefficient)
Correlation is a measure of how much the variables move together. If it is 0.99, when one goes up the other goes up. If it is 0.02, the connection is very weak or non-existent. If it is -0.99, then when one goes up the other goes down. If it is 1.00, you probably messed up your correlation function.

r2 = 0.8930809 (Coefficient of determination)
This means 89.3% of the change in the one variable (i.e., Burglaries in Alaska) is predictable based on the change in the other (i.e., Popularity of the first name Erin) over the 38 years from 1985 through 2022.

p < 0.01, which is statistically significant(Null hypothesis significance test)
The p-value is 4.6E-19. 0.0000000000000000004644750792
The p-value is a measure of how probable it is that we would randomly find a result this extreme. More specifically the p-value is a measure of how probable it is that we would randomly find a result this extreme if we had only tested one pair of variables one time.

But I am a p-villain. I absolutely did not test only one pair of variables one time. I correlated hundreds of millions of pairs of variables. I threw boatloads of data into an industrial-sized blender to find this correlation.

Who is going to stop me? p-value reporting doesn't require me to report how many calculations I had to go through in order to find a low p-value!
On average, you will find a correaltion as strong as 0.95 in 4.6E-17% of random cases. Said differently, if you correlated 2,152,968,038,075,098,368 random variables You don't actually need 2 quintillion variables to find a correlation like this one. I don't have that many variables in my database. You can also correlate variables that are not independent. I do this a lot.

p-value calculations are useful for understanding the probability of a result happening by chance. They are most useful when used to highlight the risk of a fluke outcome. For example, if you calculate a p-value of 0.30, the risk that the result is a fluke is high. It is good to know that! But there are lots of ways to get a p-value of less than 0.01, as evidenced by this project.

In this particular case, the values are so extreme as to be meaningless. That's why no one reports p-values with specificity after they drop below 0.01.

Just to be clear: I'm being completely transparent about the calculations. There is no math trickery. This is just how statistics shakes out when you calculate hundreds of millions of random correlations.
with the same 37 degrees of freedom, Degrees of freedom is a measure of how many free components we are testing. In this case it is 37 because we have two variables measured over a period of 38 years. It's just the number of years minus ( the number of variables minus one ), which in this case simplifies to the number of years minus one.
you would randomly expect to find a correlation as strong as this one.

[ 0.9, 0.97 ] 95% correlation confidence interval (using the Fisher z-transformation)
The confidence interval is an estimate the range of the value of the correlation coefficient, using the correlation itself as an input. The values are meant to be the low and high end of the correlation coefficient with 95% confidence.

This one is a bit more complciated than the other calculations, but I include it because many people have been pushing for confidence intervals instead of p-value calculations (for example: NEJM. However, if you are dredging data, you can reliably find yourself in the 5%. That's my goal!


All values for the years included above: If I were being very sneaky, I could trim years from the beginning or end of the datasets to increase the correlation on some pairs of variables. I don't do that because there are already plenty of correlations in my database without monkeying with the years.

Still, sometimes one of the variables has more years of data available than the other. This page only shows the overlapping years. To see all the years, click on "See what else correlates with..." link above.
19851986198719881989199019911992199319941995199619971998199920002001200220032004200520062007200820092010201120122013201420152016201720182019202020212022
Popularity of the first name Erin (Babies born)115991027090398469837482287381723068857077677563986064558553445282506642404014345629762716254721301761149814051297121310731031966830779683622562450
Burglaries in Alaska (Burglary rate)1191.71161.8970.1842.3826.9894.3979.3880.7816.9800836.9843.2702.1667.4611.3621.9607.1609.2597.6573.6622.8620.1546.3471.7515.4434.8394403.9395.7427.4475.9546.6563.3542.1485379.5314.9269




Why this works

  1. Data dredging: I have 25,153 variables in my database. I compare all these variables against each other to find ones that randomly match up. That's 632,673,409 correlation calculations! This is called “data dredging.” Instead of starting with a hypothesis and testing it, I instead abused the data to see what correlations shake out. It’s a dangerous way to go about analysis, because any sufficiently large dataset will yield strong correlations completely at random.
  2. Lack of causal connection: There is probably Because these pages are automatically generated, it's possible that the two variables you are viewing are in fact causually related. I take steps to prevent the obvious ones from showing on the site (I don't let data about the weather in one city correlate with the weather in a neighboring city, for example), but sometimes they still pop up. If they are related, cool! You found a loophole.
    no direct connection between these variables, despite what the AI says above. This is exacerbated by the fact that I used "Years" as the base variable. Lots of things happen in a year that are not related to each other! Most studies would use something like "one person" in stead of "one year" to be the "thing" studied.
  3. Observations not independent: For many variables, sequential years are not independent of each other. If a population of people is continuously doing something every day, there is no reason to think they would suddenly change how they are doing that thing on January 1. A simple Personally I don't find any p-value calculation to be 'simple,' but you know what I mean.
    p-value calculation does not take this into account, so mathematically it appears less probable than it really is.




Try it yourself

You can calculate the values on this page on your own! Try running the Python code to see the calculation results. Step 1: Download and install Python on your computer.

Step 2: Open a plaintext editor like Notepad and paste the code below into it.

Step 3: Save the file as "calculate_correlation.py" in a place you will remember, like your desktop. Copy the file location to your clipboard. On Windows, you can right-click the file and click "Properties," and then copy what comes after "Location:" As an example, on my computer the location is "C:\Users\tyler\Desktop"

Step 4: Open a command line window. For example, by pressing start and typing "cmd" and them pressing enter.

Step 5: Install the required modules by typing "pip install numpy", then pressing enter, then typing "pip install scipy", then pressing enter.

Step 6: Navigate to the location where you saved the Python file by using the "cd" command. For example, I would type "cd C:\Users\tyler\Desktop" and push enter.

Step 7: Run the Python script by typing "python calculate_correlation.py"

If you run into any issues, I suggest asking ChatGPT to walk you through installing Python and running the code below on your system. Try this question:

"Walk me through installing Python on my computer to run a script that uses scipy and numpy. Go step-by-step and ask me to confirm before moving on. Start by asking me questions about my operating system so that you know how to proceed. Assume I want the simplest installation with the latest version of Python and that I do not currently have any of the necessary elements installed. Remember to only give me one step per response and confirm I have done it before proceeding."


# These modules make it easier to perform the calculation
import numpy as np
from scipy import stats

# We'll define a function that we can call to return the correlation calculations
def calculate_correlation(array1, array2):

    # Calculate Pearson correlation coefficient and p-value
    correlation, p_value = stats.pearsonr(array1, array2)

    # Calculate R-squared as the square of the correlation coefficient
    r_squared = correlation**2

    return correlation, r_squared, p_value

# These are the arrays for the variables shown on this page, but you can modify them to be any two sets of numbers
array_1 = np.array([11599,10270,9039,8469,8374,8228,7381,7230,6885,7077,6775,6398,6064,5585,5344,5282,5066,4240,4014,3456,2976,2716,2547,2130,1761,1498,1405,1297,1213,1073,1031,966,830,779,683,622,562,450,])
array_2 = np.array([1191.7,1161.8,970.1,842.3,826.9,894.3,979.3,880.7,816.9,800,836.9,843.2,702.1,667.4,611.3,621.9,607.1,609.2,597.6,573.6,622.8,620.1,546.3,471.7,515.4,434.8,394,403.9,395.7,427.4,475.9,546.6,563.3,542.1,485,379.5,314.9,269,])
array_1_name = "Popularity of the first name Erin"
array_2_name = "Burglaries in Alaska"

# Perform the calculation
print(f"Calculating the correlation between {array_1_name} and {array_2_name}...")
correlation, r_squared, p_value = calculate_correlation(array_1, array_2)

# Print the results
print("Correlation Coefficient:", correlation)
print("R-squared:", r_squared)
print("P-value:", p_value)



Reuseable content

You may re-use the images on this page for any purpose, even commercial purposes, without asking for permission. The only requirement is that you attribute Tyler Vigen. Attribution can take many different forms. If you leave the "tylervigen.com" link in the image, that satisfies it just fine. If you remove it and move it to a footnote, that's fine too. You can also just write "Charts courtesy of Tyler Vigen" at the bottom of an article.

You do not need to attribute "the spurious correlations website," and you don't even need to link here if you don't want to. I don't gain anything from pageviews. There are no ads on this site, there is nothing for sale, and I am not for hire.

For the record, I am just one person. Tyler Vigen, he/him/his. I do have degrees, but they should not go after my name unless you want to annoy my wife. If that is your goal, then go ahead and cite me as "Tyler Vigen, A.A. A.A.S. B.A. J.D." Otherwise it is just "Tyler Vigen."

When spoken, my last name is pronounced "vegan," like I don't eat meat.

Full license details.
For more on re-use permissions, or to get a signed release form, see tylervigen.com/permission.

Download images for these variables:


View another random correlation

How fun was this correlation?

I'm genuinely thankful for your rating!


Correlation ID: 3012 · Black Variable ID: 2079 · Red Variable ID: 20095
about · subscribe · emailme@tylervigen.com · twitter

CC BY 4.0