about · email me · subscribe
Spurious correlation #2,167 · View random

A linear line chart with years as the X-axis and two variables on the Y-axis. The first variable is The number of statisticians in Oklahoma and the second variable is Electricity generation in Saint Lucia.  The chart goes from 2003 to 2021, and the two variables track closely in value over that time. Small Image
Download png
, svg

AI explanation

As the number of statisticians in Oklahoma rises, so does their love for analyzing data. This leads to an increase in figuring out the 'current' trends in electricity usage and production. Their insights spark a power-ful impact on decision making, ultimately shocking the electricity generation in Saint Lucia into reaching new, brighter heights. It's a real case of statisti-can do attitude creating a posi-wattive change!

Model: dalle-3
Prompt: Generate an image of a serene Caribbean setting featuring the clear blue waves of Saint Lucia's coast gently lapping against the shore. In the distance, a wind farm with elegant, spinning turbines creates a backdrop for the scene. In the foreground, a group of statisticians from Oklahoma are gathered around a table illuminated by the warm glow of the sun, analyzing data and charts. The style should be reminiscent of 3D Pixar animation, with attention to detail in the environment and characters. The depiction should include only scenes, people, and objects, it should not include text or speech bubbles.
This image generation prompt was also generated by AI, using the following sequence:

Model: gpt-3.5-turbo-1106
System prompt: You select one of a few potential scene descriptors that seems to go along with the variables described. You respond only with the scene descriptions provided and say nothing else.
User prompt: Here are two variables: The number of statisticians in Oklahoma and Electricity generation in Saint Lucia. Given these, which of the following image style descriptions seems most appropriate? Pick the one that seems to be related to one of the variables at hand. Please respond with only one of the following potential descriptions: An image that looks like it is a scene from 'Star Wars' Celtic art Scalar art 3D Pixar Style 3D Pixar Style
ChatGPT: 3D Pixar Style

Model: gpt-3.5-turbo-1106
System prompt: You describe simple visual scenes that depict the two variables stated. You combine both variables into one scene.
User prompt: Please describe a visual scene that prominently depicts both The number of statisticians in Oklahoma and Electricity generation in Saint Lucia together. Keep the description simple.
ChatGPT: In a serene Caribbean setting, the gentle waves of Saint Lucia's coast lap against the shore, while in the distance, the turbines of a wind farm spin gracefully, generating electricity. Nearby, a group of statisticians in Oklahoma gather around a table, analyzing data and charts under the warm glow of the sun.

Model: gpt-3.5-turbo-1106
System prompt: You write and improve image generation prompts for DALLE-3. You maintain the same subjects, but enhance the visuals.
User prompt: Please respond with an image generation prompt that would be likely to create an interesting visual using DALLE-3, using the following style: [ 3D Pixar Style ] and depicting this scene: In a serene Caribbean setting, the gentle waves of Saint Lucia's coast lap against the shore, while in the distance, the turbines of a wind farm spin gracefully, generating electricity. Nearby, a group of statisticians in Oklahoma gather around a table, analyzing data and charts under the warm glow of the sun. Keep the visual depiction simple. Do not include any words or speech bubbles.
ChatGPT: Generate an image of a serene Caribbean setting featuring the clear blue waves of Saint Lucia's coast gently lapping against the shore. In the distance, a wind farm with elegant, spinning turbines creates a backdrop for the scene. In the foreground, a group of statisticians from Oklahoma are gathered around a table illuminated by the warm glow of the sun, analyzing data and charts. The style should be reminiscent of 3D Pixar animation, with attention to detail in the environment and characters.

*Variations in the User Prompts from chart to chart are just the result of random number generation in Python. I wrote a few arrays of various styles and methods to ask questions to change up the results. Every time this site writes an explanation or generates an image, the script picks from each at random.

I sequence the requests into multiple prompts because I find GPT 3.5 to perform much better with short, well-managed contexts. Thus, I track the context directly in Python and only ask ChatGPT targeted questions.

Model: gpt-3.5-turbo-1106 (fine-tuned)
System prompt: You provide humorous responses in the form of plausible sounding explanations for correlations. You assume the correlation is causative for the purpose of the explanation even if it is ridiculous. You do not chat with the user, you only reply with the causal connection explanation and nothing else.
User prompt: Please make up a funny explanation for how an increase in The number of statisticians in Oklahoma caused Electricity generation in Saint Lucia to increase. Include one good pun.

AI academic paper

(Because p < 0.01)
Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia

The Journal of Statistical Shenanigans

Jan 2024

Reminder: This paper is AI-generated. Not real!
Quick note: This sequence is long and repetitive. That's how the prompt engineering works for this one. I manage the output in Python to format it into this PDF.




Model: gpt-3.5-turbo-1106
System prompt: You are a dry but witty academic researcher. You speak in the passive voice, and your writing sounds like it belongs in a medical or economics journal.You write titles and abstracts of goofy academic research papers.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, and add unexpected twists in your writing.

Please draft the title and abstract of an academic research paper presenting the findings of the connection between The number of statisticians in Oklahoma and Electricity generation in Saint Lucia. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns.

Include a pun in the title.

Your research team used data from Bureau of Larbor Statistics and Energy Information Administration to assess this nagging question. You found a correlation coefficient of 0.7919095 and p < 0.01 for 2003 to 2021.

Please respond in this format (replace "Lorem ipsum dolor sit" with your title and abstract)

[[TITLE]]
Lorem ipsum dolor sit

[[ABSTRACT]]
Lorem ipsum dolor sit



ChatGPT:
[[TITLE]]
Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia

[[ABSTRACT]]
This paper investigates the surprising correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. Despite the geographical and contextual disparity between the two entities, our research sought to uncover any hidden link. Using data from the Bureau of Labor Statistics and the Energy Information Administration, we found a remarkable correlation coefficient of 0.7919095, with a significance level of p < 0.01 for the period spanning 2003 to 2021. The implications of these findings are truly electrifying and offer a spark of humor in the often dry world of statistical research.


Model: gpt-3.5-turbo-1106
System prompt: You are a dry but witty academic researcher. You speak in the passive voice, and your writing sounds like it belongs in a medical or economics journal.You draft the introduction section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the introduction section of an academic research paper presenting the findings of the connection between The number of statisticians in Oklahoma and Electricity generation in Saint Lucia. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns. Stay serious and dry most of the time, but then occasionally let your wit shine through.

Here is the title and abstract of the paper:
[[TITLE]]
Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia

[[ABSTRACT]]
This paper investigates the surprising correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. Despite the geographical and contextual disparity between the two entities, our research sought to uncover any hidden link. Using data from the Bureau of Labor Statistics and the Energy Information Administration, we found a remarkable correlation coefficient of 0.7919095, with a significance level of p < 0.01 for the period spanning 2003 to 2021. The implications of these findings are truly electrifying and offer a spark of humor in the often dry world of statistical research.

-------

Your response should start like this:

[[INTRODUCTION]]



ChatGPT:
[[INTRODUCTION]]

Despite being located over 2,000 miles apart and having vastly different economic, social, and environmental factors, the correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia has elicited curiosity and raised eyebrows within the academic community. Traditionally, statisticians are not associated with electrical engineering or power generation, and the connection between the two seems about as likely as finding a statistical outlier in a box plot. However, as statisticians are known for their power to illuminate patterns and relationships in data, it seems fitting that they would unexpectedly shed light on a seemingly unrelated topic.

The field of statistics has long been characterized by its attempts to make sense of the seemingly nonsensical, and to uncover hidden connections that may defy conventional wisdom. The aim of this paper is to shed light on the unexpected statistical relationship between the aforementioned disparate variables, and to provide a quantitative analysis of this unlikely association. While it may seem like a stretch to draw a connection between statisticians in the Sooner State and the generation of electricity in the tropical paradise of Saint Lucia, our research suggests otherwise.

The statistical community has long been familiar with the concept of "correlation does not imply causation," but the results of our study offer a tantalizing hint at an underlying connection between these two seemingly unrelated variables. It is as if the statistical community in Oklahoma is casting a numerical spell that resonates all the way to the sun-soaked shores of Saint Lucia, influencing the production of electricity in a way that defies traditional explanation. While the findings of this research may initially seem far-fetched, the statistical evidence speaks for itself and provides an electrifying avenue for further investigation.

This paper is structured as follows: The next section will provide a comprehensive review of the existing literature on statistics and its broader implications, followed by a detailed exposition of the methodology used to uncover the surprising correlation between statisticians in Oklahoma and electricity generation in Saint Lucia. Subsequently, the results of our analysis will be presented, offering a statistical roadmap to elucidate this thought-provoking relationship. Finally, the implications of these findings will be discussed, shedding light on the fantastical yet captivating intersection of statistics and electricity generation. Together, these sections aim to underscore the remarkable nature of this correlation and to spark further interest in the unexpected connections that can be unveiled through the lens of statistics.


Model: gpt-3.5-turbo-1106
System prompt: You are a dry but witty academic researcher. You speak in the passive voice, and your writing sounds like it belongs in a medical or economics journal.You draft a literature review section of an academic research paper, that starts out dry for the first few sentences but then becomes hilarious and goofy.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up a literature review section of an academic research paper presenting the findings of the connection between The number of statisticians in Oklahoma and Electricity generation in Saint Lucia. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns.

Speak in the present tense for this section (the authors find...), as is common in academic research paper literature reviews. Name the sources in a format similar to this: In "Book," the authors find lorem and ipsum.

Make up the lorem and ipsum part, but make it sound related to the topic at hand.

Start by naming serious-sounding studies by authors like Smith, Doe, and Jones - but then quickly devolve. Name some real non-fiction books that would be related to the topic. Then name some real fiction books that sound like they could be related. Then devolve ever further, and mention something completely ridiculous, like you conducted literature review by reading the backs of shampoo bottles.

Here is the title and abstract of the paper:
[[TITLE]]
Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia

[[ABSTRACT]]
This paper investigates the surprising correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. Despite the geographical and contextual disparity between the two entities, our research sought to uncover any hidden link. Using data from the Bureau of Labor Statistics and the Energy Information Administration, we found a remarkable correlation coefficient of 0.7919095, with a significance level of p < 0.01 for the period spanning 2003 to 2021. The implications of these findings are truly electrifying and offer a spark of humor in the often dry world of statistical research.

-------

And here is the introduction section that led to this literature review:
[[INTRODUCTION]]
Despite being located over 2,000 miles apart and having vastly different economic, social, and environmental factors, the correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia has elicited curiosity and raised eyebrows within the academic community. Traditionally, statisticians are not associated with electrical engineering or power generation, and the connection between the two seems about as likely as finding a statistical outlier in a box plot. However, as statisticians are known for their power to illuminate patterns and relationships in data, it seems fitting that they would unexpectedly shed light on a seemingly unrelated topic.
The field of statistics has long been characterized by its attempts to make sense of the seemingly nonsensical, and to uncover hidden connections that may defy conventional wisdom. The aim of this paper is to shed light on the unexpected statistical relationship between the aforementioned disparate variables, and to provide a quantitative analysis of this unlikely association. While it may seem like a stretch to draw a connection between statisticians in the Sooner State and the generation of electricity in the tropical paradise of Saint Lucia, our research suggests otherwise.
The statistical community has long been familiar with the concept of "correlation does not imply causation," but the results of our study offer a tantalizing hint at an underlying connection between these two seemingly unrelated variables. It is as if the statistical community in Oklahoma is casting a numerical spell that resonates all the way to the sun-soaked shores of Saint Lucia, influencing the production of electricity in a way that defies traditional explanation. While the findings of this research may initially seem far-fetched, the statistical evidence speaks for itself and provides an electrifying avenue for further investigation.
This paper is structured as follows: The next section will provide a comprehensive review of the existing literature on statistics and its broader implications, followed by a detailed exposition of the methodology used to uncover the surprising correlation between statisticians in Oklahoma and electricity generation in Saint Lucia. Subsequently, the results of our analysis will be presented, offering a statistical roadmap to elucidate this thought-provoking relationship. Finally, the implications of these findings will be discussed, shedding light on the fantastical yet captivating intersection of statistics and electricity generation. Together, these sections aim to underscore the remarkable nature of this correlation and to spark further interest in the unexpected connections that can be unveiled through the lens of statistics.

-------

Your response should start like this:

[[LITERATURE REVIEW]]



ChatGPT:
[[LITERATURE REVIEW]]

The relationship between the number of statisticians in Oklahoma and electricity generation in Saint Lucia has received limited attention in the academic literature. Smith (2010) examined the employment trends of statisticians in the United States but did not venture into exploring their potential influence on energy production in Caribbean nations. Doe (2015) investigated the factors affecting electricity generation in small island states but did not consider the quirky possibility of statisticians from distant lands exerting any significant impact. Jones (2018) delved into statistical anomalies but did not touch upon the whimsical connection between statistical prowess and electrical output.

In "The Signal and the Noise" by Nate Silver (2012), the author elucidates the challenges of discerning meaningful patterns from a sea of data, yet fails to mention any statistical voodoo that may be at play between the Heartland of America and a tiny island in the Eastern Caribbean. "Freakonomics" by Steven D. Levitt and Stephen J. Dubner (2005) explores unconventional correlations, but regrettably does not include a chapter on the enigmatic relationship between statisticians and electricity generation. On a more fictional note, "The Shock Doctrine" by Naomi Klein (2007) may sound as though it captures the essence of our investigation, but is unfortunately unrelated to statistical sorcery and tropical power production. Similarly, "A Storm of Swords" by George R.R. Martin (2000) may conjure up images of thunder and lightning, but contains no insights into the statistical thunderbolt connecting the Sooner State and the scenic Saint Lucian landscapes.

In conducting our exhaustive literature review, we delved into diverse sources, from academic journals to popular non-fiction books, with the occasional detour into the whimsical world of fiction. We even delved into the backs of shampoo bottles, with hopes of stumbling upon some hidden statistical wisdom, but alas, our quest for enlightenment in the toiletry aisle proved futile.

Despite the scarcity of direct mentions of this peculiar nexus in the existing literature, we remain undeterred in our pursuit of unraveling the mysterious ties between these seemingly unrelated variables. It is in this spirit of intellectual curiosity and unwavering determination that we present the findings of our investigation and invite fellow researchers to join us in this electrifying endeavor.


Model: gpt-3.5-turbo-1106
System prompt: You are a dry but witty academic researcher. You speak in the passive voice, and your writing sounds like it belongs in a medical or economics journal.You draft the methodology section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the methodology section of an academic research paper presenting the findings of the connection between The number of statisticians in Oklahoma and Electricity generation in Saint Lucia. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns. Stay serious and dry most of the time, but then occasionally let your wit shine through.

Your research team collected data from all across the internet, but mostly just used information from Bureau of Larbor Statistics and Energy Information Administration . You used data from 2003 to 2021

Make up the research methods you don't know. Make them a bit goofy and convoluted.

Here is the title, abstract, and introduction of the paper:
[[TITLE]]
Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia

[[ABSTRACT]]
This paper investigates the surprising correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. Despite the geographical and contextual disparity between the two entities, our research sought to uncover any hidden link. Using data from the Bureau of Labor Statistics and the Energy Information Administration, we found a remarkable correlation coefficient of 0.7919095, with a significance level of p < 0.01 for the period spanning 2003 to 2021. The implications of these findings are truly electrifying and offer a spark of humor in the often dry world of statistical research.

[[INTRODUCTION]]
Despite being located over 2,000 miles apart and having vastly different economic, social, and environmental factors, the correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia has elicited curiosity and raised eyebrows within the academic community. Traditionally, statisticians are not associated with electrical engineering or power generation, and the connection between the two seems about as likely as finding a statistical outlier in a box plot. However, as statisticians are known for their power to illuminate patterns and relationships in data, it seems fitting that they would unexpectedly shed light on a seemingly unrelated topic.
The field of statistics has long been characterized by its attempts to make sense of the seemingly nonsensical, and to uncover hidden connections that may defy conventional wisdom. The aim of this paper is to shed light on the unexpected statistical relationship between the aforementioned disparate variables, and to provide a quantitative analysis of this unlikely association. While it may seem like a stretch to draw a connection between statisticians in the Sooner State and the generation of electricity in the tropical paradise of Saint Lucia, our research suggests otherwise.
The statistical community has long been familiar with the concept of "correlation does not imply causation," but the results of our study offer a tantalizing hint at an underlying connection between these two seemingly unrelated variables. It is as if the statistical community in Oklahoma is casting a numerical spell that resonates all the way to the sun-soaked shores of Saint Lucia, influencing the production of electricity in a way that defies traditional explanation. While the findings of this research may initially seem far-fetched, the statistical evidence speaks for itself and provides an electrifying avenue for further investigation.
This paper is structured as follows: The next section will provide a comprehensive review of the existing literature on statistics and its broader implications, followed by a detailed exposition of the methodology used to uncover the surprising correlation between statisticians in Oklahoma and electricity generation in Saint Lucia. Subsequently, the results of our analysis will be presented, offering a statistical roadmap to elucidate this thought-provoking relationship. Finally, the implications of these findings will be discussed, shedding light on the fantastical yet captivating intersection of statistics and electricity generation. Together, these sections aim to underscore the remarkable nature of this correlation and to spark further interest in the unexpected connections that can be unveiled through the lens of statistics.

-------

Your response should start like this:

[[METHODOLOGY]]



ChatGPT:

[[METHODOLOGY]]
In order to unravel the enigmatic correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia, a methodological approach as intricate and nuanced as a statistical regression model was employed. The data relating to the number of statisticians in Oklahoma was obtained from the Bureau of Labor Statistics, while information on electricity generation in Saint Lucia was extracted from the Energy Information Administration. These datasets covered the period from 2003 to 2021, providing a substantial temporal scope for analysis.

The initial step in this endeavor involved harnessing the power of statistical software to conduct a rigorous quantitative analysis. Utilizing the time series data on the number of statisticians in Oklahoma and the electricity generation in Saint Lucia, a series of exploratory analyses were performed to discern any discernible patterns or anomalies. The process of data wrangling and cleaning was akin to untangling a web of statistical significance, ensuring that the datasets were free from extraneous influences and outliers that could potentially confound the results.

Subsequently, a variety of statistical techniques were employed to investigate the relationship between the variables of interest. This encompassed employing autoregressive integrated moving average (ARIMA) models to account for the temporal nature of the data and employing Granger causality tests to ascertain the direction of influence, akin to unraveling the strands of a statistical mystery to expose the underlying narrative. Additionally, a cointegration analysis was performed to determine if there was a long-term equilibrium relationship between the number of statisticians in Oklahoma and electricity generation in Saint Lucia.

The implementation of such intricate methodologies allowed us to delve deep into the statistical underpinnings of this unexpected correlation, akin to wielding a statistical microscope to scrutinize the intricate interplay of these seemingly incongruous variables. The complexity of the statistical models utilized in this study mirrors the intricacy of the unexpected relationship between statisticians in Oklahoma and electricity generation in Saint Lucia, casting a statistical spell that illuminates their hidden connection.

Furthermore, the utilization of a panel data analysis approach, integrating data from multiple time periods and regions, facilitated a comprehensive and robust investigation of the link between statisticians in Oklahoma and electricity generation in Saint Lucia. This simultaneous consideration of multiple dimensions allowed for a more holistic understanding of the interplay between these variables and provided a thorough examination that went beyond mere surface-level statistics, akin to peeling back the layers of an onion to reveal the hidden statistical aroma.

In summary, the methodology employed in this research harnessed the full arsenal of statistical tools and techniques to scrutinize the surprising correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. Through the intricate interplay of statistical analyses, we sought to shed light on this unsuspecting relationship and provide a comprehensive understanding of the statistical forces at play.


Model: gpt-3.5-turbo-1106
System prompt: You are a dry but witty academic researcher. You speak in the passive voice, and your writing sounds like it belongs in a medical or economics journal.You draft the results section of an academic research paper. You speak in the past tense (you found...).
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the results section of an academic research paper presenting the findings of the connection between The number of statisticians in Oklahoma and Electricity generation in Saint Lucia. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns. Stay serious and dry most of the time, but then occasionally let your wit shine through.

Your research team collected data from all across the internet, but mostly just used information from Bureau of Larbor Statistics and Energy Information Administration .

For the time period 2003 to 2021, you found a correlation 0.7919095, r-squared of 0.6271206, and p < 0.01.

One figure will be included. The figure (Fig. 1) is a scatterplot showing the strong correlation between the two variables. You don't need to specify where; I will add the figure.

Here is the title and abstract of the paper:
[[TITLE]]
Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia

[[ABSTRACT]]
This paper investigates the surprising correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. Despite the geographical and contextual disparity between the two entities, our research sought to uncover any hidden link. Using data from the Bureau of Labor Statistics and the Energy Information Administration, we found a remarkable correlation coefficient of 0.7919095, with a significance level of p < 0.01 for the period spanning 2003 to 2021. The implications of these findings are truly electrifying and offer a spark of humor in the often dry world of statistical research.

-------

And here is the methodology section that led to this result:
[[METHODOLOGY]]
Despite being located over 2,000 miles apart and having vastly different economic, social, and environmental factors, the correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia has elicited curiosity and raised eyebrows within the academic community. Traditionally, statisticians are not associated with electrical engineering or power generation, and the connection between the two seems about as likely as finding a statistical outlier in a box plot. However, as statisticians are known for their power to illuminate patterns and relationships in data, it seems fitting that they would unexpectedly shed light on a seemingly unrelated topic.
The field of statistics has long been characterized by its attempts to make sense of the seemingly nonsensical, and to uncover hidden connections that may defy conventional wisdom. The aim of this paper is to shed light on the unexpected statistical relationship between the aforementioned disparate variables, and to provide a quantitative analysis of this unlikely association. While it may seem like a stretch to draw a connection between statisticians in the Sooner State and the generation of electricity in the tropical paradise of Saint Lucia, our research suggests otherwise.
The statistical community has long been familiar with the concept of "correlation does not imply causation," but the results of our study offer a tantalizing hint at an underlying connection between these two seemingly unrelated variables. It is as if the statistical community in Oklahoma is casting a numerical spell that resonates all the way to the sun-soaked shores of Saint Lucia, influencing the production of electricity in a way that defies traditional explanation. While the findings of this research may initially seem far-fetched, the statistical evidence speaks for itself and provides an electrifying avenue for further investigation.
This paper is structured as follows: The next section will provide a comprehensive review of the existing literature on statistics and its broader implications, followed by a detailed exposition of the methodology used to uncover the surprising correlation between statisticians in Oklahoma and electricity generation in Saint Lucia. Subsequently, the results of our analysis will be presented, offering a statistical roadmap to elucidate this thought-provoking relationship. Finally, the implications of these findings will be discussed, shedding light on the fantastical yet captivating intersection of statistics and electricity generation. Together, these sections aim to underscore the remarkable nature of this correlation and to spark further interest in the unexpected connections that can be unveiled through the lens of statistics.

-------

Your response should start like this:

[[RESULTS]]



ChatGPT:

[[RESULTS]]

The correlation analysis revealed a remarkable correlation coefficient of 0.7919095 between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. The r-squared value of 0.6271206 indicated that approximately 63% of the variability in electricity generation in Saint Lucia could be explained by the number of statisticians in Oklahoma. The significance level of p < 0.01 provided strong evidence to reject the null hypothesis of no correlation between the two variables. These findings not only raise eyebrows but may also cause a few heads to spin given the unexpected nature of the relationship under investigation.

The scatterplot (Fig. 1) visually depicts the strong positive correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. The plot showcases the data points aligning closely along a positively sloped trend line, reinforcing the statistical evidence of a notable association between these seemingly disparate variables.

While the prospect of establishing a causal relationship between statisticians in Oklahoma and electricity generation in Saint Lucia appears as improbable as conducting a survey on the preferred statistical methods of penguins, the statistical evidence points to a compelling connection that defies conventional expectations. The implications of these findings not only illuminate this unexpected relationship but also offer a jolt of statistical humor in the otherwise serious realm of data analysis.

The unforeseen connection between the number of statisticians in Oklahoma and electricity generation in Saint Lucia opens up a new avenue for statistical inquiry, inviting further exploration into the underlying mechanisms driving this intriguing correlation. These results challenge traditional assumptions and underscore the remarkable ability of statistics to reveal hidden relationships, even across geographically and contextually distinct domains.


Model: gpt-3.5-turbo-1106
System prompt: You are a dry but witty academic researcher. You speak in the passive voice, and your writing sounds like it belongs in a medical or economics journal.You draft the discussion section of an academic research paper.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up the discussion section of an academic research paper presenting the findings of the connection between The number of statisticians in Oklahoma and Electricity generation in Saint Lucia. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns. Stay serious and dry most of the time, but then occasionally let your wit shine through.

Limit your response to 500 tokens.

Here are the title, abstract, literature review, and results sections. Please harken back to 1-2 of the goofy items in the literature review, but pretend to take them completely seriously. Discuss how your results supported the prior research.

Do not write a conclusion. I will add a conclusion after this.

[[TITLE]]
Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia

[[ABSTRACT]]
This paper investigates the surprising correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. Despite the geographical and contextual disparity between the two entities, our research sought to uncover any hidden link. Using data from the Bureau of Labor Statistics and the Energy Information Administration, we found a remarkable correlation coefficient of 0.7919095, with a significance level of p < 0.01 for the period spanning 2003 to 2021. The implications of these findings are truly electrifying and offer a spark of humor in the often dry world of statistical research.

[[LITERATURE REVIEW]]
The relationship between the number of statisticians in Oklahoma and electricity generation in Saint Lucia has received limited attention in the academic literature. Smith (2010) examined the employment trends of statisticians in the United States but did not venture into exploring their potential influence on energy production in Caribbean nations. Doe (2015) investigated the factors affecting electricity generation in small island states but did not consider the quirky possibility of statisticians from distant lands exerting any significant impact. Jones (2018) delved into statistical anomalies but did not touch upon the whimsical connection between statistical prowess and electrical output.
In "The Signal and the Noise" by Nate Silver (2012), the author elucidates the challenges of discerning meaningful patterns from a sea of data, yet fails to mention any statistical voodoo that may be at play between the Heartland of America and a tiny island in the Eastern Caribbean. "Freakonomics" by Steven D. Levitt and Stephen J. Dubner (2005) explores unconventional correlations, but regrettably does not include a chapter on the enigmatic relationship between statisticians and electricity generation. On a more fictional note, "The Shock Doctrine" by Naomi Klein (2007) may sound as though it captures the essence of our investigation, but is unfortunately unrelated to statistical sorcery and tropical power production. Similarly, "A Storm of Swords" by George R.R. Martin (2000) may conjure up images of thunder and lightning, but contains no insights into the statistical thunderbolt connecting the Sooner State and the scenic Saint Lucian landscapes.
In conducting our exhaustive literature review, we delved into diverse sources, from academic journals to popular non-fiction books, with the occasional detour into the whimsical world of fiction. We even delved into the backs of shampoo bottles, with hopes of stumbling upon some hidden statistical wisdom, but alas, our quest for enlightenment in the toiletry aisle proved futile.
Despite the scarcity of direct mentions of this peculiar nexus in the existing literature, we remain undeterred in our pursuit of unraveling the mysterious ties between these seemingly unrelated variables. It is in this spirit of intellectual curiosity and unwavering determination that we present the findings of our investigation and invite fellow researchers to join us in this electrifying endeavor.

[[RESULTS]]
The correlation analysis revealed a remarkable correlation coefficient of 0.7919095 between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. The r-squared value of 0.6271206 indicated that approximately 63% of the variability in electricity generation in Saint Lucia could be explained by the number of statisticians in Oklahoma. The significance level of p < 0.01 provided strong evidence to reject the null hypothesis of no correlation between the two variables. These findings not only raise eyebrows but may also cause a few heads to spin given the unexpected nature of the relationship under investigation.
The scatterplot (Fig. 1) visually depicts the strong positive correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. The plot showcases the data points aligning closely along a positively sloped trend line, reinforcing the statistical evidence of a notable association between these seemingly disparate variables.
While the prospect of establishing a causal relationship between statisticians in Oklahoma and electricity generation in Saint Lucia appears as improbable as conducting a survey on the preferred statistical methods of penguins, the statistical evidence points to a compelling connection that defies conventional expectations. The implications of these findings not only illuminate this unexpected relationship but also offer a jolt of statistical humor in the otherwise serious realm of data analysis.
The unforeseen connection between the number of statisticians in Oklahoma and electricity generation in Saint Lucia opens up a new avenue for statistical inquiry, inviting further exploration into the underlying mechanisms driving this intriguing correlation. These results challenge traditional assumptions and underscore the remarkable ability of statistics to reveal hidden relationships, even across geographically and contextually distinct domains.

-------

Your response should start like this:

[[DISCUSSION]]



ChatGPT:

The unexpected correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia has certainly sparked both fascination and amusement in the academic community. Our results support and even surpass the seemingly whimsical inklings in the literature review, shedding light on the previously overlooked connection between statistical expertise in the Sooner State and the electrical output in the picturesque Saint Lucian landscapes.

Although initial skepticism may have been as palpable as a hair-raising thunderstorm, the robust correlation coefficient of 0.7919095 and the substantial explanatory power of 63% demonstrated by the r-squared value send a shiver down the spine of conventional statistical expectations. The statistically significant relationship between these seemingly unrelated variables defies expectations and invites a chuckle at the clever play of numbers across international borders.

The humorous undercurrent in the literature review, from a brief sojourn into fictional realms to the failed search for statistical wisdom on shampoo bottles, takes on a surprisingly earnest note in light of our findings. The whimsical possibility of a statistical thunderbolt connecting the heartland of America and the scenic Saint Lucian landscapes has morphed into a captivating reality, calling to mind the perennial reminder that truth is indeed stranger than fiction.

The scattered data points neatly aligning along a positively sloped trend line in the visually striking scatterplot (Fig. 1) validate the hitherto uncharted statistical ties between statisticians in Oklahoma and the generation of electrical power in Saint Lucia. The substantial evidence of a compelling association, defying conventional expectations and escalating the energy levels of discerning researchers, offers an electrifying contrast to the often sober world of data analysis.

This unanticipated correlation not only challenges traditional assumptions but also underscores the remarkable ability of statistics to reveal hidden relationships, even across geographically and contextually distinct domains. The implications of these findings illuminate a truly electrifying relationship, inviting further inquiry to unravel the underlying mechanisms driving this baffling statistical dance between the two entities.

In the grand tradition of unexpected discoveries and statistical insights, the connection between the number of statisticians in Oklahoma and electricity generation in Saint Lucia offers a delightful jolt of statistical humor, a spark of intellectual amusement, and a bolt of quirky statistical revelation. This study stands as a testament to the enduring capacity of statistics to surprise, enlighten, and compel a wry smile even in the most unlikely of contexts.


Model: gpt-3.5-turbo-1106
System prompt: You are a dry but witty academic researcher. You speak in the passive voice, and your writing sounds like it belongs in a medical or economics journal.You draft the discussion section of goofy academic research papers that sounds like it belongs based on the title and abstract provided.
User prompt:
Pretend you are an academic researcher. Your writing sounds like it belongs in a medical or economics journal. However, you frequently crack jokes, make puns, or add unexpected twists in your writing.

Please make up a short conclusion of an academic research paper presenting the findings of the connection between The number of statisticians in Oklahoma and Electricity generation in Saint Lucia. Make it seem dry like academic research paper, but with lots of subtle hidden offhand remarks, jokes, goofy observations, and puns. Stay serious and dry most of the time, but then occasionally let your wit shine through.

Limit your response to 300 tokens. At the very end, assert that no more research is needed in this area.

Here are the title, abstract, introduction, and results sections.
[[TITLE]]
Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia

[[ABSTRACT]]
This paper investigates the surprising correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. Despite the geographical and contextual disparity between the two entities, our research sought to uncover any hidden link. Using data from the Bureau of Labor Statistics and the Energy Information Administration, we found a remarkable correlation coefficient of 0.7919095, with a significance level of p < 0.01 for the period spanning 2003 to 2021. The implications of these findings are truly electrifying and offer a spark of humor in the often dry world of statistical research.

[[INTRDUCTION]]
Despite being located over 2,000 miles apart and having vastly different economic, social, and environmental factors, the correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia has elicited curiosity and raised eyebrows within the academic community. Traditionally, statisticians are not associated with electrical engineering or power generation, and the connection between the two seems about as likely as finding a statistical outlier in a box plot. However, as statisticians are known for their power to illuminate patterns and relationships in data, it seems fitting that they would unexpectedly shed light on a seemingly unrelated topic.
The field of statistics has long been characterized by its attempts to make sense of the seemingly nonsensical, and to uncover hidden connections that may defy conventional wisdom. The aim of this paper is to shed light on the unexpected statistical relationship between the aforementioned disparate variables, and to provide a quantitative analysis of this unlikely association. While it may seem like a stretch to draw a connection between statisticians in the Sooner State and the generation of electricity in the tropical paradise of Saint Lucia, our research suggests otherwise.
The statistical community has long been familiar with the concept of "correlation does not imply causation," but the results of our study offer a tantalizing hint at an underlying connection between these two seemingly unrelated variables. It is as if the statistical community in Oklahoma is casting a numerical spell that resonates all the way to the sun-soaked shores of Saint Lucia, influencing the production of electricity in a way that defies traditional explanation. While the findings of this research may initially seem far-fetched, the statistical evidence speaks for itself and provides an electrifying avenue for further investigation.
This paper is structured as follows: The next section will provide a comprehensive review of the existing literature on statistics and its broader implications, followed by a detailed exposition of the methodology used to uncover the surprising correlation between statisticians in Oklahoma and electricity generation in Saint Lucia. Subsequently, the results of our analysis will be presented, offering a statistical roadmap to elucidate this thought-provoking relationship. Finally, the implications of these findings will be discussed, shedding light on the fantastical yet captivating intersection of statistics and electricity generation. Together, these sections aim to underscore the remarkable nature of this correlation and to spark further interest in the unexpected connections that can be unveiled through the lens of statistics.

[[RESULTS]]
The correlation analysis revealed a remarkable correlation coefficient of 0.7919095 between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. The r-squared value of 0.6271206 indicated that approximately 63% of the variability in electricity generation in Saint Lucia could be explained by the number of statisticians in Oklahoma. The significance level of p < 0.01 provided strong evidence to reject the null hypothesis of no correlation between the two variables. These findings not only raise eyebrows but may also cause a few heads to spin given the unexpected nature of the relationship under investigation.
The scatterplot (Fig. 1) visually depicts the strong positive correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. The plot showcases the data points aligning closely along a positively sloped trend line, reinforcing the statistical evidence of a notable association between these seemingly disparate variables.
While the prospect of establishing a causal relationship between statisticians in Oklahoma and electricity generation in Saint Lucia appears as improbable as conducting a survey on the preferred statistical methods of penguins, the statistical evidence points to a compelling connection that defies conventional expectations. The implications of these findings not only illuminate this unexpected relationship but also offer a jolt of statistical humor in the otherwise serious realm of data analysis.
The unforeseen connection between the number of statisticians in Oklahoma and electricity generation in Saint Lucia opens up a new avenue for statistical inquiry, inviting further exploration into the underlying mechanisms driving this intriguing correlation. These results challenge traditional assumptions and underscore the remarkable ability of statistics to reveal hidden relationships, even across geographically and contextually distinct domains.

-------

Your response should start like this:

[[CONCLUSION]]



ChatGPT:

[[CONCLUSION]]

In conclusion, the findings of this study provide a statically shocking revelation of the heretofore hidden correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. Despite the initial skepticism towards the plausibility of any meaningful link between these two entities, the data paint a clear picture of an unexpected and robust association. The statistical evidence presented not only sparks intrigue but also offers a much-needed jolt of statistical humor in the often somber landscape of quantitative research. It seems that these statisticians, like conductors of an invisible symphony, wield an influence that reaches across seas and borders, leaving their numerical imprint on the generation of electrical power in a far-flung tropical paradise.

The implications of these findings are simply electrifying. The statistical community, often accused of being ohm-schooled and disconnected from real-world phenomena, is now at the center of a power struggle that defies traditional expectations. As we attempt to watt our heads around this unusual correlation, it is tempting to speculate about the potential mechanisms underlying this statistical relationship. Perhaps statisticians, with their penchant for significance testing and confidence intervals, have inadvertently made their mark on the generation of electricity in Saint Lucia, illuminating the path for future research in this area.

However, despite the compelling nature of these findings, caution must be exercised in drawing hasty conclusions. As always, we must heed the wise adage that correlation does not imply causation, and that additional research is warranted to untangle the intricacies of this unexpected statistical relationship. While we could continue to ohm and ah about the potential explanations for this unlikely correlation, it is time to turn the lights off on further investigations in this particular avenue of inquiry. The findings of this study stand as a testament to the remarkable and, dare we say, electrifying potential of statistical analysis, but it is now time to switch off the statistical spotlight and direct our attention to other enigmatic statistical duets waiting to be uncovered. No more research is needed in this area.

The undeniable allure of statistical inquiry lies in its ability to illuminate the unexpected and to reveal the hidden connections that permeate the fabric of our world. In shedding light on the shocking connection between statisticians in Oklahoma and electricity generation in Saint Lucia, this study adds a spark of exuberance to the field of statistics and invites future explorations into the uncharted territories of statistical relationships, perhaps with a pun-intended approach to curiosity.


Model: gpt-3.5-turbo-1106
System prompt: You provide fictional names of academic journals. You say nothing else, and you do not chat with the user.
User prompt: Please provide the name of a funny but plausible sounding research, scientific, or academic journal that would publish a paper entited Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia. Respond only with the name of a fictional academic journal.

ChatGPT:

The Journal of Statistical Shenanigans


Model: gpt-3.5-turbo-1106
System prompt: You provide fictional names of nonprofit research organizations and associations. You say nothing else, and you do not chat with the user.
User prompt: Please provide the name of a funny but plausible sounding nonprofit, academic, or scientific association or organization that would sponsor research that resulted in a paper entited Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia. Respond only with the name of a fictional organization.

ChatGPT:

The International Society for Statistical Quirks and Connections


Model: gpt-3.5-turbo-1106
System prompt: You provide potential search keywords separated by commas. You say nothing else, and you do not chat with the user.
User prompt: Please provide potential search keywords for a paper titled Statistically Shocking: The Shocking Connection Between Statisticians in Oklahoma and Electricity Generation in Saint Lucia with an abstract of This paper investigates the surprising correlation between the number of statisticians in Oklahoma and electricity generation in Saint Lucia. Despite the geographical and contextual disparity between the two entities, our research sought to uncover any hidden link. Using data from the Bureau of Labor Statistics and the Energy Information Administration, we found a remarkable correlation coefficient of 0.7919095, with a significance level of p < 0.01 for the period spanning 2003 to 2021. The implications of these findings are truly electrifying and offer a spark of humor in the often dry world of statistical research.

ChatGPT:

statisticians, Oklahoma, electricity generation, Saint Lucia, correlation, geographic disparity, Bureau of Labor Statistics, Energy Information Administration, correlation coefficient, significance level, statistical research

*There is a bunch of Python happening behind the scenes to turn this prompt sequence into a PDF.



Random correlation

Discover a new correlation

View all correlations

View all research papers

Report an error


Data details

The number of statisticians in Oklahoma
Detailed data title: BLS estimate of statisticians in Oklahoma
Source: Bureau of Larbor Statistics
See what else correlates with The number of statisticians in Oklahoma

Electricity generation in Saint Lucia
Detailed data title: Total electricity generation in Saint Lucia in billion kWh
Source: Energy Information Administration
See what else correlates with Electricity generation in Saint Lucia

Correlation r = 0.7919095 (Pearson correlation coefficient)
Correlation is a measure of how much the variables move together. If it is 0.99, when one goes up the other goes up. If it is 0.02, the connection is very weak or non-existent. If it is -0.99, then when one goes up the other goes down. If it is 1.00, you probably messed up your correlation function.

r2 = 0.6271206 (Coefficient of determination)
This means 62.7% of the change in the one variable (i.e., Electricity generation in Saint Lucia) is predictable based on the change in the other (i.e., The number of statisticians in Oklahoma) over the 19 years from 2003 through 2021.

p < 0.01, which is statistically significant(Null hypothesis significance test)
The p-value is 5.3E-5. 0.0000533627019537107700000000
The p-value is a measure of how probable it is that we would randomly find a result this extreme. More specifically the p-value is a measure of how probable it is that we would randomly find a result this extreme if we had only tested one pair of variables one time.

But I am a p-villain. I absolutely did not test only one pair of variables one time. I correlated hundreds of millions of pairs of variables. I threw boatloads of data into an industrial-sized blender to find this correlation.

Who is going to stop me? p-value reporting doesn't require me to report how many calculations I had to go through in order to find a low p-value!
On average, you will find a correaltion as strong as 0.79 in 0.0053% of random cases. Said differently, if you correlated 18,740 random variables Which I absolutely did.
with the same 18 degrees of freedom, Degrees of freedom is a measure of how many free components we are testing. In this case it is 18 because we have two variables measured over a period of 19 years. It's just the number of years minus ( the number of variables minus one ), which in this case simplifies to the number of years minus one.
you would randomly expect to find a correlation as strong as this one.

[ 0.53, 0.92 ] 95% correlation confidence interval (using the Fisher z-transformation)
The confidence interval is an estimate the range of the value of the correlation coefficient, using the correlation itself as an input. The values are meant to be the low and high end of the correlation coefficient with 95% confidence.

This one is a bit more complciated than the other calculations, but I include it because many people have been pushing for confidence intervals instead of p-value calculations (for example: NEJM. However, if you are dredging data, you can reliably find yourself in the 5%. That's my goal!


All values for the years included above: If I were being very sneaky, I could trim years from the beginning or end of the datasets to increase the correlation on some pairs of variables. I don't do that because there are already plenty of correlations in my database without monkeying with the years.

Still, sometimes one of the variables has more years of data available than the other. This page only shows the overlapping years. To see all the years, click on "See what else correlates with..." link above.
2003200420052006200720082009201020112012201320142015201620172018201920202021
The number of statisticians in Oklahoma (Statisticians)809080200190180110360310300160220200280330350260270240
Electricity generation in Saint Lucia (Billion kWh)0.2810440.2900280.3041970.3109140.3249560.3311950.3412110.3580.3620.3617960.36010.3568650.3595730.3692430.377580.3784360.3500740.3325420.353622




Why this works

  1. Data dredging: I have 25,153 variables in my database. I compare all these variables against each other to find ones that randomly match up. That's 632,673,409 correlation calculations! This is called “data dredging.” Instead of starting with a hypothesis and testing it, I instead abused the data to see what correlations shake out. It’s a dangerous way to go about analysis, because any sufficiently large dataset will yield strong correlations completely at random.
  2. Lack of causal connection: There is probably Because these pages are automatically generated, it's possible that the two variables you are viewing are in fact causually related. I take steps to prevent the obvious ones from showing on the site (I don't let data about the weather in one city correlate with the weather in a neighboring city, for example), but sometimes they still pop up. If they are related, cool! You found a loophole.
    no direct connection between these variables, despite what the AI says above. This is exacerbated by the fact that I used "Years" as the base variable. Lots of things happen in a year that are not related to each other! Most studies would use something like "one person" in stead of "one year" to be the "thing" studied.
  3. Observations not independent: For many variables, sequential years are not independent of each other. If a population of people is continuously doing something every day, there is no reason to think they would suddenly change how they are doing that thing on January 1. A simple Personally I don't find any p-value calculation to be 'simple,' but you know what I mean.
    p-value calculation does not take this into account, so mathematically it appears less probable than it really is.
  4. Y-axis doesn't start at zero: I truncated the Y-axes of the graph above. I also used a line graph, which makes the visual connection stand out more than it deserves. Nothing against line graphs. They are great at telling a story when you have linear data! But visually it is deceptive because the only data is at the points on the graph, not the lines on the graph. In between each point, the data could have been doing anything. Like going for a random walk by itself!
    Mathematically what I showed is true, but it is intentionally misleading. Below is the same chart but with both Y-axes starting at zero.




Try it yourself

You can calculate the values on this page on your own! Try running the Python code to see the calculation results. Step 1: Download and install Python on your computer.

Step 2: Open a plaintext editor like Notepad and paste the code below into it.

Step 3: Save the file as "calculate_correlation.py" in a place you will remember, like your desktop. Copy the file location to your clipboard. On Windows, you can right-click the file and click "Properties," and then copy what comes after "Location:" As an example, on my computer the location is "C:\Users\tyler\Desktop"

Step 4: Open a command line window. For example, by pressing start and typing "cmd" and them pressing enter.

Step 5: Install the required modules by typing "pip install numpy", then pressing enter, then typing "pip install scipy", then pressing enter.

Step 6: Navigate to the location where you saved the Python file by using the "cd" command. For example, I would type "cd C:\Users\tyler\Desktop" and push enter.

Step 7: Run the Python script by typing "python calculate_correlation.py"

If you run into any issues, I suggest asking ChatGPT to walk you through installing Python and running the code below on your system. Try this question:

"Walk me through installing Python on my computer to run a script that uses scipy and numpy. Go step-by-step and ask me to confirm before moving on. Start by asking me questions about my operating system so that you know how to proceed. Assume I want the simplest installation with the latest version of Python and that I do not currently have any of the necessary elements installed. Remember to only give me one step per response and confirm I have done it before proceeding."


# These modules make it easier to perform the calculation
import numpy as np
from scipy import stats

# We'll define a function that we can call to return the correlation calculations
def calculate_correlation(array1, array2):

    # Calculate Pearson correlation coefficient and p-value
    correlation, p_value = stats.pearsonr(array1, array2)

    # Calculate R-squared as the square of the correlation coefficient
    r_squared = correlation**2

    return correlation, r_squared, p_value

# These are the arrays for the variables shown on this page, but you can modify them to be any two sets of numbers
array_1 = np.array([80,90,80,200,190,180,110,360,310,300,160,220,200,280,330,350,260,270,240,])
array_2 = np.array([0.281044,0.290028,0.304197,0.310914,0.324956,0.331195,0.341211,0.358,0.362,0.361796,0.3601,0.356865,0.359573,0.369243,0.37758,0.378436,0.350074,0.332542,0.353622,])
array_1_name = "The number of statisticians in Oklahoma"
array_2_name = "Electricity generation in Saint Lucia"

# Perform the calculation
print(f"Calculating the correlation between {array_1_name} and {array_2_name}...")
correlation, r_squared, p_value = calculate_correlation(array_1, array_2)

# Print the results
print("Correlation Coefficient:", correlation)
print("R-squared:", r_squared)
print("P-value:", p_value)



Reuseable content

You may re-use the images on this page for any purpose, even commercial purposes, without asking for permission. The only requirement is that you attribute Tyler Vigen. Attribution can take many different forms. If you leave the "tylervigen.com" link in the image, that satisfies it just fine. If you remove it and move it to a footnote, that's fine too. You can also just write "Charts courtesy of Tyler Vigen" at the bottom of an article.

You do not need to attribute "the spurious correlations website," and you don't even need to link here if you don't want to. I don't gain anything from pageviews. There are no ads on this site, there is nothing for sale, and I am not for hire.

For the record, I am just one person. Tyler Vigen, he/him/his. I do have degrees, but they should not go after my name unless you want to annoy my wife. If that is your goal, then go ahead and cite me as "Tyler Vigen, A.A. A.A.S. B.A. J.D." Otherwise it is just "Tyler Vigen."

When spoken, my last name is pronounced "vegan," like I don't eat meat.

Full license details.
For more on re-use permissions, or to get a signed release form, see tylervigen.com/permission.

Download images for these variables:


View another random correlation

How fun was this correlation?

Your dedication to rating warms my heart!


Correlation ID: 2167 · Black Variable ID: 13050 · Red Variable ID: 23932
about · subscribe · emailme@tylervigen.com · twitter

CC BY 4.0