Average temperature in Los Angeles correlates with Kerosene used in Gabon (r=0.573)

	1980	1981	1982	1983	1984	1985	1986	1987	1988	1989	1990	1991	1992	1993	1994	1995	1996	1997	1998	1999	2000	2001	2002	2003	2004	2005	2006	2007	2008	2009	2010	2011	2012	2013	2014	2015	2016	2017	2018	2019	2020	2021
Average temperature in Los Angeles (Degrees F)	67.7	68.9	65.5	68.9	68.4	67	67.7	66	66.1	66.6	67.5	65.8	68	67.2	67.5	67.8	67.6	68.3	66.1	64.2	65.1	64	64.4	65.4	65.4	65.1	66.7	65.6	66.4	66.2	65.7	63.9	65.5	65.6	68.1	68.3	67.1	68.2	67.2	66.1	67.8	65.6
Kerosene used in Gabon (Million Barrels/Day)	1.66667	1.81818	1.66667	2.12121	1.81818	1.66667	1.5	0.4	1.3	1.2	1.4	1.7	1.8	1.4	1.9	1.9	2	1.8	1.3	1.1	0.485765	0.423562	0.614164	0.487096	0.464645	0.592986	0.571808	0.571808	0.612486	0.720055	1	1	1	1	1	1	1	0.782137	0.586603	0.521425	0.499699	0.499699

# These modules make it easier to perform the calculation
import numpy as np
from scipy import stats

# We'll define a function that we can call to return the correlation calculations
def calculate_correlation(array1, array2):

    # Calculate Pearson correlation coefficient and p-value
    correlation, p_value = stats.pearsonr(array1, array2)

    # Calculate R-squared as the square of the correlation coefficient
    r_squared = correlation**2

    return correlation, r_squared, p_value

# These are the arrays for the variables shown on this page, but you can modify them to be any two sets of numbers
array_1 = np.array([67.7,68.9,65.5,68.9,68.4,67,67.7,66,66.1,66.6,67.5,65.8,68,67.2,67.5,67.8,67.6,68.3,66.1,64.2,65.1,64,64.4,65.4,65.4,65.1,66.7,65.6,66.4,66.2,65.7,63.9,65.5,65.6,68.1,68.3,67.1,68.2,67.2,66.1,67.8,65.6,])
array_2 = np.array([1.66667,1.81818,1.66667,2.12121,1.81818,1.66667,1.5,0.4,1.3,1.2,1.4,1.7,1.8,1.4,1.9,1.9,2,1.8,1.3,1.1,0.485765,0.423562,0.614164,0.487096,0.464645,0.592986,0.571808,0.571808,0.612486,0.720055,1,1,1,1,1,1,1,0.782137,0.586603,0.521425,0.499699,0.499699,])
array_1_name = "Average temperature in Los Angeles"
array_2_name = "Kerosene used in Gabon"

# Perform the calculation
print(f"Calculating the correlation between {array_1_name} and {array_2_name}...")
correlation, r_squared, p_value = calculate_correlation(array_1, array_2)

# Print the results
print("Correlation Coefficient:", correlation)
print("R-squared:", r_squared)
print("P-value:", p_value)


Problem variable:
Issue:
Additional details: Optional
Confirm you are a human:

Data details

Why this works

Try it yourself

Reuseable content