Pearson Correlation and Linear Regression Formulas

Pearson Correlation & Linear Regression Cheat Sheet


Pearson Product-Moment Correlation & Linear Regression

Correlation Definition

Correlation measures the strength and direction of a relationship between two variables.

Warning: Correlation does NOT imply causation.


Types of Correlation

TypeDescriptionGraph Trend
Positive CorrelationVariables increase/decrease togetherUpward slope
Negative CorrelationOne increases while the other decreasesDownward slope
Zero CorrelationNo predictable relationshipRandom
Read More

Mathematical Methods in PDEs and Statistical Analysis

Partial Differential Equations (PDE)

A Partial Differential Equation (PDE) involves a function u(x, y, …) and its partial derivatives.

  • Homogeneous: If every term in the equation contains the dependent variable u or its derivatives. The general solution is simply the Complementary Function (C.F.).
  • Non-Homogeneous: If there is a term that is a function of the independent variables only (f(x, y)). The solution is u = C.F. + P.I. (Particular Integral). Example: ∇2u = f(x, y) (Poisson’s Equation).
Read More

Fundamentals of Statistics: Concepts and Data Analysis

1. Statistics: Descriptive vs. Inferential

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to draw meaningful conclusions and support decision-making.

Comparison of Statistical Methods

BasisDescriptive StatisticsInferential Statistics
MeaningSummarizes and describes dataDraws conclusions about population
PurposeTo present data clearlyTo make predictions/decisions
Data usedUses complete data setUses sample data
TechniquesMean, median, mode, graphsProbability,
Read More

Maximum unambiguous range maximum theoretical range

Interquartile range = range between the first and third quartile.

Cumulative frequency = sum of all frequencies for all values.

Variance = the average of the squared differences from the Mean.

Standard variation = the average of the squared differences from the Mean under a squared root (the same as Variance just under a square root to get rid of the squared unit).

The Range = The distance between two values of which we combine their frequencies to simplify longer datasets.

Quartiles = A division of

Read More

SOC 222: Measuring the Social World Study Notes

SOC 222: Measuring the Social World

Key Concepts and Definitions

Population vs. Sample

  • Population: The entire group you want to study. Example: All students at UTM.
  • Sample: A subset of the population used to make conclusions. Example: 100 UTM students surveyed in the library.
  • Population Parameter: The true value in the population. Example: The actual percentage of all UTM students who cheat.
  • Sample Statistic: The estimate derived from the sample. Example: 15% of surveyed students admit to cheating.
  • Sampling
Read More

Statistical Analysis: Regression and Probability Models

Regression Analysis and Predictive Modeling

Regression analysis is a statistical method used to model the relationship between variables and to predict the value of one variable using another.

Main Types of Regression

  • Simple linear regression: One independent variable and one dependent variable.
  • Multiple regression: Several independent variables predicting one dependent variable.
  • Logistic regression: Used when the dependent variable represents probabilities or categories.

The goal of simple linear regression

Read More