Statistics Cheat Sheet: Formulas, Definitions & Excel Functions

Statistics Cheat Sheet

Key Concepts and Definitions

Data Types

  • Categorical Data: Data that can be grouped into categories.
    • Numeric: Represented by numbers (e.g., zip codes).
    • Non-numeric: Represented by words (e.g., colors).
  • Quantitative Data: Numerical data that represents measurements or counts.
    • Interval: Differences between values are meaningful (e.g., temperature).
    • Ratio: Has a true zero point (e.g., height).

Descriptive Statistics

  • Mean (x̄): The average of a data set. Excel: =AVERAGE(data)
  • Median: The middle value of a sorted data set. Excel: =MEDIAN(data)
  • Mode: The most frequent value in a data set. Excel: =MODE(data)
  • Range: The difference between the highest and lowest values. Excel: =MAX(data) – MIN(data)
  • Variance (s²): Measures how spread out the data is from the mean. Excel: =VAR.S(data)
  • Standard Deviation (s): The square root of the variance. Excel: =STDEV.S(data)

Probability

  • Discrete Probability Distribution: Describes the probability of each value of a discrete random variable.
    • Requirements: f(x) ≥ 0, Σf(x) = 1
  • Binomial Probability Distribution: Models the probability of successes in a fixed number of independent trials.
    • Parameters: n (number of trials), p (probability of success)
    • Excel: =BINOM.DIST(x, n, p, FALSE)
  • Normal Probability Distribution: A continuous probability distribution with a bell-shaped curve.
    • Z-Score: Measures how many standard deviations a data point is from the mean. Formula: (x – μ) / σ
    • Excel Functions: NORM.S.DIST(z, TRUE), NORM.S.INV(probability)

Other Important Concepts

  • Population Parameters: Measurements from an entire population (e.g., population mean: μ, population standard deviation: σ).
  • Sample Statistics: Measurements from a sample of a population (e.g., sample mean: x̄, sample standard deviation: s).
  • Correlation: Measures the linear relationship between two variables. Excel: =CORREL(array1, array2)
  • Covariance: Measures how two variables change together. Excel: =COVARIANCE.S(array1, array2)

Excel Formulas Quick Reference

  • Quartiles: =QUARTILE.INC(array, k)
  • Interquartile Range: =QUARTILE.INC(array, 3) – QUARTILE.INC(array, 1)
  • Percentile: =PERCENTILE.INC(array, k)

Additional Notes

  • This cheat sheet provides a summary of key statistical concepts and formulas. Refer to your textbook or other resources for more detailed information.
  • Always ensure you understand the assumptions and limitations of the statistical methods you use.