Statistics and Probability Concepts: A Comprehensive Guide
T.1 Descriptive Statistics
Variables
Absolute frequency (fi): The number of times a value (xi) appears in a dataset.
Relative frequency (hi): The proportion of times a value (xi) appears, calculated as fi/n, where n is the total number of observations.
Cumulative absolute frequency (Fi): The number of times values are less than or equal to xi.
Cumulative relative frequency (Hi): The proportion of values less than or equal to xi, calculated as Fi/n.
Frequency Tables with Grouped Data
Class intervals (Ii, Si): The lower and upper limits of a range of values.
Class mark (xi): The representative value of an interval, calculated as (Ii + Si) / 2.
Class size (ci): The length of the interval, calculated as Si – Ii.
Number of classes: The number of intervals used to group the data.
Graphic Representation of Data
- Qualitative data: Pie charts and bar charts.
- Quantitative data without grouping: Frequency polygons and cumulative frequency polygons.
- Quantitative data with grouping: Histograms and cumulative frequency histograms.
Summary Data – Measures of Position
These measures describe how a variable is distributed across its possible values.
Measures of Central Tendency
These measures identify a central point in the data.
- Mean: The average value, calculated as the sum of all values divided by the number of values.
- Median: The middle value when data is ordered, dividing the data into two equal halves.
- Mode: The most frequently occurring value.
Other Measures of Position
- Quartiles (Q1, Q2, Q3): Values that divide the ordered data into four equal parts.
- Percentiles (P1, P2, …, P99): Values that divide the ordered data into one hundred equal parts.
T.2 Probability
Probability theory describes and explores the behavior of random phenomena. A phenomenon is random if its outcome cannot be predicted with certainty.
Key Concepts
- Sample space: The set of all possible outcomes of a random phenomenon.
- Event: A subset of the sample space, representing a specific outcome or group of outcomes.
Operations on Events
- Union (A U B): The event containing all outcomes in A, B, or both.
- Intersection (A ∩ B): The event containing outcomes that belong to both A and B.
- Disjoint events: Events with no common outcomes.
- Difference (A – B): The event containing outcomes in A but not in B.
Probability
The likelihood of an event occurring, expressed as a number between 0 and 1.
Axioms of Probability
- The probability of any event is between 0 and 1.
- The probability of the entire sample space is 1.
- If two events have no common outcomes, the probability of their union is the sum of their individual probabilities.
Properties of Probability
- P(A’) = 1 – P(A), where A’ is the complement of event A.
- P(A U B) = P(A) + P(B) – P(A ∩ B).
- P(A – B) = P(A) – P(A ∩ B).
Laplace’s Rule
If a sample space contains N equally likely outcomes, the probability of each outcome is 1/N. If event A contains k outcomes, then P(A) = k/N.