📊 Statistics 1: The Master Encyclopedia (V5)

Welcome to the ultimate guide for Statistics 1. This document is designed to be the “goated” repository for every concept, formula, and nuance in the course.


📔 Volume I: The Genesis of Data & Descriptive Foundations


🔬 Week 1: The Anatomy of Statistical Inquiry

1.1 The Epistemology of Statistics

Statistics is the mathematical framework for decision-making under uncertainty. It translates raw observation into actionable intelligence.

  • Descriptive Statistics: Summarizing and visualizing the data we have on hand.
  • Inferential Statistics: Making predictions or generalizations about a Population based on a Sample.

Key Definitions:

  • Population ($N$): The entire collection of interest.
  • Sample ($n$): The observed subset.
  • Parameter: A characteristic of the population (e.g., Population Mean $\mu$).
  • Statistic: A characteristic of the sample (e.g., Sample Mean $\bar{x}$).

1.2 The Taxonomy of Variables

| Type | Sub-type | Definition | Example |
| --- | --- | --- | --- |
| Categorical | Nominal | Identity only, no order | Blood type, Gender |
| Categorical | Ordinal | Meaningful order, distance unknown | Ranks, Likert scale |
| Numerical | Discrete | Countable gaps (integers) | No. of children |
| Numerical | Continuous | Infinite values in a range | Height, Weight |

1.3 The Scale Hierarchy (S.S. Stevens)

  1. Nominal: Identity ($=, \neq$).
  2. Ordinal: Ranking ($<, >$).
  3. Interval: Meaningful differences ($+, -$). No “True Zero” (e.g., $0\,°\text{C}$ does not mean “no temperature”).
  4. Ratio: True Zero exists ($\times, \div$). Zero means absence of the property (e.g., $0$ kg).

CAUTION

The “True Zero” Trap: Always ask: “If this is 0, does it mean the thing doesn’t exist?” If yes, it’s Ratio. If no, it’s likely Interval.



🎨 Week 2: Graphical Representations & Frequency Geometry

2.1 Frequency Distributions

Before visualizing, we must structure raw data into Frequency Tables.

  • Relative Frequency: $f_i / n$ (Proportion of the total).
  • Percent Frequency: Relative Frequency $\times\ 100$.
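As a minimal sketch, relative and percent frequencies can be computed from raw categorical data (the blood-type sample below is invented):

```python
from collections import Counter

# Hypothetical sample of blood types (nominal categorical data)
data = ["O", "A", "O", "B", "A", "O", "AB", "A", "O", "B"]
n = len(data)

freq = Counter(data)                                   # absolute frequencies
rel_freq = {k: v / n for k, v in freq.items()}         # proportion of the total
pct_freq = {k: 100 * rf for k, rf in rel_freq.items()} # relative frequency x 100

print(rel_freq["O"], pct_freq["O"])  # 0.4 40.0
```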

2.2 Visualizing Categorical Data

The goal is to show the distribution of observations across categories.

1. The Bar Chart

  • Geometry: Rectangular bars where height $\propto$ frequency.
  • Best Use: Comparing specific values across categories.
  • Variation: Pareto Chart. A bar chart sorted in descending order of frequency (an application of the 80/20 rule).

2. The Pie Chart

  • Geometry: Circular sectors where sector area $\propto$ relative frequency.
  • Formula: Sector Angle $=$ Relative Frequency $\times\ 360°$.
  • Best Use: Highlighting the “part-to-whole” relationship.

2.3 The “Golden Rule” of Visuals: The Area Principle

The visual area of a graphical element MUST be proportional to the value it represents.

  • Violation: 3D effects, varying thicknesses, or uneven widths in bar charts. These are designed to mislead and are “Cardinal Sins” in Statistics.

2.4 Case Study: Placement Analysis

  • Sector: Software (200), Core (150), Analytics (100), Other (50).
  • Total ($n$): 500.
  • Software RF: $200/500 = 0.40$.
  • Software Angle: $0.40 \times 360° = 144°$.
  • Pareto Logic: Software would be the first bar, followed by Core, then Analytics.
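The case-study arithmetic can be verified in a few lines of Python:

```python
# Verifying the placement case study: frequencies per sector
freq = {"Software": 200, "Core": 150, "Analytics": 100, "Other": 50}
n = sum(freq.values())  # total observations: 500

rel_freq = {k: v / n for k, v in freq.items()}
angles = {k: rf * 360 for k, rf in rel_freq.items()}  # pie-chart sector angles

# Pareto order: bars sorted by descending frequency
pareto = sorted(freq, key=freq.get, reverse=True)

print(rel_freq["Software"], angles["Software"])  # 0.4 144.0
print(pareto)  # ['Software', 'Core', 'Analytics', 'Other']
```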

📐 Volume II: Measures of Centrality & Linear Relationships

📏 Week 3: The Mechanics of Centrality & Dispersion

Numerical data requires summary metrics that capture where the data “centers” and how much it “leaks” (spreads).

3.1 Measures of Central Tendency

  1. Arithmetic Mean ($\bar{x}$):
    • $\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i$.
    • Philosophical Insight: The mean is the “Balance Point” of the data. If you placed weights on a beam at the data points, the mean is where it would balance perfectly.
    • Sensitivity: Extremely sensitive to outliers. One massive value pulls the mean toward it.
  2. Median ($M$):
    • The middle value of a sorted dataset.
    • If $n$ is odd: the value at rank $\frac{n+1}{2}$.
    • If $n$ is even: the average of the values at ranks $\frac{n}{2}$ and $\frac{n}{2}+1$.
    • Robustness: The median is a “Robust Statistic.” It ignores outliers.
  3. Mode:
    • The most frequent observation.
    • The only measure valid for Nominal data.
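A quick illustration with Python’s standard `statistics` module (the dataset is hypothetical, with 100 as a deliberate outlier):

```python
import statistics

data = [2, 3, 3, 5, 7, 9, 100]  # hypothetical data with one outlier (100)

mean = statistics.mean(data)      # pulled toward the outlier
median = statistics.median(data)  # robust: middle of the sorted values
mode = statistics.mode(data)      # most frequent observation

print(mean, median, mode)  # the mean (~18.43) far exceeds the median (5)
```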

3.2 Measures of Dispersion (Spread)

  1. Variance ($s^2$ or $\sigma^2$):
    • Average squared deviation from the mean.
    • Sample Variance: $s^2 = \frac{\sum_{i=1}^{n}(x_i - \bar{x})^2}{n-1}$.
    • Why $n-1$? (Bessel’s Correction): Dividing by $n$ would underestimate the true population variance. Dividing by $n-1$ makes the sample variance an “Unbiased Estimator.”
  2. Standard Deviation ($s$ or $\sigma$):
    • $s = \sqrt{s^2}$. Brings the units back to the original scale.
  3. Coefficient of Variation (CV):
    • $CV = \frac{s}{\bar{x}} \times 100\%$.
    • Use: Comparing spread across different units (e.g., variability of height in cm vs weight in kg).
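The same module computes the dispersion measures; note that `statistics.variance` already applies Bessel’s correction (the data below are invented):

```python
import statistics

data = [4, 8, 6, 5, 7]  # hypothetical sample, mean = 6

s2 = statistics.variance(data)        # sample variance, divides by n - 1
s = statistics.stdev(data)            # standard deviation, back in original units
cv = s / statistics.mean(data) * 100  # coefficient of variation, in percent

print(s2, round(s, 4), round(cv, 2))  # 2.5 1.5811 26.35
```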

3.3 Percentiles & The Anatomy of a Box Plot

  • Percentiles ($P_k$): Value such that $k\%$ of the data is at or below it.
  • Quartiles: $Q_1 = P_{25}$, $Q_2 = P_{50}$ (Median), $Q_3 = P_{75}$.
  • IQR (Interquartile Range): $IQR = Q_3 - Q_1$.
  • Outlier Detection (Tukey’s Fences):
    • Lower Fence: $Q_1 - 1.5 \times IQR$
    • Upper Fence: $Q_3 + 1.5 \times IQR$
    • Values outside these fences are statistically suspected outliers.
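A sketch of Tukey’s fences on hypothetical data. Quartile conventions differ slightly between textbooks and software; this version uses the medians of the lower and upper halves (Tukey’s hinges):

```python
import statistics

data = sorted([7, 15, 36, 39, 40, 41, 42, 43, 47, 49])  # invented sample

# Tukey's hinges: medians of the lower and upper halves
half = len(data) // 2
q1 = statistics.median(data[:half])
q3 = statistics.median(data[-half:])
iqr = q3 - q1

lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in data if x < lower_fence or x > upper_fence]

print(q1, q3, iqr, outliers)  # 36 43 7 [7, 15]
```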

🔗 Week 4: Bivariate Data (Correlation)

4.1 Scatter Plots

When we observe two variables for the same case, we search for Correlation—the degree to which they move together.

  • Positive Trend: As $x$ increases, $y$ tends to increase.
  • Negative Trend: As $x$ increases, $y$ tends to decrease.
  • No Trend: Points are scattered randomly with no discernible line.

4.2 Covariance & Pearson’s $r$

  1. Covariance ($s_{xy}$):
    • Measures the direction of the relationship.
    • $s_{xy} = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{n-1}$.
    • If positive, they move together. If negative, they move oppositely.
  2. Correlation Coefficient ($r$):
    • Normalizes covariance to the range $[-1, 1]$.
    • $r = \frac{s_{xy}}{s_x s_y}$.
    • Strength:
      • $|r|$ close to 1: Strong linear relationship.
      • Intermediate $|r|$: Moderate linear relationship.
      • $r \approx 0$: No linear relationship (they might still have a curved relationship!).
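Computing $s_{xy}$ and $r$ from scratch on invented data:

```python
import math

# Hypothetical paired observations
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

n = len(x)
mx, my = sum(x) / n, sum(y) / n

# Sample covariance: s_xy = sum((xi - x̄)(yi - ȳ)) / (n - 1)
cov = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / (n - 1)
sx = math.sqrt(sum((xi - mx) ** 2 for xi in x) / (n - 1))
sy = math.sqrt(sum((yi - my) ** 2 for yi in y) / (n - 1))

r = cov / (sx * sy)  # normalized to [-1, 1]
print(round(r, 4))   # 0.7746 — a fairly strong positive linear relationship
```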

4.3 Linear Transformations & Correlation

This is a high-frequency exam topic. If we transform $U = a + bX$ and $V = c + dY$:

  • Covariance Result: $s_{UV} = bd \cdot s_{XY}$.
  • Correlation Result:
    • $r_{UV} = r_{XY}$ if $b$ and $d$ have the same sign.
    • $r_{UV} = -r_{XY}$ if $b$ and $d$ have opposite signs.
  • Invariance: Adding/Subtracting constants ($a$, $c$) has zero effect on $r$.
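The transformation rules can be checked numerically (helper and data are invented; the $(n-1)$ factors cancel inside $r$, so they are omitted):

```python
import math

def pearson(a, b):
    """Pearson's r computed directly from deviations (the 1/(n-1) cancels)."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b))
    return cov / math.sqrt(
        sum((ai - ma) ** 2 for ai in a) * sum((bi - mb) ** 2 for bi in b)
    )

x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

r = pearson(x, y)
u = [10 + 2 * xi for xi in x]  # U = 10 + 2X  (b = 2 > 0)
v = [-3 * yi for yi in y]      # V = -3Y      (d = -3 < 0)

# bd = -6 < 0: the correlation flips sign but keeps its magnitude,
# and the added constant 10 has no effect at all
print(round(r, 4), round(pearson(u, v), 4))  # 0.7746 -0.7746
```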

4.4 Scatter Plot Interpretation Strategy

  • Step 1: Identify the independent and dependent variables.
  • Step 2: Look for the overall “cloud” shape.
  • Step 3: Is it linear? (If it’s a parabola, $r$ might be 0 even if there’s a clear relationship).
  • Step 4: Outliers? A single outlier far from the line can tank a high correlation.

🎲 Volume III: The Logic of Chance (Combinatorics & Probability)

🔢 Week 5: Countable Assemblies (Combinatorics)

Counting is the backbone of discrete probability. We must master the art of determining the size of the Sample Space ($n(S)$) and the Event Space ($n(A)$).

5.1 The Multiplication Principle (The “AND” Rule)

If task A can be done in $m$ ways and task B in $n$ ways, doing A AND B takes $m \times n$ ways.

  • Example: 3 shirts and 4 pants $\Rightarrow 3 \times 4 = 12$ outfits.

5.2 Permutations (Order Matters)

Use when the sequence is distinct (e.g., gold/silver/bronze medals).

  1. Basic Permutation: $P(n, r) = \frac{n!}{(n-r)!}$.
  2. Identical Items (The Anagram Rule): If there are $n_1$ items of one kind, $n_2$ of another, the total ways to arrange $n$ items is $\frac{n!}{n_1!\,n_2!\cdots}$:
    • Example: “STATISTICS” (10 letters: 3 S, 3 T, 2 I). Ways $= \frac{10!}{3!\,3!\,2!} = 50{,}400$.
  3. Circular Permutations: If $n$ items are in a circle, ways $= (n-1)!$.
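These counts follow directly from the standard-library `math` functions:

```python
import math

# Basic permutation: P(n, r) = n! / (n - r)!
print(math.perm(5, 3))  # 60, i.e. 5 * 4 * 3 medal assignments

# Anagram rule for "STATISTICS": 10 letters with S x3, T x3, I x2
arrangements = math.factorial(10) // (
    math.factorial(3) * math.factorial(3) * math.factorial(2)
)
print(arrangements)  # 50400

# Circular permutations of n items: (n - 1)!
print(math.factorial(5 - 1))  # 24 seatings of 5 people around a table
```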

5.3 Combinations (Order Doesn’t Matter)

Use when only the selection counts (e.g., a committee of 3).

  • Formula: $C(n, r) = \binom{n}{r} = \frac{n!}{r!\,(n-r)!}$.
  • Symmetry: $\binom{n}{r} = \binom{n}{n-r}$ (Choosing $r$ to take is the same as choosing $n-r$ to leave).
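And the combination formula, with its symmetry:

```python
import math

# C(n, r) = n! / (r! (n - r)!)
print(math.comb(10, 3))  # 120 committees of 3 from 10 people

# Symmetry: choosing r to take is choosing n - r to leave
print(math.comb(10, 3) == math.comb(10, 7))  # True
```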

5.4 The “Bundle & Gap” Strategy

  • Bundle Method: When items must be together. Bundle them as 1 item, arrange the bundle internal elements, then arrange the new set.
  • Gap Method: When items must NOT be together. Arrange the other items first, then place the restricted items in the “gaps” between them.

🌩️ Weeks 6-8: The Calculus of Uncertainty (Probability)

6.1 The Axiomatic Foundation

Probability is a measure of belief or frequency assigned to an event $A$.

  • Axiom 1: $0 \le P(A) \le 1$.
  • Axiom 2: $P(S) = 1$.
  • Axiom 3: If $A$ and $B$ are Mutually Exclusive ($A \cap B = \emptyset$), then $P(A \cup B) = P(A) + P(B)$.

6.2 The “Independence” vs. “Disjoint” Trap

  • Mutually Exclusive (Disjoint): Events CANNOT happen at the same time ($P(A \cap B) = 0$).
  • Independent: One event happening does NOT change the probability of the other.
    • Test: $P(A \cap B) = P(A)P(B)$ or $P(A \mid B) = P(A)$.
  • Crucial Fact: If $P(A) > 0$ and $P(B) > 0$, then $A$ and $B$ cannot be both independent and mutually exclusive.

7.1 Conditional Probability: The “Reduced Sample Space”

$P(A \mid B)$ is the probability of $A$ restricted to the universe of $B$.

  • Formula: $P(A \mid B) = \frac{P(A \cap B)}{P(B)}$.
  • Multiplication Rule: $P(A \cap B) = P(A \mid B)\,P(B)$.

8.1 Total Probability & Partitions

If states $B_1, B_2, \ldots, B_k$ cover the entire space and don’t overlap (a Partition), the probability of any event $A$ is:

$P(A) = \sum_{i=1}^{k} P(A \mid B_i)\,P(B_i)$

  • Visual: Think of the $B_i$ as different factories and $A$ as a “defective item.”

8.2 Bayes Theorem (Belief Revision)

Bayes allows us to “invert” conditional probability.

  • Formula: $P(B_i \mid A) = \frac{P(A \mid B_i)\,P(B_i)}{\sum_j P(A \mid B_j)\,P(B_j)}$.
  • Interpretation:
    • $P(B_i)$: Prior (What we knew before).
    • $P(A \mid B_i)$: Likelihood (How often the evidence happens in this state).
    • $P(B_i \mid A)$: Posterior (What we know after the evidence).

TIP

The Table Method for Bayes:

| State | Prior | Likelihood | Product | Posterior (Product / Sum) |
| --- | --- | --- | --- | --- |
| $B_i$ | $P(B_i)$ | $P(A \mid B_i)$ | $P(B_i)\,P(A \mid B_i)$ | Product $/\ P(A)$ |
| Sum | 1.0 | — | $P(A)$ | 1.0 |
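A minimal sketch of the table method in Python, using the factory setup from §8.1 (the `Factory A`/`Factory B` names, priors, and defect rates are invented):

```python
# The table method for Bayes' theorem: Prior x Likelihood -> Product -> Posterior
priors = {"Factory A": 0.6, "Factory B": 0.4}         # P(B_i), must sum to 1
likelihoods = {"Factory A": 0.02, "Factory B": 0.05}  # P(defective | B_i)

products = {s: priors[s] * likelihoods[s] for s in priors}
total = sum(products.values())  # P(defective), by total probability

posteriors = {s: products[s] / total for s in products}  # P(B_i | defective)
print({s: round(v, 3) for s, v in posteriors.items()})
# {'Factory A': 0.375, 'Factory B': 0.625}
```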

📈 Volume IV: Random Variables & Statistical Blueprints

🧬 Week 9: Discrete Random Variables & PMFs

A Random Variable ($X$) is a function that maps outcomes of a random experiment to real numbers. It translates “Heads/Tails” into “1/0”.

9.1 The Probability Mass Function (PMF) - $p(x)$

For a discrete RV, the PMF gives the probability of $X$ taking a specific value $x$.

  • Notation: $p(x) = P(X = x)$.
  • Existence Constraints:
    1. $p(x) \ge 0$ for all $x$.
    2. $\sum_x p(x) = 1$.

9.2 The Cumulative Distribution Function (CDF) - $F(x)$

The “Running Total” of probability.

  • Notation: $F(x) = P(X \le x)$.
  • Critical Properties:
    • $F(x)$ is non-decreasing.
    • $\lim_{x \to -\infty} F(x) = 0$ and $\lim_{x \to \infty} F(x) = 1$.
    • $P(a < X \le b) = F(b) - F(a)$.
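The PMF constraints and the CDF’s “running total” behavior, sketched with a hypothetical distribution:

```python
# A hypothetical discrete PMF and its CDF ("running total")
pmf = {0: 0.1, 1: 0.3, 2: 0.4, 3: 0.2}

# Existence constraints: p(x) >= 0 and the probabilities sum to 1
assert all(p >= 0 for p in pmf.values())
assert abs(sum(pmf.values()) - 1.0) < 1e-12

def cdf(x):
    """F(x) = P(X <= x): accumulate PMF mass at or below x."""
    return sum(p for k, p in pmf.items() if k <= x)

print(round(cdf(1), 3))           # P(X <= 1) = 0.1 + 0.3 = 0.4
print(round(cdf(2) - cdf(1), 3))  # P(1 < X <= 2) = F(2) - F(1) = 0.4
```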

9.3 Support of a Random Variable

The set of all values $x$ for which $p(x) > 0$.

  • Finite Support: e.g., $\{0, 1, 2, \ldots, n\}$.
  • Countably Infinite Support: e.g., $\{0, 1, 2, \ldots\}$.

⚖️ Week 10: The Algebra of Expectation & Variance

To summarize a distribution, we use its Expected Value (The “Center”) and Variance (The “Spread”).

10.1 Expected Value ($E[X]$)

  • Definition: The long-term weighted average of outcomes.
  • Formula: $E[X] = \sum_x x\,p(x)$.
  • Law of the Unconscious Statistician (LOTUS): To find the expectation of a function $g(X)$:
    • $E[g(X)] = \sum_x g(x)\,p(x)$.

10.2 Linearity of Expectation (The “God” Rule)

Expectation is a linear operator. This holds regardless of whether variables are independent!

  • $E[aX + b] = aE[X] + b$.
  • $E[X + Y] = E[X] + E[Y]$.

10.3 Variance ($Var(X)$ or $\sigma^2$)

  • Definition: The expected value of the squared deviation from the mean: $Var(X) = E[(X - \mu)^2]$.
  • Master Identity: $Var(X) = E[X^2] - (E[X])^2$.
  • Transformation: $Var(aX + b) = a^2\,Var(X)$.
  • Std Dev: $\sigma = \sqrt{Var(X)}$.
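The expectation and variance rules above can be verified on a small hypothetical PMF:

```python
# Checking E[X], LOTUS, and the master identity Var(X) = E[X^2] - (E[X])^2
pmf = {0: 0.2, 1: 0.5, 2: 0.3}  # invented distribution

E = sum(x * p for x, p in pmf.items())       # E[X] = 1.1
E2 = sum(x**2 * p for x, p in pmf.items())   # LOTUS with g(x) = x^2: E[X^2] = 1.7
var = E2 - E**2                              # 1.7 - 1.21 = 0.49

# Var(aX + b) = a^2 Var(X): the shift b drops out, the scale a is squared
a, b = 3, 7
var_direct = sum(((a * x + b) - (a * E + b)) ** 2 * p for x, p in pmf.items())
print(round(var, 4), round(var_direct / var, 4))  # 0.49 9.0  (9 = a^2)
```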

10.4 Independent Variables

If and are Independent:

  • Expectation: $E[XY] = E[X]E[Y]$.
  • Variance: $Var(X \pm Y) = Var(X) + Var(Y)$. (Note: It is always addition because the coefficient on $Y$ gets squared: $(\pm 1)^2 = 1$.)
  • Covariance: $Cov(X, Y) = 0$.

🏗️ Week 11: Statistical Blueprints (Specific Discrete Distributions)

Certain experiment types appear so frequently that their distributions are pre-calculated.

11.1 The Bernoulli Trial ($X \sim \text{Bernoulli}(p)$)

A single trial with two outcomes: Success (1) or Failure (0).

  • PMF: $p(x) = p^x (1-p)^{1-x}$, $x \in \{0, 1\}$.
  • Stats: $E[X] = p$, $Var(X) = p(1-p)$.

11.2 The Binomial Distribution ($X \sim \text{Bin}(n, p)$)

The count of successes in $n$ independent Bernoulli trials.

  • PMF: $P(X = x) = \binom{n}{x} p^x (1-p)^{n-x}$, $x = 0, 1, \ldots, n$.
  • Conditions:
    1. Fixed number of trials $n$.
    2. Each trial is independent.
    3. $p$ is constant (the same success probability on every trial).
  • Stats: $E[X] = np$, $Var(X) = np(1-p)$.
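A from-scratch check of the Binomial blueprint (parameters chosen arbitrarily):

```python
import math

def binom_pmf(x, n, p):
    """P(X = x) = C(n, x) * p**x * (1 - p)**(n - x)"""
    return math.comb(n, x) * p**x * (1 - p)**(n - x)

n, p = 10, 0.3  # arbitrary parameters
pmf = [binom_pmf(x, n, p) for x in range(n + 1)]

mean = sum(x * q for x, q in enumerate(pmf))
var = sum((x - mean) ** 2 * q for x, q in enumerate(pmf))

# Sums to 1, E[X] = np = 3.0, Var(X) = np(1-p) = 2.1
print(round(sum(pmf), 6), round(mean, 6), round(var, 6))  # 1.0 3.0 2.1
```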

11.3 The Poisson Distribution ($X \sim \text{Poisson}(\lambda)$)

Used for counts of “rare” events happening in a fixed interval (Time, Space, Volume).

  • PMF: $P(X = x) = \frac{e^{-\lambda}\lambda^x}{x!}$, $x = 0, 1, 2, \ldots$
  • Rate Scaling: If the rate is $\lambda$ per unit time, the mean count for an interval of length $t$ is $\lambda t$.
  • Stats: $E[X] = \lambda$, $Var(X) = \lambda$.

IMPORTANT

The Binomial Poisson Approximation: When and , the Binomial distribution can be modeled as .
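The approximation can be seen numerically; $n$ and $p$ below are arbitrary but satisfy “large $n$, small $p$”:

```python
import math

def poisson_pmf(x, lam):
    """P(X = x) = e^(-lam) * lam**x / x!"""
    return math.exp(-lam) * lam**x / math.factorial(x)

def binom_pmf(x, n, p):
    return math.comb(n, x) * p**x * (1 - p)**(n - x)

n, p = 1000, 0.002
lam = n * p  # 2.0

# The two PMFs agree to ~3 decimal places for every small x
for x in range(5):
    print(x, round(binom_pmf(x, n, p), 5), round(poisson_pmf(x, lam), 5))
```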


🌊 Week 12: Continuous Blueprints (Uniform & Exponential)

In the continuous domain, we deal with “densities” rather than discrete chunks of probability.

12.1 The Probability Density Function (PDF) - $f(x)$

For a continuous RV, $P(X = x)$ is always 0. We only measure probability over an interval $[a, b]$.

  • Core Relation: $P(a \le X \le b) = \int_a^b f(x)\,dx$.
  • Constraint: Area under the entire curve $= 1$, i.e., $\int_{-\infty}^{\infty} f(x)\,dx = 1$.

12.2 The Continuous Uniform Distribution ($X \sim U(a, b)$)

Equally likely outcomes over a range $[a, b]$.

  • PDF: $f(x) = \frac{1}{b-a}$ for $a \le x \le b$.
  • Stats: $E[X] = \frac{a+b}{2}$ (Center), $Var(X) = \frac{(b-a)^2}{12}$.

12.3 The Exponential Distribution ($X \sim \text{Exp}(\lambda)$)

Models the time between events in a Poisson process.

  • PDF: $f(x) = \lambda e^{-\lambda x}$ for $x \ge 0$.
  • CDF: $F(x) = 1 - e^{-\lambda x}$.
  • Stats: $E[X] = \frac{1}{\lambda}$, $Var(X) = \frac{1}{\lambda^2}$.

TIP

The Memoryless Property: $P(X > s + t \mid X > s) = P(X > t)$. “The probability that the car survives another $t$ hours given it has already survived $s$ hours is the same as the probability a brand new car survives $t$ hours.” Among continuous distributions, only the Exponential has this property.
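The memoryless property reduces to algebra on the survival function $P(X > x) = e^{-\lambda x}$; a numeric check with arbitrary $\lambda$, $s$, $t$:

```python
import math

lam = 0.5  # arbitrary rate; mean waiting time is 1 / lam = 2

def exp_sf(x, lam):
    """Survival function P(X > x) = e^(-lam * x) = 1 - F(x)."""
    return math.exp(-lam * x)

s, t = 3.0, 2.0
# Memoryless: P(X > s + t | X > s) = P(X > s + t) / P(X > s) = P(X > t)
lhs = exp_sf(s + t, lam) / exp_sf(s, lam)
rhs = exp_sf(t, lam)

print(round(lhs, 6), round(rhs, 6))  # both equal e^(-lam * t) = 0.367879
```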

12.4 The Poisson-Exponential Duality

  • Poisson: Counts of events (the number of events in time $t$).
  • Exponential: Time until the next event (the waiting time between events).
  • If events occur at rate $\lambda$ (Poisson), the time between them follows $\text{Exp}(\lambda)$.

🏁 Final Conclusion

This guide covers the entire spectrum of Statistics 1. From the raw classification of data to the sophisticated modeling of continuous processes, you now possess the complete theoretical toolkit for mastery.