Hypothesis testing is a statistical method used to determine whether a sample of data provides sufficient evidence to support or reject a claim about a population.
Think of it as a courtroom:
Your data acts as the evidence.
Statistics helps you objectively decide whether that evidence is strong enough to reject H₀.
It helps answer key questions like:
Every hypothesis test follows four core steps:
Example:
H₀: The new onboarding process does not improve retention.
H₁: The new onboarding process does improve retention.
Most commonly 0.05 (5% chance of making a wrong decision).
Depending on the data, analysts use:
This approach ensures decisions are guided by evidence rather than bias.
Example from the slides:
A college claims average pass percentage = 85%.
Sample average = 90%.
We test whether this new result truly shows improvement.
Normally 0.05 (5%) or 0.01 (1%).
This represents the maximum risk you’re willing to take to reject a true H₀.
A formula-based number that tells you how far your sample result is from the claim.
A threshold from statistical tables.
If your test statistic crosses it → Reject H₀.
The most important concept in modern statistics.
It tells you:
“How likely is my data if H₀ is true?”
Lower p-value → stronger evidence against H₀.
Decision rule:
Z-Test Explained (Large Samples | σ Known)
Use when:
⭐ Example
Population mean = 168 cm
Sample mean = 169.5
n = 36
σ = 3.9
Compute Z-value using:
You compare this value with critical Z = ±1.96 (95% confidence).
If |Z| > 1.96 → Reject H₀
Student’s t-Test (Small Samples | σ Unknown)
Used when:
Example:
IQ sample
μ = 100
= 140
s = 20
n = 30
df = n – 1 = 29
Critical t = ±2.045 (95%)
Calculated t = 10.96 → Reject H₀
This means the medication significantly affects IQ.
Type I & Type II Errors
|
|
Reject Ho |
Fail to Reject H₀ |
|||
|
H₀ True |
❌ Type I Error (false positive) |
✔ Correct |
|||
|
H₀ False |
✔ Correct |
❌ Type II Error (false negative) |
Very important for:
Chi-Square Test (Categorical Data)
Used when comparing observed vs expected frequencies.
Weight Distribution 2010 vs 2020
Calculated χ² = 26.66
Critical χ² = 5.991
Since 26.66 > 5.991 → Reject H₀
Conclusion: Weight distribution changed significantly over 10 years
ANOVA (Analysis of Variance)
Used to compare 3 or more group means.
Key Concepts:
Example:
Three medication dosages (15 mg, 30 mg, 45 mg)
Computed F = 43.46
Critical F = 3.5546
Since F > critical F → Reject H₀
Meaning: At least one dosage works better than others.
Confidence Interval & Margin of Error
CI provides a range, not just a single estimate.
Example (CAT exam):
CI = (480.8, 559.2)
Meaning: With 95% confidence, the population mean lies in this range.
Hypothesis testing is the core of statistical decision-making.
From Z-tests to ANOVA, every method helps you answer one question:
“Is my observed data significantly different from what I expected?”
Using hypothesis testing, businesses improve decisions, researchers validate claims, and analysts turn raw data into meaningful insights.
Blog Contributed By Dr Prerna Singh