European Global

Making Smarter Decisions With Data: A Simple Guide to Hypothesis Testing

What is Hypothesis Testing?

Hypothesis testing is a statistical method used to determine whether a sample of data provides sufficient evidence to support or reject a claim about a population.

 

Think of it as a courtroom:

 

  • The Null Hypothesis (H₀) is the default assumption (“no change”, “no difference”).
  • The Alternative Hypothesis (H₁) is what you want to prove.

 

Your data acts as the evidence.

 

Statistics helps you objectively decide whether that evidence is strong enough to reject H₀.

 

It helps answer key questions like:

 

  • Is the new medicine effective?
  • Has customer satisfaction improved?
  • Is the coin fair?
  • Has the population changed over time?

 

How Does It Work?

 

Every hypothesis test follows four core steps:

 

  1. Define H₀ and H₁

 

Example:


H₀: The new onboarding process does not improve retention.
H₁: The new onboarding process does improve retention.

 

  1. Choose a significance level (α)

 

Most commonly 0.05 (5% chance of making a wrong decision).

 

  1. Run the appropriate test

 

Depending on the data, analysts use:

 

  • Z-tests (large samples)
  • t-tests (small samples)
  • Chi-square tests (categorical data)
  • ANOVA/F-tests (3+ group comparisons)

 

  1. Make a decision using p-value or critical value

 

  • If p ≤ 0.05, the result is statistically significant → Reject H₀
  • If p > 0.05, there isn’t enough evidence → Fail to reject H₀

 

This approach ensures decisions are guided by evidence rather than bias.

 

Key Concepts You Must Know

  1. Null & Alternate Hypothesis
  • H₀: “Nothing has changed.”
  • H₁: “Something has changed.”

 

Example from the slides:

 

A college claims average pass percentage = 85%.
Sample average = 90%.
We test whether this new result truly shows improvement.

 

  1. Significance Level (α)

 

Normally 0.05 (5%) or 0.01 (1%).
This represents the maximum risk you’re willing to take to reject a true H₀.

 

  1. Test Statistic (Z or t)

 

A formula-based number that tells you how far your sample result is from the claim.

 

  1. Critical Value

 

A threshold from statistical tables.
If your test statistic crosses it → Reject H₀.

 

  1. P-Value

 

The most important concept in modern statistics.

 

It tells you:

 

“How likely is my data if H₀ is true?”

Lower p-value → stronger evidence against H₀.

 

Decision rule:

  • p ≤ α → Reject H₀
  • p > α → Fail to reject H₀

 

Z-Test Explained (Large Samples | σ Known)

 

Use when:

  • Sample size ≥ 30
  • Population standard deviation (σ) is known
  • Data is normally distributed

 

 Example

Population mean = 168 cm
Sample mean = 169.5
n = 36
σ = 3.9

 

Compute Z-value using:

 

You compare this value with critical Z = ±1.96 (95% confidence).

 

If |Z| > 1.96 → Reject H₀

 

Student’s t-Test (Small Samples | σ Unknown)

 

Used when:

 

  • n < 30
  • Population standard deviation unknown
  • Replace σ with sample SD (s)

 

Example:

 

IQ sample
μ = 100
= 140
s = 20
n = 30
df = n – 1 = 29
Critical t = ±2.045 (95%)

 

Calculated t = 10.96 → Reject H₀

 

This means the medication significantly affects IQ.

 

Type I & Type II Errors

 

 

Decision

  

Reject Ho

Fail to Reject H₀

H₀ True

❌ Type I Error (false     positive)

              ✔ Correct

H₀ False

      ✔ Correct

  ❌ Type II Error (false negative)

Very important for:

 

  • Medical testing
  • Machine learning (confusion matrix)
  • Quality control

 

Chi-Square Test (Categorical Data)

 

Used when comparing observed vs expected frequencies.

 

Weight Distribution 2010 vs 2020

 

Calculated χ² = 26.66
Critical χ² = 5.991
Since 26.66 > 5.991 → Reject H₀

 

Conclusion: Weight distribution changed significantly over 10 years

 

ANOVA (Analysis of Variance)

 

Used to compare 3 or more group means.

 

Key Concepts:

  • Factors: Variables (e.g., medication)
  • Levels: Categories (e.g., 5 mg, 10 mg, 15 mg)

 

 Example:

 

Three medication dosages (15 mg, 30 mg, 45 mg)


Computed F = 43.46
Critical F = 3.5546
Since F > critical F → Reject H₀

 

Meaning: At least one dosage works better than others.

 

 Confidence Interval & Margin of Error

 

CI provides a range, not just a single estimate.

 

Example (CAT exam):

  • Mean = 520
  • σ = 100
  • n = 25
  • Z = 1.96

 

CI = (480.8, 559.2)


Meaning: With 95% confidence, the population mean lies in this range.

Conclusion

 

Hypothesis testing is the core of statistical decision-making.


From Z-tests to ANOVA, every method helps you answer one question:

 

“Is my observed data significantly different from what I expected?”

 

Using hypothesis testing, businesses improve decisions, researchers validate claims, and analysts turn raw data into meaningful insights.

 

Blog Contributed By Dr Prerna Singh