Day 22

Math 216: Statistical Thinking

Bastola

Introduction to Hypothesis Testing

Hypothesis testing is a statistical method used to decide whether to support or refute a specific claim about a population parameter based on sample data.
Analogous to a judicial process: The null hypothesis (\(H_0\)) represents the “status quo” (e.g., “innocent until proven guilty”).

Elements of a Test of Hypothesis

City regulations require residential sewer pipes to have an average breaking strength greater than 2,400 pounds per foot.
Manufacturers must demonstrate that their products meet this standard.

Hypotheses

Null Hypothesis (\(H_0\)): In theory, \(\mu \leq 2400\). If we reject \(\mu = 2400\), \(\mu < 2400\) is also rejected.
Alternative Hypothesis (\(H_a\)): \(\mu > 2400\) (Pipes meet or exceed the standard).

Testing the Hypothesis

Computation of Test Statistic

By the Central Limit Theorem, for large samples (\(n \geq 30\)), \(\bar{x}\) approximates a normal distribution.
Test statistic \(z\): \[ z = \frac{\bar{x} - \mu_0}{\sigma/\sqrt{n}} \] where \(\sigma\) is the population standard deviation and \(\mu_0 = 2400\).

Decision Making

If the observed \(\bar{x}\) exceeds 1.645 standard deviations above 2,400, we consider rejecting \(H_0\).
Rejection implies that a rare event (occurring with probability \(\leq 0.05\) under \(H_0\)) is unlikely without true cause (i.e., \(H_a\) is likely true).

Evidence from Sample Data

Sample Characteristics:
- Number of samples (\(n\)): 50
- Mean strength (\(\bar{x}\)): 2,460 pounds per linear foot
- Standard deviation (\(s\)): 200 pounds per linear foot
Test Statistic Calculation:
- The test statistic (\(z\)) is calculated as follows: \[ z = \frac{\bar{x} - 2400}{s/\sqrt{n}} \approx \frac{2460 - 2400}{200 / \sqrt{50}} = \frac{60}{28.28} \approx 2.12 \]

Types of Errors

Type I Error

Analogy: Convicting an innocent person.
Occurs if we reject \(H_0\) when it is, in fact, true.
Probability of Type I error is denoted by \(\alpha\) (commonly set at 0.05).

Types of Errors

Type II Error

Analogy: Acquitting a guilty person.
Occurs if we fail to reject \(H_0\) when, in fact, \(H_a\) is true.
Probability of Type II error is denoted by \(\beta\).
Unlike \(\alpha\), \(\beta\) is not usually pre-specified but affected by the sample size, effect size, and the set \(\alpha\).

Types of Errors

Rejection Region

The set of values of the test statistic that lead us to reject \(H_0\).
For this test, it’s values greater than the z-value corresponding to a 0.05 probability in the upper tail of the normal distribution.

Summary

Hypothesis Testing as a Judicial Process:
- \(H_0\): The defendant (population parameter) is presumed innocent (equals 2400) until proven otherwise.
- \(H_a\): Claims the defendant is not innocent (greater than 2400).
- We examine the “evidence” (sample data) to decide whether it sufficiently supports \(H_a\) over \(H_0\).