Confidence Interval: Range of values likely to contain the population parameter with specified confidence level
Formula: CI = x̄ ± z*(σ/√n) for large samples or x̄ ± t*(s/√n) for small samples
Margin of Error: Critical value × Standard error
- Identify sample statistics (x̄, s, n)
- Determine confidence level and critical value
- Calculate standard error
- Construct interval: x̄ ± critical value × standard error
Sample size (n) = 40, Sample mean (x̄) = 3.2, Sample standard deviation (s) = 0.4
For 95% confidence level, α = 0.05, α/2 = 0.025
Since n ≥ 30, use z-distribution: z₀.₀₂₅ = 1.96
SE = s/√n = 0.4/√40 = 0.4/6.325 = 0.0632
ME = z × SE = 1.96 × 0.0632 = 0.124
CI = x̄ ± ME = 3.2 ± 0.124 = (3.076, 3.324)
We are 95% confident that the true population mean GPA is between 3.076 and 3.324
• Large sample: Use z-distribution when n ≥ 30
• Confidence level: 95% means we expect 95% of intervals to contain true mean
• Standard error: SE = s/√n for sample standard deviation
Hypothesis Testing: Statistical procedure to evaluate claims about population parameters
Null Hypothesis (H₀): Statement of no effect or no difference (usually includes equality)
Alternative Hypothesis (H₁): Research hypothesis (opposite of null)
H₀: μ = 1000 (manufacturer's claim)
H₁: μ ≠ 1000 (two-tailed test)
α = 0.05
Since n < 30, use t-distribution: t = (x̄ - μ₀)/(s/√n)
t = (980 - 1000)/(50/√25) = -20/(50/5) = -20/10 = -2.0
Degrees of freedom = n - 1 = 24
For α = 0.05 (two-tailed), t₀.₀₂₅ = ±2.064
Since |t| = 2.0 < 2.064, we fail to reject H₀
At the 0.05 significance level, there is insufficient evidence to reject the manufacturer's claim that bulbs last 1000 hours on average
• Small sample: Use t-distribution when n < 30
• Decision rule: Reject H₀ if |test statistic| > critical value
• Two-tailed test: Alternative is ≠, so divide α by 2
Sampling Distribution: Distribution of sample statistics (like sample mean)
Central Limit Theorem: Sample mean approaches normal distribution as n increases
Standard Error: Standard deviation of sampling distribution = σ/√n
Population mean (μ) = 70 inches, Population standard deviation (σ) = 3 inches
Sample mean distribution: Mean = μ = 70 inches
Standard error = σ/√n = 3/√36 = 3/6 = 0.5 inches
Lower bound: Z₁ = (69.5 - 70)/0.5 = -0.5/0.5 = -1
Upper bound: Z₂ = (70.5 - 70)/0.5 = 0.5/0.5 = 1
P(69.5 < X̄ < 70.5) = P(-1 < Z < 1) = P(Z < 1) - P(Z < -1)
= 0.8413 - 0.1587 = 0.6826
The probability that the sample mean is between 69.5 and 70.5 inches is approximately 0.6826 or 68.26%
• Sampling distribution: X̄ ~ N(μ, σ²/n)
• Standard error: σ_X̄ = σ/√n
• Central Limit Theorem: Applies for n ≥ 30
Population Parameter: True value for entire population (μ, σ, p)
Sample Statistic: Value calculated from sample (x̄, s, p̂)
Statistical Inference: Process of drawing conclusions about population based on sample
- Data collection: Obtain representative sample
- Parameter identification: Determine what to estimate/test
- Method selection: Choose appropriate statistical procedure
- Calculation: Compute test statistic or confidence interval
- Interpretation: Draw conclusion in context
• Confidence interval: x̄ ± z*(s/√n) or x̄ ± t*(s/√n)
• Test statistic: t = (x̄ - μ₀)/(s/√n)
• Standard error: SE = s/√n
• Degrees of freedom: df = n - 1
Sample Proportion: p̂ = x/n where x is number of successes
Proportion Confidence Interval: p̂ ± z·√[p̂(1-p̂)/n]
Conditions: np̂ ≥ 5 and n(1-p̂) ≥ 5 for normal approximation
p̂ = x/n = 280/500 = 0.56
np̂ = 500 × 0.56 = 280 ≥ 5 ✓
n(1-p̂) = 500 × 0.44 = 220 ≥ 5 ✓
For 90% confidence, α = 0.10, α/2 = 0.05
z₀.₀₅ = 1.645
SE = √[p̂(1-p̂)/n] = √[0.56 × 0.44/500] = √[0.2464/500] = √0.0004928 = 0.0222
ME = z × SE = 1.645 × 0.0222 = 0.0365
CI = p̂ ± ME = 0.56 ± 0.0365 = (0.5235, 0.5965)
We are 90% confident that the true proportion of voters who favor candidate A is between 52.3% and 59.7%
• Proportion CI: p̂ ± z·√[p̂(1-p̂)/n]
• Normal approximation: Requires np̂ ≥ 5 and n(1-p̂) ≥ 5
• Standard error: For proportions, SE = √[p̂(1-p̂)/n]
Two-Sample Test: Compare means of two independent populations
Test Statistic: t = (x̄₁ - x̄₂)/√[(s₁²/n₁) + (s₂²/n₂)]
Null Hypothesis: H₀: μ₁ - μ₂ = 0 (no difference)
H₀: μ₁ - μ₂ = 0 (no difference in means)
H₁: μ₁ - μ₂ > 0 (Method A is better)
t = (x̄₁ - x̄₂)/√[(s₁²/n₁) + (s₂²/n₂)]
t = (85 - 82)/√[(5²/30) + (6²/25)] = 3/√[0.833 + 1.44] = 3/√2.273 = 3/1.508 = 1.99
For unequal variances, use Welch's formula or assume df ≈ smaller of (n₁-1, n₂-1) = 24
For α = 0.05 (one-tailed), with df = 24: t₀.₀₅ ≈ 1.711
Since t = 1.99 > 1.711, we reject H₀
At the 0.05 significance level, there is sufficient evidence to conclude that Method A produces significantly higher scores than Method B
• Two-sample t-test: For comparing means of two independent groups
• One-tailed test: H₁: μ₁ > μ₂ (directional alternative)
• Unequal variances: Use pooled or unpooled method based on assumption
Statistical Inference: The process of drawing conclusions about population parameters based on sample statistics
Point Estimate: Single value used to estimate a population parameter (e.g., x̄ estimates μ)
Interval Estimate: Range of values that likely contains the population parameter
Type I Error: Rejecting a true null hypothesis (false positive)
Type II Error: Failing to reject a false null hypothesis (false negative)
- Problem identification: Determine if estimating parameter or testing hypothesis
- Data analysis: Check assumptions and conditions
- Method selection: Choose appropriate statistical procedure
- Calculation: Compute test statistic or confidence interval
- Decision making: Compare to critical value or interpret interval
- Conclusion: State results in context of original problem
• Single mean CI: x̄ ± t*(s/√n) or x̄ ± z*(σ/√n)
• Single mean test: t = (x̄ - μ₀)/(s/√n)
• Proportion CI: p̂ ± z·√[p̂(1-p̂)/n]
• Two-sample test: t = (x̄₁ - x̄₂)/√[(s₁²/n₁) + (s₂²/n₂)]
• Decision rule: Reject H₀ if |test stat| > critical value
• Confidence level: (1 - α) × 100%
Confidence intervals, hypothesis tests, and sampling distributions
One-sample vs two-sample comparisons
Analysis: The chart shows different statistical inference procedures and their applications.
- Confidence intervals provide ranges for population parameters
- Hypothesis tests evaluate claims about population parameters
- Sampling distributions describe variability of sample statistics