Mean: The average value calculated by adding all values and dividing by the number of values
- Calculate the mean for each data set separately
- Compare the resulting means
- Draw conclusions about which distribution has higher values on average
Sum = 75 + 80 + 85 + 90 + 95 = 425
Mean = 425 ÷ 5 = 85
Sum = 65 + 70 + 80 + 85 + 90 = 390
Mean = 390 ÷ 5 = 78
85 > 78, so Class A has a higher mean
Class A has a higher mean score (85) compared to Class B (78)
• Formula: Mean = Sum of all values ÷ Number of values
• Comparison: Higher mean indicates higher average values
• Sensitivity: Mean is affected by outliers
Median: The middle value when data is arranged in order from least to greatest
Set 1: Data is already ordered, middle value is 18
Set 2: Data is already ordered, middle value is 16
18 > 16, so Set 1 has the higher median
Set 1 has a higher median, indicating its middle value is greater
Set 1 has a higher median (18) compared to Set 2 (16)
• Ordering: Always arrange data from least to greatest
• Robustness: Median is less affected by outliers
• Position: Median represents the 50th percentile
Range: The difference between the maximum and minimum values in a data set
Min = 5, Max = 25
Range = 25 - 5 = 20
Min = 8, Max = 24
Range = 24 - 8 = 16
20 > 16, so Set X has greater variability
Set X has greater variability with a range of 20 compared to Set Y's range of 16
• Formula: Range = Maximum - Minimum
• Variability: Larger range indicates greater spread
• Limitation: Range only considers extremes, not internal distribution
Data Distribution: The pattern of how data values are arranged across different values
Central Tendency: Measures that represent the center of a data distribution
Spread/Variability: Measures that describe how data values are dispersed
Shape: The pattern of the distribution (symmetric, skewed, uniform)
Outlier: A data point that is significantly different from others
Skewness: Asymmetry in the distribution of data
- Organize data: Order data sets from least to greatest
- Calculate measures: Find means, medians, and ranges
- Compare central tendencies: See which distribution has higher values
- Compare spreads: Determine which distribution has more variability
- Identify patterns: Look for outliers, skewness, or clustering
- Draw conclusions: Interpret what differences mean in context
Distribution Shape: The pattern formed by data points when plotted, indicating symmetry or skewness
Set A: 1, 2, 3, 4, 5, 6, 7 - values are evenly spaced and increase uniformly
This creates a uniform distribution that is symmetric
Set B: 1, 1, 1, 2, 6, 7, 7 - values cluster at low and high ends
This creates a bimodal distribution with peaks at 1 and 7
Set A is uniform and symmetric, Set B is bimodal with gaps in the middle
Set A has a uniform distribution shape, while Set B has a bimodal distribution with clustering at the extremes
• Shape identification: Look for patterns, clusters, gaps, and peaks
• Symmetry: Distribution balanced on both sides of center
• Modality: Number of peaks (unimodal, bimodal, multimodal)
Comprehensive Analysis: Examining all aspects of data distributions simultaneously
Mean = (70+75+80+85+90+95)÷6 = 495÷6 = 82.5
Median = (80+85)÷2 = 82.5 (average of two middle values)
Range = 95-70 = 25
Mean = (60+70+80+80+90+100)÷6 = 480÷6 = 80
Median = (80+80)÷2 = 80 (average of two middle values)
Range = 100-60 = 40
Alpha has higher mean (82.5 vs 80) and median (82.5 vs 80)
Beta has larger range (40 vs 25), indicating more variability
Alpha: Evenly distributed scores, symmetric
Beta: Has mode at 80, wider spread suggesting more variation
Beta: Mean=80, Median=80, Range=40
Class Alpha has higher average scores (mean=82.5, median=82.5) but less variability (range=25). Class Beta has lower average scores (mean=80, median=80) but more variability (range=40).
• Multiple measures: Consider mean, median, and range together
• Context: Higher mean doesn't necessarily mean better performance
• Variability: Larger range indicates less consistency
Data Distribution: The pattern of how data values are arranged across different values
Central Tendency: Measures (mean, median, mode) that represent the center of a distribution
Spread/Variability: Measures (range, interquartile range) that describe data dispersion
Shape: The pattern of the distribution (symmetric, skewed, uniform, bimodal)
Outlier: A data point significantly different from others
Skewness: Asymmetry in the distribution of data
Modality: Number of peaks in a distribution (unimodal, bimodal, multimodal)
- Data organization: Order each distribution from least to greatest
- Central tendency: Calculate means and medians for each distribution
- Variability measures: Calculate ranges for each distribution
- Shape analysis: Examine patterns, clusters, gaps, and outliers
- Comparison: Compare corresponding measures between distributions
- Interpretation: Explain what differences mean in context
• Central tendency: Mean (affected by outliers), Median (robust)
• Variability: Range (simple but limited), Interquartile range (better measure)
• Shape: Symmetric, skewed left/right, uniform, bimodal
• Comparison: Higher mean/median indicates generally higher values