Mean (Average): The sum of all values divided by the number of values.
Formula: Mean = (Sum of all values) ÷ (Number of values)
- Add all the values together
- Count the number of values
- Divide the sum by the number of values
- Round to appropriate decimal places if necessary
85 + 92 + 78 + 96 + 88 + 91 + 83 = 613
There are 7 test scores
Mean = 613 ÷ 7 = 87.571428...
Mean ≈ 87.6 (rounded to one decimal place)
The mean of the test scores is 87.6.
• Mean Formula: Sum of values divided by count
• Arithmetic: Perform addition before division
• Rounding: Round to one more decimal than original data
Median: The middle value when data is arranged in ascending order.
For odd number of values: The middle value
For even number of values: Average of the two middle values
Original: 12, 18, 15, 21, 10, 16, 19, 14
Sorted: 10, 12, 14, 15, 16, 18, 19, 21
There are 8 values (even number)
Position 4: 15, Position 5: 16
These are the 4th and 5th values in the sorted list
Median = (15 + 16) ÷ 2 = 31 ÷ 2 = 15.5
The median of the data set is 15.5.
• Sorting Required: Always arrange data in order first
• Odd Count: Median is the middle value
• Even Count: Median is average of two middle values
Mode: The value that appears most frequently in a data set.
Unimodal: One mode exists
Bimodal: Two modes exist
Trimodal: Three modes exist
No Mode: All values appear equally often
Data: 5, 7, 3, 8, 5, 2, 7, 5, 9, 7
2: appears 1 time
3: appears 1 time
5: appears 3 times
7: appears 3 times
8: appears 1 time
9: appears 1 time
Values 5 and 7 both appear 3 times (highest frequency)
Since two values tie for highest frequency, both are modes
Mode = 5 and 7 (bimodal)
The modes of the data set are 5 and 7. This is a bimodal distribution.
• Frequency Count: Tally how many times each value appears
• Multiple Modes: More than one value can be a mode
• No Mode: If all values appear equally, no mode exists
Mean: The arithmetic average of all values in a data set
Median: The middle value when data is arranged in ascending order
Mode: The value that appears most frequently in a data set
Range: The difference between the maximum and minimum values
Unimodal: A data set with one mode
Bimodal: A data set with two modes
Outlier: A value that is significantly different from other values
- Mean Calculation: Add all values, divide by count
- Median Calculation: Sort data, find middle value(s)
- Mode Identification: Count frequencies, identify most common
- Range Calculation: Subtract minimum from maximum
- Outlier Detection: Look for extreme values that may affect measures
- Comparison: Compare measures to understand data distribution
Outlier: A value that is significantly different from other values in the data set.
Effect on Mean: Outliers strongly affect the mean because it uses all data values.
Effect on Median: Outliers have less effect on the median since it depends on position.
Value 100 is significantly larger than other values (15-20 range)
This is the outlier.
Mean with outlier: (15+18+20+17+16+19+18+100)÷8 = 203÷8 = 25.375 ≈ 25.4
Sorted data: 15, 16, 17, 18, 18, 19, 20, 100
Median: (18+18)÷2 = 18
Mean without outlier: (15+18+20+17+16+19+18)÷7 = 123÷7 = 17.57 ≈ 17.6
Sorted data: 15, 16, 17, 18, 18, 19, 20
Median: 18 (middle value)
Mean changed from 17.6 to 25.4 (increase of 7.8)
Median remained at 18 (no change)
The outlier significantly increased the mean but did not affect the median.
This shows that mean is sensitive to extreme values.
Without outlier: Mean=17.6, Median=18
With outlier: Mean = 25.4, Median = 18. Without outlier: Mean = 17.6, Median = 18. The outlier significantly increased the mean but had no effect on the median.
• Outlier Sensitivity: Mean is sensitive to outliers, median is resistant
• Data Analysis: Always consider outliers when interpreting measures
• Robust Measures: Median is more robust than mean for skewed data
a) Salaries at a company with one CEO earning much more than employees
b) Favorite ice cream flavor among students
c) Typical home prices in a neighborhood
Appropriate Measure: The measure that best represents the typical value in a data set.
Skewed Data: Data that is not symmetric, with a tail extending in one direction.
Categorical Data: Data that represents categories rather than numerical values.
The CEO's salary is an extreme outlier that would skew the mean upward.
The median represents the typical employee salary better.
This is categorical data, not numerical.
Only the mode makes sense (most popular flavor).
Home prices often have extreme outliers (luxury homes).
The median better represents typical home prices.
a) Median: Resistant to outliers, shows typical employee salary
b) Mode: Only appropriate for categorical data
c) Median: Less affected by luxury home outliers
Always consider skewness, data type, and presence of outliers when choosing.
a) Median - resistant to CEO's high salary outlier
b) Mode - appropriate for categorical data
c) Median - resistant to luxury home outliers
• Data Type: Use mode for categorical data
• Outlier Presence: Use median when outliers exist
• Distribution Shape: Mean for symmetric, median for skewed
Mean: The arithmetic average of all values in a data set, calculated by adding all values and dividing by the count
Median: The middle value when data is arranged in ascending order; if even number of values, it's the average of the two middle values
Mode: The value that appears most frequently in a data set; there can be one, multiple, or no modes
Range: The difference between the maximum and minimum values in a data set
Outlier: A data point that is significantly different from other values in the data set
Skewed Distribution: A distribution where data is not symmetric, with a longer tail on one side
- Mean Calculation: Add all values and divide by the number of values
- Median Calculation: Sort data in ascending order, then find the middle value(s)
- Mode Identification: Count the frequency of each value and identify the most frequent
- Range Calculation: Subtract the minimum value from the maximum value
- Outlier Analysis: Identify extreme values and assess their impact on measures
- Measure Selection: Choose the most appropriate measure based on data characteristics
• Mean Formula: Mean = (Sum of all values) ÷ (Number of values)
• Median Rule: Middle value when sorted (average of two middle values if even count)
• Mode Rule: Value that appears most frequently
• Range Formula: Range = Maximum - Minimum
• Outlier Impact: Mean is sensitive, median is resistant
Set A: 10, 12, 14, 15, 16, 18, 20 (symmetric)
Set B: 5, 7, 9, 11, 13, 15, 50 (skewed right)
Set C: 2, 2, 3, 4, 5, 5, 5 (with mode)
Analysis: The visualization shows how different distributions affect measures of center.
- Set A: Mean ≈ Median (symmetric distribution)
- Set B: Mean > Median (skewed right by outlier)
- Set C: Mode is prominent at value 5