Box Plot: A graphical representation showing the five-number summary of a dataset: minimum, Q1, median (Q2), Q3, and maximum.
Quartiles: Values that divide ordered data into four equal parts (Q1 = 25%, Q2 = 50%, Q3 = 75%).
- Arrange data in ascending order
- Find the median (Q2)
- Find Q1 (median of lower half)
- Find Q3 (median of upper half)
- Identify minimum and maximum values
- Draw the box plot with whiskers
Sorted data: 45, 52, 58, 63, 67, 71, 75, 78, 82, 85, 90
With 11 values, median is the 6th value: 71
Lower half: 45, 52, 58, 63, 67. Q1 = median = 58
Upper half: 75, 78, 82, 85, 90. Q3 = median = 82
Minimum = 45, Maximum = 90
| Statistic | Value |
|---|---|
| Minimum | 45 |
| Q1 | 58 |
| Median (Q2) | 71 |
| Q3 | 82 |
| Maximum | 90 |
The box plot has: Minimum = 45, Q1 = 58, Median = 71, Q3 = 82, Maximum = 90.
• Quartile calculation: Q1 = median of lower half, Q3 = median of upper half
• Five-number summary: Includes min, Q1, median, Q3, max
• Box boundaries: Box spans from Q1 to Q3
Interquartile Range (IQR): Q3 - Q1, representing the middle 50% of data.
Outliers: Data points beyond Q1 - 1.5×IQR or Q3 + 1.5×IQR.
Sorted data: 15, 18, 20, 22, 25, 27, 30, 32, 35, 38, 40, 55
With 12 values: Q1 = average of 3rd and 4th = (20+22)/2 = 21
Median = average of 6th and 7th = (27+30)/2 = 28.5
Q3 = average of 9th and 10th = (35+38)/2 = 36.5
IQR = Q3 - Q1 = 36.5 - 21 = 15.5
Lower boundary = Q1 - 1.5×IQR = 21 - 1.5×15.5 = 21 - 23.25 = -2.25
Upper boundary = Q3 + 1.5×IQR = 36.5 + 1.5×15.5 = 36.5 + 23.25 = 59.75
All values between -2.25 and 59.75 are within bounds. Since all data points fall within these limits, there are no outliers.
IQR = 15.5. There are no outliers in this dataset.
• IQR formula: IQR = Q3 - Q1
• Outlier detection: Values beyond Q1 - 1.5×IQR or Q3 + 1.5×IQR
• Boundaries: Lower = Q1 - 1.5×IQR, Upper = Q3 + 1.5×IQR
Interquartile Range (IQR): The range containing the middle 50% of data, from Q1 to Q3.
Percentiles: Q1 represents 25th percentile, Q3 represents 75th percentile.
Five-number summary: Min=32, Q1=45, Median=58, Q3=72, Max=85
By definition, 50% of data lies between Q1 and Q3
IQR = Q3 - Q1 = 72 - 45 = 27°F
The middle 50% of temperatures range from 45°F to 72°F
50% of days had temperatures between 45°F and 72°F. The range of the middle 50% is 27°F.
• Quartile interpretation: Q1 to Q3 contains exactly 50% of data
• IQR significance: Measures spread of central data
• Data distribution: Understanding what each part of the box plot represents
Box Plot: A graphical representation of the five-number summary showing the distribution of a dataset.
Five-Number Summary: Minimum, Q1, Median, Q3, Maximum values of a dataset.
Quartiles: Three values that divide ordered data into four equal parts.
Interquartile Range (IQR): The difference between Q3 and Q1, representing the middle 50% of data.
- Sort data: Arrange values in ascending order
- Find median (Q2): Middle value of the dataset
- Find Q1: Median of the lower half (excluding median if odd)
- Find Q3: Median of the upper half (excluding median if odd)
- Identify extremes: Find minimum and maximum values
- Draw plot: Create box from Q1 to Q3 with median line, add whiskers to extremes
Consistency Check: Verifying that calculated values match expected relationships.
Given: Q3 = 45, Q1 = 25, so IQR = 45 - 25 = 20 ✓
With 9 values, median should be the 5th value, which is 35 ✓
Lower half: 1st to 4th values, median of which should be Q1 = 25 ✓
Upper half: 6th to 9th values, median of which should be Q3 = 45 ✓
Minimum = 15, Maximum = 60 are the smallest and largest values ✓
All values are consistent. The box plot shows: Min=15, Q1=25, Median=35, Q3=45, Max=60.
• IQR relationship: IQR must equal Q3 - Q1
• Position verification: Check that values are in correct positions
• Consistency check: Ensure all relationships hold true
Variability: How spread out the data values are.
Central Tendency: Where the center of the data is located.
Dataset A: IQR = 40 - 20 = 20
Dataset B: IQR = 45 - 25 = 20
Dataset A median = 30, Dataset B median = 35
Dataset B is shifted higher
Dataset A range = 50 - 10 = 40
Dataset B range = 55 - 15 = 40
Each value in Dataset B is exactly 5 units higher than Dataset A
Both datasets have the same variability (IQR = 20). Dataset B is shifted 5 units higher than Dataset A.
• Comparative analysis: Compare IQR for variability
• Position comparison: Compare medians and overall ranges
• Pattern recognition: Identifying shifts in distribution
Box Plot (Box-and-Whisker Plot): A standardized way of displaying the distribution of data based on a five-number summary.
Quartiles: Three values that divide sorted data into four equal parts: Q1 (25th percentile), Q2 (50th percentile/median), Q3 (75th percentile).
Interquartile Range (IQR): The range between Q1 and Q3, representing the middle 50% of the data.
Outliers: Data points that fall significantly outside the overall pattern of the data.
- Data Collection: Gather all data points to be represented
- Sorting: Arrange data in ascending order
- Quartile Calculation: Find Q1, Q2 (median), and Q3
- Extremes Identification: Determine minimum and maximum values
- Outlier Detection: Identify values beyond Q1-1.5×IQR or Q3+1.5×IQR
- Plot Construction: Draw box from Q1 to Q3 with median line, add whiskers to non-outlier extremes
• Quartile positions: Q1 = (n+1)/4th value, Q3 = 3(n+1)/4th value
• Interquartile Range: IQR = Q3 - Q1
• Outlier boundaries: Lower = Q1 - 1.5×IQR, Upper = Q3 + 1.5×IQR
• Five-number summary: Min, Q1, Median, Q3, Max
• Percentile interpretation: 25% below Q1, 50% below median, 75% below Q3