| Dog | Cat | Total | |
|---|---|---|---|
| Male | 12 | 8 | 20 |
| Female | 15 | ? | 25 |
| Total | 27 | ? | 45 |
Two-Way Table: A table that organizes data by two categories to show the relationship between them
- Identify what values are missing
- Use the fact that rows and columns must sum correctly
- Apply the principle that totals must match
Female total = Female Dog + Female Cat owners
25 = 15 + Female Cat owners
Female Cat owners = 25 - 15 = 10
Total Cat = Male Cat + Female Cat
Total Cat = 8 + 10 = 18
Total = Total Dog + Total Cat = 27 + 18 = 45 ✓
| Dog | Cat | Total | |
|---|---|---|---|
| Male | 12 | 8 | 20 |
| Female | 15 | 10 | 25 |
| Total | 27 | 18 | 45 |
• Row sums: Each row must equal its total
• Column sums: Each column must equal its total
• Grand total: Row total sum = Column total sum
Relative Frequency: The proportion of a category compared to the total, expressed as a decimal or percentage
Male dog owners = 12 people
Total surveyed = 45 people
Relative frequency = Part/Total = 12/45 = 0.267 or 26.7%
The relative frequency of male dog owners is 0.267 or 26.7%
• Formula: Relative frequency = (specific value)/(total)
• Range: Between 0 and 1 (or 0% and 100%)
• Interpretation: Represents proportion of total
Conditional Probability: The probability of an event occurring given that another event has occurred
We're looking only at females (25 people)
Among females, 10 own cats
P(Cat|Female) = Females with cats / Total females = 10/25 = 0.4
The probability that a female owns a cat is 0.4 or 40%
• Formula: P(A|B) = P(A and B) / P(B)
• Condition: Focus only on the given group
• Range: Between 0 and 1 (or 0% and 100%)
Two-Way Table: A table organizing data by two categorical variables
Joint Frequency: The count in each individual cell
Marginal Frequency: The totals for each row and column
Relative Frequency: Proportion of a value compared to the total
Conditional Probability: Probability of an event given another event occurred
Independence: When knowledge of one variable doesn't affect the probability of another
- Understand structure: Identify row and column categories
- Find joint frequencies: Values in each cell
- Calculate marginal frequencies: Row and column totals
- Compute relative frequencies: Proportions of each value
- Determine conditional probabilities: Probabilities given conditions
- Look for associations: Patterns between variables
Association: When knowing one variable affects the probability of another variable
| Likes Sports | Doesn't Like | Total | |
|---|---|---|---|
| Male | 30 | 20 | 50 |
| Female | 25 | 25 | 50 |
| Total | 55 | 45 | 100 |
P(Likes Sports | Male) = 30/50 = 0.6
P(Likes Sports | Female) = 25/50 = 0.5
Since 0.6 ≠ 0.5, there appears to be an association between gender and sports preference
Yes, there is an association. Males (60%) are more likely to like sports than females (50%).
• Association detection: Compare conditional probabilities
• Independence: If probabilities are equal, variables are independent
• Pattern recognition: Different probabilities suggest association
Multi-step Analysis: Performing multiple calculations using the same table
Students who like sports = 55, Total students = 100
Relative frequency = 55/100 = 0.55 or 55%
This is conditional probability: P(Male | Likes Sports)
Students who are male AND like sports = 30
Students who like sports = 55
P(Male | Likes Sports) = 30/55 ≈ 0.545 or 54.5%
55% of all students like sports, and among those who like sports, 54.5% are male
a) The relative frequency of students who like sports is 0.55 (55%)
b) The probability that a student is male given they like sports is approximately 0.545 (54.5%)
• Relative frequency: Part divided by total
• Conditional probability: Focus on the given condition
• Formula: P(A|B) = P(A and B) / P(B)
Two-Way Table: Displays frequency data for two categorical variables
Joint Frequency: Individual cell counts representing intersection of categories
Marginal Frequency: Row and column totals showing overall category frequencies
Relative Frequency: Proportion of a value compared to the total (between 0 and 1)
Conditional Probability: Probability of an event given that another event occurred
Association: When variables are related (knowledge of one affects probability of the other)
Independence: When variables are unrelated (probability of one doesn't depend on the other)
- Structure identification: Determine row and column categories
- Value organization: Fill in all known and missing values
- Frequency calculation: Find joint, marginal, and relative frequencies
- Probability computation: Calculate conditional probabilities
- Association detection: Compare conditional probabilities
- Interpretation: Explain what the data reveals
• Relative frequency: (specific value) / (total)
• Conditional probability: P(A|B) = P(A and B) / P(B)
• Grand total verification: Sum of row totals = Sum of column totals
• Association detection: Compare P(A|B) with P(A|not B)