AP · Chi-Square Test for Homogeneity · 14 min read · Updated 2026-05-10

Chi-Square Test for Homogeneity — AP Statistics Study Guide

For: Students preparing for the AP Statistics exam.

Covers: Hypothesis formulation, expected count calculation, chi-square test statistic computation, degrees of freedom calculation, inference conditions, p-value interpretation, and contextual conclusions for tests of distribution homogeneity across multiple populations.

You should already know: Basics of hypothesis testing for categorical data, how to read and interpret contingency tables, the core mechanics of the chi-square test for goodness of fit.

A note on the practice questions: All worked questions in the "Practice Questions" section below are original problems written by us in the AP Statistics style for educational use. They are not reproductions of past College Board / Cambridge / IB papers and may differ in wording, numerical values, or context. Use them to practise the technique; cross-check with official mark schemes for grading conventions.

1. What Is Chi-Square Test for Homogeneity?

The chi-square test for homogeneity is a hypothesis-based inference procedure used to test whether the distribution of a single categorical response variable is homogeneous (identical) across two or more independent populations or treatment groups.

A common source of confusion early on is distinguishing this test from the other two chi-square tests on the AP exam: unlike the chi-square test for goodness-of-fit (which compares one sample’s distribution to a single hypothesized distribution) and the chi-square test for independence (which tests for association between two categorical variables from a single population sample), the homogeneity test starts with separate, independent random samples from each population of interest.

This topic is part of Unit 8: Inference for Categorical Data: Chi-Square in the AP Statistics CED, which the CED weights at 2–5% of the multiple-choice section. Chi-square test for homogeneity questions can appear on both the multiple-choice (MCQ) and free-response (FRQ) sections of the exam, either as a full inference FRQ or as a subpart of a longer inference FRQ.

2. Hypotheses and Conditions for Inference

The first step in any chi-square test for homogeneity is correctly stating the null and alternative hypotheses, then verifying that all conditions for inference are met.

For hypotheses, the null hypothesis ( $H_{0}$ ) always states that the distribution of the categorical variable is the same across all populations. The alternative hypothesis ( $H_{a}$ ) states that at least one population has a different distribution of the categorical variable. In formal notation, for $k$ populations and $m$ categories of the response variable:

$H_{0} : p_{i 1} = p_{i 2} = \cdots = p_{ik} \text{ for all } i = 1, 2, \ldots, m$

$H_{a} : \text{at least one } p_{ij} \text{ differs across populations}$

where $p_{ij}$ is the proportion of population $j$ that falls into response category $i$ .

Three conditions must be satisfied for the test to be valid per the AP CED:

Random: Data come from independent random samples from each population, or from groups in a randomized controlled experiment.
Independence: Individual observations are independent within each sample. When sampling without replacement, each sample must be less than 10% of the population it was drawn from.
Large Counts: The expected count for every cell in the contingency table is at least 5.
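As a rough illustration, the two numerical conditions can be checked mechanically. The Python sketch below is not an AP requirement; the function names, the population sizes, and the expected counts in the usage lines are all hypothetical, chosen only to show which comparison each condition calls for.

```python
def ten_percent_ok(sample_sizes, population_sizes):
    """Each sample must be under 10% of its OWN population (sampling without replacement)."""
    return all(n < 0.10 * N for n, N in zip(sample_sizes, population_sizes))


def large_counts_ok(expected_counts, minimum=5):
    """Every EXPECTED count (not observed count) must be at least `minimum`."""
    return all(e >= minimum for row in expected_counts for e in row)


# Hypothetical numbers for illustration only.
samples = [80, 100, 60]
populations = [900, 1200, 500]           # assumed sizes; 60 is not below 10% of 500
expected = [[7.1, 9.4], [12.0, 15.5]]    # assumed expected counts

print(ten_percent_ok(samples, populations))
print(large_counts_ok(expected))
```

Note that the 10% check is made sample by sample, so one small population is enough to fail it even when the other samples pass.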

Worked Example

A high school counselor wants to test whether the distribution of post-graduation plans (4-year college, 2-year college, full-time work, gap year) is the same for seniors in the general education, honors, and IB programs at her school. She takes independent random samples of 80 general ed, 100 honors, and 60 IB seniors. The smallest expected count in her contingency table is 7.1. State the hypotheses and check conditions for inference.

Define parameters: Let $p_{ij}$ be the proportion of program $j$ that plans for outcome $i$ , where $i$ = 1 (4-year) to 4 (gap year), and $j$ = 1 (gen ed), 2 (honors), 3 (IB).
State hypotheses: $H_{0}$ : The distribution of post-graduation plans is homogeneous across the three programs. $H_{a}$ : The distribution of post-graduation plans differs across at least one of the three programs.
Check Random: Independent random samples were taken from each program, so this condition is satisfied.
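Check Independence: Each sample must be less than 10% of its own population: 80 must be less than 10% of all general ed seniors, 100 less than 10% of all honors seniors, and 60 less than 10% of all IB seniors. The problem does not give these program sizes, so the 10% condition cannot be confirmed from the information given; state it as an assumption and proceed.
Check Large Counts: The smallest expected count is 7.1, which is greater than 5, so this condition is satisfied.

With the 10% condition stated as an assumption, all conditions for inference are met.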

Exam tip: Always state hypotheses in the context of the problem, not just generic symbols. AP exam readers require contextual hypotheses to award full credit for the inference step.

3. Test Statistic and Degrees of Freedom Calculation

Once you have confirmed hypotheses and conditions, the next step is to calculate the chi-square test statistic and degrees of freedom.

First, calculate the expected count for each cell in an $r \times c$ contingency table, where $r$ is the number of rows (response categories) and $c$ is the number of columns (populations/groups). The formula for expected count for the cell in row $i$ , column $j$ is:

$E_{ij} = \frac{(\text{Row Total}) \times (\text{Column Total})}{\text{Grand Total}}$

Intuition: If the null hypothesis of homogeneity is true, the overall proportion of observations in each row should apply equally to all columns, so we multiply the column sample size by the overall row proportion to get the count we expect if $H_{0}$ is true.

Once all expected counts are calculated, the chi-square test statistic is:

$χ^{2} = \sum_{\text{all cells}} \frac{( O - E ) ^{2}}{E}$

where $O$ is the observed count for the cell and $E$ is the expected count. Larger values of $χ^{2}$ indicate larger differences between observed and expected counts, so stronger evidence against $H_{0}$ .

Degrees of freedom (df) for the test are calculated as: $df = (r - 1) (c - 1)$ This df is used to find the p-value from the right-skewed chi-square distribution.
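The three formulas above can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed method: the function name `chi_square_homogeneity` and the 2x2 table in the usage lines are made up for the example, and the table is assumed to be a list of rows with response categories as rows and groups as columns.

```python
def chi_square_homogeneity(observed):
    """Return (expected_counts, chi_square, df) for a table given as a list of rows."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)

    # Expected count: row total times column total, divided by grand total.
    expected = [[r * c / grand_total for c in col_totals] for r in row_totals]

    # Sum (O - E)^2 / E over every cell of the table.
    chi_square = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return expected, chi_square, df


# Made-up 2x2 table: two response categories (rows), two groups (columns).
expected, chi_square, df = chi_square_homogeneity([[30, 20], [20, 30]])
print(expected)     # [[25.0, 25.0], [25.0, 25.0]]
print(chi_square)   # 4.0
print(df)           # 1
```

In the made-up table every expected count is 25 and every cell is off by 5, so each of the four cells contributes $5^{2}/25 = 1$ to the statistic.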

Worked Example

A market researcher tests whether the distribution of snack preference (salty, sweet, savory, other) is homogeneous across three age groups: children, teens, adults. The observed contingency table is below:

Preference	Children	Teens	Adults	Row Total
Salty	25	20	15	60
Sweet	22	28	20	70
Savory	10	12	28	50
Other	3	5	7	15
Col Total	60	65	70	195

Calculate the chi-square test statistic and degrees of freedom for this test.

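Calculate expected counts for each cell: For example, Salty/Children: $E = (60 \times 60) /195 \approx 18.46$ , Savory/Adults: $E = (50 \times 70) /195 \approx 17.95$ .
Calculate the $\frac{( O - E ) ^{2}}{E}$ term for each cell, then sum all terms. For example, Savory/Adults contributes $(28 - 17.95)^{2} /17.95 \approx 5.63$ , the largest single term. The twelve terms sum to $χ^{2} \approx 16.16$ .
Calculate df: $r = 4$ (preference categories), $c = 3$ (age groups), so $df = (4 - 1) (3 - 1) = 6$ .

The test statistic is $χ^{2} \approx 16.16$ with 6 degrees of freedom. Note that the Other/Children expected count is $(15 \times 60) /195 \approx 4.62$ , slightly below 5, so a full test on these data would have to acknowledge that the Large Counts condition is not strictly met.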

Exam tip: Always label the row and column totals correctly when plugging into the expected count formula. Mixing up row and column totals will give the wrong expected count and lose points on FRQ.
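To check the snack preference calculation cell by cell, the short Python sketch below recomputes every expected count and every $(O - E)^{2}/E$ term from the observed table. The variable names are our own; only the counts come from the table above. Run on these counts, the terms sum to about 16.16, and the smallest expected count (Other/Children) is about 4.62.

```python
rows = ["Salty", "Sweet", "Savory", "Other"]
cols = ["Children", "Teens", "Adults"]
observed = [
    [25, 20, 15],
    [22, 28, 20],
    [10, 12, 28],
    [3, 5, 7],
]

row_totals = [sum(r) for r in observed]
col_totals = [sum(c) for c in zip(*observed)]
grand_total = sum(row_totals)

chi_square = 0.0
smallest_expected = float("inf")
for i, row_name in enumerate(rows):
    for j, col_name in enumerate(cols):
        expected = row_totals[i] * col_totals[j] / grand_total
        contribution = (observed[i][j] - expected) ** 2 / expected
        chi_square += contribution
        smallest_expected = min(smallest_expected, expected)
        print(f"{row_name:>6} / {col_name:<8}  E = {expected:6.2f}  term = {contribution:5.2f}")

df = (len(rows) - 1) * (len(cols) - 1)
print(f"chi-square = {chi_square:.2f}, df = {df}")
print(f"smallest expected count = {smallest_expected:.2f}")
```

Listing the individual terms is also a quick way to see which cells drive the result: here the Savory/Adults cell contributes far more than any other.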

4. Drawing a Conclusion in Context

The final step of any hypothesis test is to compare the p-value to the pre-specified significance level $α$ (almost always $α = 0.05$ on the AP exam, unless stated otherwise) and draw a conclusion that answers the original research question in context.

For chi-square tests, the p-value is the probability of observing a test statistic as large or larger than the one calculated, if the null hypothesis of homogeneity is true. Because larger $χ^{2}$ values indicate more evidence against $H_{0}$ , all p-values correspond to the area to the right of the test statistic under the chi-square distribution with the appropriate degrees of freedom.

The decision rule is straightforward: if $p$ -value $< α$ , reject $H_{0}$ . If $p$ -value $\geq α$ , fail to reject $H_{0}$ . A critical rule to remember is that you never "accept" the null hypothesis; you only fail to reject it, because we cannot prove the distributions are identical, only that we do not have enough evidence to say they differ.
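On the exam you will read the p-value from a chi-square table or a calculator, but the right-tail area and the decision rule can also be sketched in Python. The function below is a shortcut under a stated assumption: it uses a closed form that holds only when the degrees of freedom are even (as with df = 2, 4, or 6), and the function names and the statistic in the usage lines are made up for illustration.

```python
import math


def chi_square_p_value_even_df(statistic, df):
    """Right-tail area P(X >= statistic) for a chi-square variable with EVEN df.

    Uses exp(-x/2) * sum over k < df/2 of (x/2)^k / k!, valid only for even df.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this shortcut only handles positive even df")
    half = statistic / 2
    return math.exp(-half) * sum(half ** k / math.factorial(k) for k in range(df // 2))


def decide(p_value, alpha=0.05):
    """Apply the decision rule: reject only when the p-value is below alpha."""
    return "reject H0" if p_value < alpha else "fail to reject H0"


# Made-up statistic and df, for illustration only.
p_value = chi_square_p_value_even_df(10.0, 4)
print(round(p_value, 4), decide(p_value))   # 0.0404 reject H0
print(decide(0.08))                         # fail to reject H0
```

The `decide` function returns "fail to reject H0" rather than "accept H0" on purpose, mirroring the wording rule above.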

A template for full credit on AP exams is: "Since [p-value < α / p-value ≥ α], we [reject / fail to reject] $H_{0}$ . There [is / is not] sufficient evidence at the $α = 0.05$ significance level to conclude that the distribution of [response variable] differs across at least one of the [populations/groups]."

Worked Example

Using the snack preference example from Section 3, we have $χ^{2} \approx 16.16$ with $df = 6$ . The p-value for this test is between 0.01 and 0.02. Using $α = 0.05$ , state an appropriate conclusion.

Compare p-value to $α$ : The p-value is between 0.01 and 0.02, which is less than $α = 0.05$ .
Make the decision: We reject the null hypothesis of homogeneity.
Conclusion in context: There is sufficient evidence at the 0.05 significance level to conclude that the distribution of snack preference differs across at least one of the three age groups (children, teens, adults).

If the p-value had been 0.08, the conclusion would be: "Since 0.08 > 0.05, we fail to reject $H_{0}$ . There is not sufficient evidence at the 0.05 significance level to conclude that the distribution of snack preference differs across the three age groups."

Exam tip: Never skip the context step in your conclusion. AP exam readers require a contextual conclusion to award the final point for the test, even if all calculations are correct.

5. Common Pitfalls (and how to avoid them)

Wrong move: Confusing the chi-square test for homogeneity with the chi-square test for independence, and describing hypotheses incorrectly for the study design. Why: Both tests use identical calculation methods for test statistic and df, so students often mix up the research question. Correct move: Always check the study design first: if you have independent samples from multiple populations and one response variable, it is homogeneity; if you have one sample and two categorical variables measured on each unit, it is independence.
Wrong move: Using the goodness-of-fit df formula $df = (\text{number of categories}) - 1$ instead of $df = (r - 1) (c - 1)$ for a homogeneity test. Why: Students memorize goodness-of-fit first and confuse the two formulas. Correct move: For any chi-square test on a contingency table (homogeneity or independence), always use $df = (r - 1) (c - 1)$ .
Wrong move: Claiming you "accept the null hypothesis" when the p-value is larger than $α$ . Why: Students get lazy with wording and forget that failing to reject is not the same as proving the null true. Correct move: Always use the phrasing "fail to reject the null hypothesis" when $p \geq α$ .
Wrong move: Checking the large counts condition against observed counts instead of expected counts. Why: Students see the "all counts ≥ 5" rule and check the given observed counts by default. Correct move: Always confirm all expected counts are ≥ 5 when checking the large counts condition.
Wrong move: Stating the alternative hypothesis as "all populations have different distributions" instead of "at least one population has a different distribution". Why: Students generalize incorrectly, assuming the alternative requires all groups to differ. Correct move: Always state the alternative as "at least one population has a different distribution of the response variable".
Wrong move: Calculating expected counts as $\frac{Row Total \times Grand Total}{Column Total}$ , swapping the denominator. Why: Students rush when doing calculations and misremember the formula order. Correct move: Memorize "row times column over grand total" and write the formula down before plugging in numbers.

6. Practice Questions (AP Statistics Style)

Question 1 (Multiple Choice)

A political scientist tests whether the distribution of party affiliation (Democrat, Republican, Independent) is homogeneous across four regions of the US. He takes independent random samples of 100 registered voters from each region, and calculates a chi-square test statistic of 12.3. What is the range of the p-value for this test?
A) $0.01 < p < 0.025$
B) $0.025 < p < 0.05$
C) $0.05 < p < 0.10$
D) $0.10 < p < 0.20$

Worked Solution: First, calculate degrees of freedom: we have 3 party affiliation categories (rows, $r = 3$ ) and 4 regions (columns, $c = 4$ ), so $df = (3 - 1) (4 - 1) = 6$ . From the standard chi-square distribution table, for $df = 6$ , $χ^{2} = 10.64$ corresponds to $p = 0.10$ and $χ^{2} = 12.59$ corresponds to $p = 0.05$ . Our test statistic 12.3 falls between these two values, so the p-value is between 0.05 and 0.10. The correct answer is C.

Question 2 (Free Response)

A university admissions office wants to test whether the distribution of admission decisions (accept, waitlist, reject) is homogeneous across in-state, out-of-state, and international applicant groups. Independent random samples of 200 applicants from each group gave the observed counts below:

Decision	In-State	Out-of-State	International	Row Total
Accept	72	54	41	167
Waitlist	30	33	32	95
Reject	98	113	127	338
Col Total	200	200	200	600

(a) State the appropriate null and alternative hypotheses for this test, and check all conditions for inference. The smallest expected count is 31.7.
(b) Calculate the degrees of freedom for this test.
(c) The p-value for the test is approximately 0.013 (from $χ^{2} \approx 12.59$ ). What conclusion do you draw at the $α = 0.05$ significance level?

Worked Solution:
(a) $H_{0}$ : The distribution of admission decisions is homogeneous across in-state, out-of-state, and international applicant groups. $H_{a}$ : The distribution of admission decisions differs across at least one applicant group. Conditions: Independent random samples are given, each sample of 200 is assumed to be less than 10% of all applicants in its group, and the smallest expected count is 31.7 ≥ 5. All conditions for inference are met.
(b) $r = 3$ (decision categories), $c = 3$ (applicant groups), so $df = (3 - 1) (3 - 1) = 4$ .
(c) Since $p$ -value ≈ 0.013 < 0.05, we reject $H_{0}$ . There is sufficient evidence at the 0.05 significance level to conclude that the distribution of admission decisions differs across at least one of the three applicant groups.
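It can help to see what "homogeneous" would look like in this table. The Python sketch below (variable names are our own) computes the sample proportion of each decision within each applicant group and compares it with the pooled proportion across all 600 applicants, which is the common distribution the null hypothesis says every group shares. For these counts the accept proportion is 0.360 for in-state applicants and 0.205 for international applicants, against a pooled 0.278.

```python
groups = ["In-State", "Out-of-State", "International"]
decisions = ["Accept", "Waitlist", "Reject"]
observed = [
    [72, 54, 41],
    [30, 33, 32],
    [98, 113, 127],
]

col_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(col_totals)

# Sample proportion of each decision within each applicant group (column proportions).
proportions = [
    [observed[i][j] / col_totals[j] for j in range(len(groups))]
    for i in range(len(decisions))
]
# Pooled proportion of each decision: the shared distribution under H0.
pooled = [sum(row) / grand_total for row in observed]

for i, decision in enumerate(decisions):
    by_group = ", ".join(f"{g}: {p:.3f}" for g, p in zip(groups, proportions[i]))
    print(f"{decision:<8} pooled {pooled[i]:.3f} | {by_group}")
```

Multiplying each pooled proportion by a group's sample size of 200 gives that group's expected counts, which is the same arithmetic as row total times column total over grand total.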

Question 3 (Application / Real-World Style)

An ecologist studies whether the distribution of tree species (oak, maple, pine, birch) is homogeneous across three different soil types in a national forest. She takes independent random samples of 50 trees from each soil type, resulting in the table below:

Species	Soil A	Soil B	Soil C	Row Total
Oak	18	12	8	38
Maple	12	15	10	37
Pine	10	12	18	40
Birch	10	11	14	35
Col Total	50	50	50	150

Conduct a full chi-square test for homogeneity at the $α = 0.05$ significance level and interpret your result in context.

Worked Solution:

Hypotheses: $H_{0}$ : The distribution of tree species is homogeneous across the three soil types. $H_{a}$ : At least one soil type has a different distribution of tree species.
Conditions: Independent random samples are given, each sample of 50 trees can reasonably be assumed to be less than 10% of the trees on its soil type, and the smallest expected count is $\frac{35 \times 50}{150} \approx 11.67 \geq 5$ , so all conditions are met.
Calculations: $χ^{2} = \sum \frac{( O - E ) ^{2}}{E} \approx 8.37$ , $df = (4 - 1) (3 - 1) = 6$ .
The p-value for $χ^{2} = 8.37$ with df=6 is between 0.20 and 0.25, which is greater than $α = 0.05$ . We fail to reject $H_{0}$ .

Interpretation: There is not sufficient evidence at the 0.05 significance level to conclude that the distribution of tree species differs across the three soil types in the national forest.
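The full test above can be reproduced end to end with the Python sketch below. It is a check on the arithmetic rather than a substitute for the written solution; the variable names are our own, and the p-value line relies on a closed form that is valid only because df = 6 is even. On these counts the statistic comes to about 8.37 and the p-value to about 0.21.

```python
import math

observed = [
    [18, 12, 8],    # Oak
    [12, 15, 10],   # Maple
    [10, 12, 18],   # Pine
    [10, 11, 14],   # Birch
]
alpha = 0.05

row_totals = [sum(r) for r in observed]
col_totals = [sum(c) for c in zip(*observed)]
grand_total = sum(row_totals)

# Expected counts under H0 and the Large Counts check.
expected = [[r * c / grand_total for c in col_totals] for r in row_totals]
min_expected = min(min(row) for row in expected)

# Test statistic and degrees of freedom.
chi_square = sum(
    (o - e) ** 2 / e
    for obs_row, exp_row in zip(observed, expected)
    for o, e in zip(obs_row, exp_row)
)
df = (len(row_totals) - 1) * (len(col_totals) - 1)

# Right-tail area; this closed form is valid only for even df.
half = chi_square / 2
p_value = math.exp(-half) * sum(half ** k / math.factorial(k) for k in range(df // 2))

decision = "reject H0" if p_value < alpha else "fail to reject H0"
print(f"smallest expected count = {min_expected:.2f}")
print(f"chi-square = {chi_square:.2f}, df = {df}, p-value = {p_value:.3f}")
print(decision)
```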

7. Quick Reference Cheatsheet

Category	Formula	Notes
Null Hypothesis	$H_{0}$ : Distribution of response is identical across all populations	Use when you have independent samples from multiple groups
Alternative Hypothesis	$H_{a}$ : At least one population has a different response distribution	Never state "all distributions are different"
Expected Cell Count	$E_{ij} = \frac{\text{Row Total} \times \text{Column Total}}{\text{Grand Total}}$	Applies to all contingency table chi-square tests
Chi-Square Test Statistic	$χ^{2} = \sum_{\text{all cells}} \frac{( O - E ) ^{2}}{E}$	Larger values = stronger evidence against $H_{0}$
Degrees of Freedom	$df = (r - 1) (c - 1)$	$r$ = number of response categories, $c$ = number of populations
Conditions for Inference	Random, Independence, All $E \geq 5$	AP CED requires all expected counts ≥ 5
Conclusion for $p < α$	Reject $H_{0}$	There is sufficient evidence that distributions differ
Conclusion for $p \geq α$	Fail to reject $H_{0}$	There is not sufficient evidence that distributions differ

8. What's Next

The chi-square test for homogeneity is the second of three core chi-square procedures in AP Statistics Unit 8, after goodness-of-fit and before the chi-square test for independence. Mastering the mechanics of contingency table calculations, hypothesis formulation, and condition checking for homogeneity is a critical prerequisite for the test of independence, which uses the same calculation framework but a different study design and research question. Without mastering this chapter, you will struggle to distinguish between the two contingency table chi-square tests, a common source of lost points on the AP exam. Beyond Unit 8, chi-square inference for categorical data is the foundation for more advanced procedures like logistic regression and categorical data analysis in post-AP statistics courses.
