A-Level Mathematics · Paper 5 (Probability & Statistics 1) · Hypothesis Testing · 18 min read · Updated 2026-05-06

Hypothesis Testing — A-Level Mathematics Stats Study Guide

For: A-Level Mathematics candidates sitting Paper 5 (Probability & Statistics 1).

Covers: Null and alternative hypotheses, one- and two-tailed tests, critical regions and significance levels, Type I and Type II errors, and hypothesis tests for binomial proportions and normal means, aligned to the latest A-Level Mathematics syllabus.

You should already know: Basic probability, summation, integration (Pure 1 calculus).

A note on the practice questions: All worked questions in the "Practice Questions" section below are original problems written by us in the A-Level Mathematics style for educational use. They are not reproductions of past Cambridge International examination papers and may differ in wording, numerical values, or context. Use them to practise the technique; cross-check with official Cambridge mark schemes for grading conventions.

1. What Is Hypothesis Testing?

Hypothesis testing is a statistical framework to evaluate whether a claim about a population parameter is supported by sample data, using probability to quantify the risk of drawing an incorrect conclusion. It is a high-weight topic in A-Level Mathematics Paper 5, accounting for 10-15% of total marks on most exam papers, and forms the foundation for all advanced statistical analysis in further mathematics syllabi. Common synonyms include significance testing and statistical hypothesis testing.

2. Null and alternative hypotheses

Every hypothesis test starts with two competing statements about a population parameter, never about a sample statistic.

The null hypothesis ( $H_{0}$ ) is the default, widely accepted claim, assuming no effect, no difference, or no change from a known reference value. It always includes an equality sign for the population parameter (e.g., $p = 0.6$ , $μ = 500$ ).
The alternative hypothesis ( $H_{1}$ ) is the competing claim you are testing against $H_{0}$ , assuming there is a measurable effect, difference, or change from the reference value. It uses inequality signs ( $<$ , $>$ , or $\neq =$ ) depending on the direction of the claim.

Worked Example

A mobile network claims its average 5G download speed is 120 Mbps. A customer suspects the speed is slower than advertised. The hypotheses are: $H_{0} : μ = 120 Mbps (network claim is true)$ $H_{1} : μ < 120 Mbps (speed is slower than advertised)$ Exam tip: Examiners deduct 1 mark if you write hypotheses using sample statistics (e.g., $\overset{x}{ˉ} = 120$ ) instead of population parameters, so double-check this before moving on.

3. One-tailed and two-tailed tests

The "tail" of a test is determined by the inequality in the alternative hypothesis, and dictates how you calculate significance thresholds.

One-tailed test: $H_{1}$ uses a single directional inequality ( $<$ or $>$ ), so you are only testing for an effect in one specific direction. Left-tailed tests use $<$ , right-tailed tests use $>$ .
Two-tailed test: $H_{1}$ uses $\neq =$ , so you are testing for any difference from the reference value, regardless of direction. The total significance level is split equally between the upper and lower tails of the distribution.

Worked Example

Using the mobile network scenario:

If you only suspect speeds are slower, this is a left-tailed one-tailed test.
If you suspect speeds are either faster or slower than advertised (no directional claim), $H_{1} : μ \neq = 120$ Mbps, so this is a two-tailed test. Exam tip: Context clues in the question will tell you which test to use: phrases like "increased", "reduced", or "biased towards" indicate a one-tailed test, while "different from" or "changed" indicate a two-tailed test.

4. Critical region and significance level

The significance level and critical region set the threshold for how unlikely a sample result has to be before you reject the null hypothesis.

The significance level ( $α$ ) is the maximum acceptable probability of rejecting $H_{0}$ when it is actually true. Common values used in A-Level exams are 1%, 5%, and 10%.
The critical region (or rejection region) is the set of values of the test statistic for which you reject $H_{0}$ . The boundary of this region is called the critical value.

Worked Example

You run a right-tailed test for a binomial proportion, with $n = 20$ , $H_{0} : p = 0.4$ , and $α = 5%$ . First, calculate cumulative binomial probabilities: $P (X \geq 11) = 1 - P (X \leq 10) = 1 - 0.943 = 0.057$ $P (X \geq 12) = 1 - P (X \leq 11) = 1 - 0.979 = 0.021$ Since 0.021 < 0.05 and 0.057 > 0.05, the critical region is $X \geq 12$ . For a two-tailed test at 5% significance, you would split $α$ into 2.5% per tail, finding lower and upper critical values where $P (X \leq k) < 0.025$ and $P (X \geq m) < 0.025$ .

5. Type I and Type II errors

No hypothesis test is 100% accurate: there are two possible errors you can make when drawing a conclusion:

Type I error: Reject $H_{0}$ when $H_{0}$ is actually true (a "false positive"). The probability of a Type I error is equal to the significance level $α$ , or the exact probability of the test statistic falling in the critical region under $H_{0}$ .
Type II error: Fail to reject $H_{0}$ when $H_{0}$ is actually false (a "false negative"). The probability of a Type II error is denoted $β$ , and can only be calculated if you are given the true value of the population parameter.

Worked Example

For a test with $H_{0} : p = 0.5$ , $H_{1} : p > 0.5$ , $n = 20$ , and critical region $X \geq 15$ :

Probability of Type I error = $P (X \geq 15∣ p = 0.5) = 1 - P (X \leq 14) = 1 - 0.9793 = 0.0207$ (~2.1%), which is below the 5% significance level.
If the true value of $p$ is 0.7, probability of Type II error = $P (X < 15∣ p = 0.7) = P (X \leq 14) = 0.5836$ (~58.4%). Exam tip: Use the mnemonic to remember: Type I = Innocent person convicted, Type II = Guilty person acquitted.

6. Test for a binomial proportion or normal mean

These are the two hypothesis test variants you will be asked to conduct in A-Level Mathematics Paper 5.

Test for a binomial proportion

Used for discrete count data with fixed independent trials, two outcomes, and constant success probability $p$ . The steps are:

State $H_{0} : p = p_{0}$ and $H_{1}$ as appropriate.
Define the test statistic $X$ , the number of successes in $n$ trials, which follows $X \sim B in (n, p_{0})$ under $H_{0}$ .
Calculate the p-value (probability of observing a result as extreme or more extreme than the sample value under $H_{0}$ ) or compare $X$ to the critical region.
Conclude: reject $H_{0}$ if the p-value < $α$ or $X$ is in the critical region, else fail to reject $H_{0}$ .

Worked Example

A coin is tossed 20 times, landing heads 14 times. Test at 5% significance if the coin is biased towards heads: $H_{0} : p = 0.5, H_{1} : p > 0.5, α = 0.05$ $P (X \geq 14∣ p = 0.5) = 1 - P (X \leq 13) = 1 - 0.9423 = 0.0577$ Since 0.0577 > 0.05, fail to reject $H_{0}$ : there is insufficient evidence at the 5% level to conclude the coin is biased towards heads.

Test for a normal mean

Used for continuous data where the population is normally distributed with known variance $σ^{2}$ . The test statistic is the z-score: $z = \frac{x ˉ - μ _{0}}{σ / n}$ This follows a standard normal distribution $Z \sim N (0, 1)$ under $H_{0}$ .

Worked Example

The average test score for a national exam is 65, with standard deviation 10. A class of 25 students has an average score of 69. Test at 1% significance if the class score is different from the national average: $H_{0} : μ = 65, H_{1} : μ \neq = 65, α = 0.01 (two-tailed)$ $z = \frac{69 - 65}{10/ 25} = \frac{4}{2} = 2$ The two-tailed 1% critical values are $\pm 2.576$ . Since $2 < 2.576$ , fail to reject $H_{0}$ : there is insufficient evidence at the 1% level to conclude the class score is different from the national average.

7. Common Pitfalls (and how to avoid them)

Wrong move: Writing hypotheses using sample statistics (e.g., $\overset{x}{ˉ} = 65$ ) instead of population parameters. Why students do it: They confuse sample results with the population value being tested. Correct move: Always use population parameters ( $μ$ , $p$ ) for $H_{0}$ and $H_{1}$ ; sample values are only used to calculate test statistics.
Wrong move: Using the full significance level for both tails in a two-tailed test. Why students do it: They forget that two-tailed tests split $α$ equally between upper and lower tails. Correct move: For a 5% two-tailed test, use 2.5% in each tail when calculating critical values or p-values.
Wrong move: Writing "accept $H_{0}$ " when the test statistic is not in the critical region. Why students do it: They assume no evidence against $H_{0}$ means $H_{0}$ is proven true. Correct move: Always write "fail to reject $H_{0}$ " or "there is insufficient evidence to reject $H_{0}$ ", as you cannot prove the null hypothesis is true, only that you lack data to disprove it.
Wrong move: Calculating Type II error probability using the $H_{0}$ parameter value instead of the given true parameter value. Why students do it: They mix up the conditions for Type I and Type II errors. Correct move: Type I error is calculated under $H_{0}$ being true; Type II error is calculated under the given true value of the parameter, which is always different from the $H_{0}$ value.
Wrong move: Using the population standard deviation instead of the standard error of the mean for normal tests. Why students do it: They forget that the variance of the sample mean is $σ^{2} / n$ , not $σ^{2}$ . Correct move: Always use $σ / n$ as the denominator in the z-score formula for mean tests.

8. Practice Questions (A-Level Mathematics Paper 5 Style)

Question 1

A café claims that 70% of customers rate their service as "excellent". A new manager suspects the rating is lower, so she surveys a random sample of 12 customers. 5 of them rate the service as excellent. (a) State suitable null and alternative hypotheses for the test. [2 marks] (b) Test the manager’s claim at the 10% significance level. [5 marks]

Solution

(a) Let $p$ = proportion of customers who rate service as excellent. $H_{0} : p = 0.7, H_{1} : p < 0.7 (one-tailed test)$ (b) Let $X$ = number of customers who rate service as excellent, so under $H_{0}$ , $X \sim B in (12, 0.7)$ . $P (X \leq 5∣ p = 0.7) = 0.0386$ Since 0.0386 < 0.1, the test statistic falls inside the critical region. We reject $H_{0}$ at the 10% significance level: there is sufficient evidence to support the manager’s claim that the proportion of customers rating service as excellent is lower than 70%.

Question 2

The mass of a standard bag of flour is normally distributed with mean 1kg and standard deviation 40g. A consumer group suspects bags are underfilled, so they sample 16 bags and find their mean mass is 980g. (a) Find the critical region for a one-tailed test at the 5% significance level. [3 marks] (b) State the conclusion of the test. [2 marks]

Solution

(a) $H_{0} : μ = 1000 g, H_{1} : μ < 1000 g, α = 0.05$ . The test statistic is: $z = \frac{x ˉ - 1000}{40/ 16} = \frac{x ˉ - 1000}{10}$ The 5% left-tailed critical z-value is -1.645. Solve for $\overset{x}{ˉ}$ : $\frac{x ˉ - 1000}{10} < - 1.645 ⟹ \overset{x}{ˉ} < 983.55 g$ The critical region is $\overset{x}{ˉ} < 983.55 g$ . (b) The sample mean is 980g, which falls inside the critical region. We reject $H_{0}$ at the 5% significance level: there is evidence that bags are being underfilled.

Question 3

For the test in Question 2: (a) Calculate the probability of a Type I error. [1 mark] (b) If the true mean mass of the flour bags is 975g, calculate the probability of a Type II error. [4 marks]

Solution

(a) Probability of Type I error = significance level = 0.05. (b) Type II error is failing to reject $H_{0}$ when $μ = 975 g$ . We calculate $P (\overset{x}{ˉ} \geq 983.55∣ μ = 975, σ / n = 10)$ : $z = \frac{983.55 - 975}{10} = 0.855$ $P (Z \geq 0.855) = 1 - Φ (0.855) = 1 - 0.804 = 0.196$ The probability of a Type II error is 0.196 (19.6%).

9. Quick Reference Cheatsheet

Concept	Rule/Formula
Hypotheses	$H_{0}$ : population parameter = reference value; $H_{1}$ : parameter <, >, or ≠ reference value
Test Type	One-tailed if $H_{1}$ has < or >; two-tailed if $H_{1}$ has ≠, split $α$ equally between tails
Significance Level	$α$ = max P(Type I error); common values: 5% = 0.05, 1% = 0.01
Critical Values (z-test)	5% one-tailed: ±1.645; 5% two-tailed: ±1.96; 1% two-tailed: ±2.576
Errors	P(Type I) = $α$ (under $H_{0}$ ); P(Type II) = P(not in critical region
Binomial Test	$X \sim B in (n, p_{0})$ under $H_{0}$ , compare p-value to $α$
Normal Mean Test	$z = \frac{x ˉ - μ _{0}}{σ / n}$ , compare to z critical values
Conclusion	Reject $H_{0}$ if test statistic in critical region / p-value < $α$ ; else fail to reject $H_{0}$ (never "accept $H_{0}$ ")

10. What's Next

Hypothesis testing is a core building block for all advanced statistics topics in the A-Level Mathematics and A-Level Further Mathematics (Further Maths) syllabi. In Paper 5, you will apply these rules to combine with other topics like the Poisson distribution and sampling methods, while in Further Maths you will extend them to chi-squared tests, t-tests, and non-parametric tests. Mastering the 5-step framework outlined here will cut down your working time for 10-15 mark long questions on Paper 5 significantly, as every hypothesis test follows the same structure regardless of the distribution used.

If you struggle with any of the concepts, worked examples, or practice questions in this guide, you can ask Ollie for personalized explanations, additional practice questions, or step-by-step walkthroughs tailored to your weak spots at any time. Head to Ollie, the AI tutor built into OwlsPrep, to get instant support, or browse our full library of A-Level Mathematics Paper 5 study guides to cover the rest of the syllabus before your exam.

Aligned with the Cambridge International AS & A Level Mathematics 9709 syllabus. OwlsAi is not affiliated with Cambridge Assessment International Education.

← Back to topic

Stuck on a specific question?
Snap a photo or paste your problem — Ollie (our AI tutor) walks through it step-by-step with diagrams.
Try Ollie free →

Hypothesis Testing — A-Level Mathematics Stats Study Guide

1. What Is Hypothesis Testing?

2. Null and alternative hypotheses

Worked Example

3. One-tailed and two-tailed tests

Worked Example

4. Critical region and significance level

Worked Example

5. Type I and Type II errors

Worked Example

6. Test for a binomial proportion or normal mean

Test for a binomial proportion

Worked Example

Test for a normal mean

Worked Example

7. Common Pitfalls (and how to avoid them)

8. Practice Questions (A-Level Mathematics Paper 5 Style)

Question 1

Solution

Question 2

Solution

Question 3

Solution

9. Quick Reference Cheatsheet

10. What's Next

More study guides