Discrete Random Variables — A-Level Mathematics Stats Study Guide
For: A-Level Mathematics candidates sitting Paper 5 (Probability & Statistics 1).
Covers: Probability distribution tables, expected value and variance , linear transformations of random variables, binomial distribution conditions and formulas, and geometric distribution for the A-Level Mathematics Stats 1 syllabus.
You should already know: Basic probability, summation, integration (Pure 1 calculus).
A note on the practice questions: All worked questions in the "Practice Questions" section below are original problems written by us in the A-Level Mathematics style for educational use. They are not reproductions of past Cambridge International examination papers and may differ in wording, numerical values, or context. Use them to practise the technique; cross-check with official Cambridge mark schemes for grading conventions.
1. What Is a Discrete Random Variable?
A discrete random variable (DRV) is a numerical quantity that takes a countable set of distinct values, each associated with a fixed probability, as the outcome of a random experiment. "Countable" means you can list all possible values of the variable, even if the list is infinite (e.g. the number of die rolls until you get a 6).
We use uppercase letters (, ) to refer to the random variable itself, and lowercase letters (, ) to refer to individual values it can take. The notation denotes the probability that takes the specific value . DRVs account for 15-20% of total marks in A-Level Mathematics Paper 5, so mastering this topic is critical for high scores.
2. Probability Distribution Table
A probability distribution (or probability mass function, PMF) for a DRV lists every possible value that can take, paired with its corresponding probability . All valid probability distributions follow two non-negotiable rules:
- All probabilities lie between 0 and 1: for all
- The sum of all probabilities equals 1:
A probability distribution table is the standard way to present this for small sets of values. Examiners frequently ask you to fill in missing values in these tables, so always apply the sum-to-1 rule first.
Worked Example
Let be the number of heads obtained when flipping 2 fair coins. The possible values of are 0, 1, 2. The valid distribution table is:
| 0 | 1 | 2 | |
|---|---|---|---|
Check validity: , and all probabilities are positive, so the distribution is valid.
3. Expected Value and Variance
Expected Value
The expected value (or mean, written ) of is the long-run average value of if you repeat the random experiment thousands of times. It is calculated as a weighted average of all possible values, weighted by their probabilities: This formula comes directly from calculating the average of repeated trials: if you flip 2 coins 4000 times, you expect ~1000 instances of 0 heads, ~2000 of 1 head, ~1000 of 2 heads, so the average is , which matches the formula result for the coin flip example. Note that does not need to be a possible value of (e.g. the expected value of a fair 6-sided die roll is 3.5, which is not a valid die score).
Variance
Variance (written ) measures the spread of the distribution around the mean. The definitional formula for variance is the expected value of the squared deviation from the mean: For exam calculations, always use the far simpler computational formula, derived by expanding the definitional form: where . This formula reduces arithmetic errors and saves time in exams.
Worked Example
For the fair 6-sided die roll :
4. Linear Transformation —
A linear transformation of is a new random variable , where and are constants. This comes up frequently in real-world contexts like converting temperature units (F = 1.8C + 32) or adjusting test scores by scaling and adding bonus marks.
Expected Value of Linear Transformation
Expectation is a linear operator, so scaling and shifting apply directly to the mean: Intuitively, if you double every value of and add 3, the long-run average will also double and increase by 3.
Variance of Linear Transformation
Variance measures spread, so adding a constant does not change the spread at all (it shifts all values equally, so gaps between them stay the same). Scaling by increases spread by a factor of , since variance is a squared measure: The standard deviation of is , since standard deviation is the positive square root of variance.
Worked Example
For the die roll with , , let (payout for a die roll game):
- Standard deviation of
5. Binomial Distribution — Conditions and Formula
The binomial distribution is the most frequently tested DRV in Paper 5, used to model the number of successes in a fixed number of independent identical trials. A random variable follows a binomial distribution, written , if all 4 of these conditions are met:
- There is a fixed number of trials
- Each trial has only two outcomes: success or failure
- The probability of success is constant for all trials
- All trials are independent of each other
Examiners will often ask you to justify using a binomial distribution, so you must link each condition to the question context, not just list them.
The PMF, expected value, and variance for the binomial distribution are: where is the number of ways to choose successes out of trials.
Worked Example
A fair coin is flipped 10 times, is the number of heads, so :
6. Geometric Distribution (where applicable)
The geometric distribution models the number of independent trials required to get the first success. For A-Level Mathematics, the standard parameterization defines as the number of trials up to and including the first success, written , where is the probability of success per trial. The conditions for a geometric distribution are:
- Trials are independent
- Probability of success is constant for all trials
- Each trial has two outcomes: success or failure
- Trials continue until the first success is observed
The PMF, expected value, and variance for the geometric distribution are: Note that starts at 1 for the A-Level syllabus: if a question asks for the number of failures before the first success, use the variable to adjust the distribution.
Worked Example
You roll a fair die repeatedly until you get a 6, so :
7. Common Pitfalls (and how to avoid them)
- Wrong move: Using a linear factor of instead of for variance of linear transformations, or including the term in variance calculations. Why it happens: Students mix up expectation and variance rules. Correct move: Write the formula on your exam formula sheet before starting questions, and remember that variance only measures spread, so shifts do not affect it.
- Wrong move: Using the geometric distribution parameterization where starts at 0 for A-Level questions. Why it happens: Some textbooks use the "number of failures" parameterization. Correct move: Check question wording: if it asks for number of trials to first success, , use . If it asks for failures before first success, subtract 1 from .
- Wrong move: Calculating variance with the definitional formula instead of the computational formula. Why it happens: Students memorize the definitional form first. Correct move: Always use the computational formula for variance: it has fewer steps and eliminates sign errors from squared differences.
- Wrong move: Failing to link binomial conditions to question context when justifying use of the distribution. Why it happens: Students only list the 4 conditions without applying them. Correct move: For a question about defective items in a sample of 20, state: "Fixed trials, each item is defective or not, defect probability is constant, items are independent, so binomial is appropriate."
- Wrong move: Assuming must be a possible value of . Why it happens: Students confuse expected value with the mode (most common value). Correct move: Expected value is a long-run average, it can be a non-integer even if only takes integer values (e.g. 3.5 for a die roll is completely valid).
8. Practice Questions (A-Level Mathematics Paper 5 Style)
Question 1
The discrete random variable has the following probability distribution:
| 1 | 2 | 3 | 4 | 5 | |
|---|---|---|---|---|---|
| 0.1 | 0.2 | 0.25 | 0.15 | ||
| (a) Find the value of . (2 marks) | |||||
| (b) Calculate and . (4 marks) | |||||
| (c) Let . Find and the standard deviation of . (3 marks) |
Solution
(a) Sum of probabilities = 1: → → (b) (c) Standard deviation of (3 significant figures)
Question 2
A multiple-choice quiz has 12 questions, each with 4 options, only one correct. A student guesses all answers randomly. (a) Explain why the number of correct answers follows a binomial distribution. (3 marks) (b) Find the probability the student gets exactly 3 correct answers. (3 marks) (c) Find the mean and variance of . (2 marks)
Solution
(a) 1) Fixed trials (questions). 2) Each trial has two outcomes: correct (success) or incorrect (failure). 3) Probability of success is constant for all guesses. 4) Guesses are independent, so outcome of one question does not affect others. All conditions met, so . (b) (3 sf) (c) ,
Question 3
A basketball player has a 0.3 probability of making a free throw per attempt, independent of previous attempts. (a) Find the probability the player makes their first successful free throw on their 4th attempt. (3 marks) (b) Find the expected number of attempts needed to make their first successful free throw, and the variance. (2 marks)
Solution
(a) Let = number of attempts to first success, so (3 sf) (b) (3 sf), (3 sf)
9. Quick Reference Cheatsheet
| Category | Formula | Notes |
|---|---|---|
| General DRV Rules | , | Applies to all valid discrete distributions |
| Expected Value | Weighted average of values | |
| Variance | , | Use this computational form for all exam calculations |
| Linear Transformation | , | Variance ignores the term |
| Binomial | , , | Conditions: fixed , 2 outcomes, constant , independent trials |
| Geometric | , , | A-Level uses = number of trials up to first success, |
10. What's Next
This topic is the foundation for all other probability content in A-Level Mathematics Paper 5. You will apply the expectation and variance rules you learned here to continuous random variables (including the normal distribution) later in the syllabus, and solve combined probability problems that use multiple distributions in the same question. DRV concepts are also used for high-mark real-world application questions, including game theory, quality control, and risk calculation, which make up the final sections of most Paper 5 exams.
If you are struggling with any of the concepts, formulas, or practice questions in this guide, you can ask Ollie for step-by-step explanations, additional practice problems, or clarification of exam marking conventions at any time. Head to [the homepage](/ to access Ollie and other A-Level Mathematics Paper 5 study resources, including past paper walkthroughs and topic quizzes tailored to your weak areas.
Aligned with the Cambridge International AS & A Level Mathematics 9709 syllabus. OwlsAi is not affiliated with Cambridge Assessment International Education.