Transforming Random Variables — AP Statistics Study Guide
For: AP Statistics candidates sitting AP Statistics.
Covers: Linear transformations of discrete and continuous random variables, rules for the mean (expected value), variance, and standard deviation of transformed variables, contrast between linear and non-linear transformations, and common AP exam multiple-choice and free-response question applications.
You should already know: Basic probability and the definition of expected value and variance. How to work with probability distributions for discrete and continuous random variables. Basic unit conversion and linear equation manipulation.
A note on the practice questions: All worked questions in the "Practice Questions" section below are original problems written by us in the AP Statistics style for educational use. They are not reproductions of past College Board / Cambridge / IB papers and may differ in wording, numerical values, or context. Use them to practise the technique; cross-check with official mark schemes for grading conventions.
1. What Is Transforming Random Variables?
Transforming a random variable means changing every outcome of the original variable by a fixed mathematical function to produce a new, transformed random variable. This technique is extremely common in real-world statistics when we need to convert units, adjust for a fixed baseline measurement, or scale raw data for analysis. For example, if you have a random variable that measures temperature in degrees Celsius, you can transform it to Fahrenheit with to get a new random variable with its own mean, variance, and distribution shape. According to the AP Statistics Course and Exam Description (CED), this topic is part of Unit 4, which accounts for 12–15% of the total AP exam score. Transforming random variables appears regularly in both multiple-choice (MCQ) and free-response (FRQ) sections, often combined with other topics like normal distributions or combining random variables. Linear transformations, which take the general form where and are fixed constants, are the primary focus of the AP exam. We also cover the key distinction between linear and non-linear transformations, a common tested concept that trips up many students.
2. Effect of Linear Transformations on Expected Value (Mean)
When you apply a linear transformation of the form to a random variable , the expected value (mean) of follows a simple rule that matches the transformation itself. To build intuition, we can derive this rule for a discrete random variable: the expected value of is defined as . For , every outcome , so: Since for any probability distribution, this simplifies to: This rule makes intuitive sense: multiplying every outcome by scales the average by , and adding shifts the entire distribution by , so the average shifts by as well. This rule works for both discrete and continuous random variables, with no exceptions for linear transformations. It is most commonly tested in unit conversion or baseline adjustment problems on the AP exam.
Worked Example
A food scale adds a 0.05 kg fixed tare weight to every measured apple weight (measured in kg before tare correction). Let be the uncorrected measured weight, with kg. The corrected weight (in kg) is calculated by subtracting the tare, and the orchard owner wants the mean corrected weight reported in pounds, where 1 kg = 2.205 pounds. What is the mean corrected weight in pounds?
- Write the full linear transformation from uncorrected kg to corrected pounds : .
- Apply the expected value rule for linear transformations: .
- Substitute the known values: , , .
- Calculate the result: pounds.
Exam tip: On AP FRQs, always write out the general formula before plugging in values — this earns you method points even if you make an arithmetic error.
3. Effect of Linear Transformations on Variance and Standard Deviation
After finding the mean of a transformed variable, the AP exam almost always asks for the variance or standard deviation (spread) of the new variable. The rules here differ from the mean rule because adding a constant does not change the spread of the distribution — it only shifts the entire distribution left or right, so all observations stay the same distance apart, and spread remains unchanged.
We can derive this rule from the definition of variance: . Substituting gives . Squaring both sides and taking the expectation gives: For standard deviation, which is the square root of variance, we get: We use the absolute value of because standard deviation is always non-negative, and is almost always positive in AP problems (as it is usually a unit conversion or scaling factor).
Worked Example
In the apple orchard problem above, the uncorrected weight has standard deviation kg. Find the standard deviation of the corrected weight in pounds, where .
- Recall the standard deviation rule for linear transformations: .
- Identify values: , ; the shift term does not affect spread.
- Calculate the standard deviation: pounds.
- If asked for variance instead: pounds squared.
Exam tip: AP exam questions frequently ask for standard deviation, not variance. Always double-check which measure the question requests — forgetting to take the square root of variance is one of the most common mistakes on this topic.
4. Linear vs Non-Linear Transformations
All the rules we have covered so far apply only to linear transformations of the form . Non-linear transformations (e.g., , , ) do not follow the linear transformation rules. In particular, for any non-linear function , the expected value of the transformation is almost never equal to the transformation of the expected value: .
To find the expected value of a non-linearly transformed random variable, you must recalculate it from scratch using the original probability distribution: for discrete , ; for continuous , it is calculated via integration. The AP exam often tests this concept by asking students to identify a common mistake or justify why a student's calculation is incorrect.
Worked Example
The area of a random square plot of farm land is given by , where is the side length of the plot (in meters), a random variable with mean meters. A student claims that the mean area is square meters. Is this claim correct? Justify your answer.
- The transformation is a non-linear transformation of the random variable .
- The rule only holds for linear transformations; it does not apply to non-linear transformations.
- For convex functions like , , so the true mean area will be greater than 100 square meters.
- Therefore, the student's claim is incorrect, because the linear expected value rule cannot be applied to this non-linear transformation.
Exam tip: If the transformation includes any exponent, square root, logarithm, or reciprocal that cannot be rewritten as , it is non-linear — never apply the linear transformation rules for mean or variance.
Common Pitfalls (and how to avoid them)
- Wrong move: Calculating the variance of as instead of . Why: Students confuse the mean rule with the variance rule, and incorrectly carry over the shift term and forget to square the scale factor. Correct move: Remind yourself before starting that shifting the distribution does not change spread, so disappears, and is always squared for variance.
- Wrong move: Applying the linear expected value rule to non-linear transformations like . Why: Students memorize the linear rule so well they automatically apply it to any transformation without checking the form. Correct move: First check if the transformation can be rewritten as . If not, do not use the linear rule; calculate from the original probability distribution.
- Wrong move: Adding to the standard deviation/variance: e.g., . Why: Students assume all parts of the transformation affect all statistics, just like the mean. Correct move: Memorize that adding a constant only changes measures of center (mean, median, quartiles), never measures of spread (variance, SD, IQR, range).
- Wrong move: Reversing the scale factor when converting units: e.g., converting kg to g using instead of 1000. Why: Students mix up the direction of unit conversion. Correct move: Always write out the conversion explicitly: , so before calculating.
- Wrong move: Claiming a linear transformation changes the shape of the original distribution. Why: Students think stretching the distribution changes its shape, but this is incorrect. Correct move: Linear transformations only scale and shift the distribution, they do not change its shape. If is normally distributed, is also normally distributed.
Practice Questions (AP Statistics Style)
Question 1 (Multiple Choice)
The random variable has mean and variance . A new random variable is defined as . What is the standard deviation of ? A) 12 B) 43 C) 31 D) 144
Worked Solution: First, recall the rule for variance of a linear transformation: . We have and , so . Standard deviation is the square root of variance, so . The other options correspond to common mistakes: B uses the incorrect rule , C gives the mean of instead of standard deviation, D gives the variance of instead of standard deviation. The correct answer is A.
Question 2 (Free Response)
A small business sells custom printed t-shirts. The random variable represents the total production cost (in US dollars) for one t-shirt, with mean and standard deviation . The business calculates the selling price by marking up the cost by 60% and adding a $2 fixed shipping cost per t-shirt, so . (a) Find the mean selling price of a t-shirt. (b) Find the standard deviation of the selling price of a t-shirt. (c) The business owner wants to convert all prices to Canadian dollars, where 1 US dollar = 1.35 Canadian dollars. Write the transformation from (US dollars) to (Canadian dollars), then find the mean and standard deviation of .
Worked Solution: (a) For the linear transformation , apply the mean rule: The mean selling price is .
(b) Apply the standard deviation rule: the shift term of $2 does not affect spread, so: The standard deviation of the selling price is .
(c) The transformation is , which simplifies to , a linear transformation. Mean: Canadian dollars. Standard deviation: Canadian dollars.
Question 3 (Application / Real-World Style)
A biologist measures the length of trout in a lake in inches, where the random variable (length of a random trout) has mean 12.5 inches and standard deviation 2.1 inches. The biologist uses two common calculations for their data: (1) converting all lengths to centimeters for publication, and (2) estimating weight from length using the empirical formula , where is weight in pounds and is length in inches. (a) Find the mean length in centimeters (1 inch = 2.54 cm). (b) A student intern calculates the mean weight as pounds. Is this calculation correct? Justify your answer in context.
Worked Solution: (a) Converting length from inches to centimeters is a linear transformation: . The mean length in centimeters is:
(b) The weight transformation is non-linear, so the rule that applies to linear transformations cannot be used here. The student incorrectly applied the linear transformation rule to a non-linear transformation, so their calculation is wrong. For this convex transformation, the true mean weight will be higher than 19.53 pounds, and we need the full distribution of lengths to calculate the correct mean weight.
In context: We cannot use the average length to directly calculate the average weight with this non-linear length-weight formula.
Quick Reference Cheatsheet
| Category | Formula | Notes |
|---|---|---|
| General Linear Transformation | = fixed scale factor, = fixed shift | |
| Expected Value (Mean) | Works for all linear transformations, discrete and continuous | |
| Variance | has no effect on variance; does not apply to non-linear transformations | |
| Standard Deviation | $\sigma_{aX + b} = | a |
| Expected Value (Non-Linear) | (discrete) | for all non-linear |
| Distribution Shape | Unchanged after linear transformation | If is normal, is also normal |
| Effect on Statistics | Center (mean, median, quartiles) affected by and ; Spread (SD, variance, IQR) affected only by | Shifting the entire distribution does not change how spread out it is |
What's Next
Transforming random variables is a critical prerequisite for the next core topic in Unit 4: combining independent random variables. Without mastering the rules for linear transformations, you will not be able to correctly calculate the mean and variance of sums or differences of random variables, a heavily tested topic on both MCQ and FRQ. This topic also underpins all of inferential statistics: when we calculate confidence intervals or run hypothesis tests for means, we rely on properties of transformed sampling distributions, which build directly on the rules for linear transformations you learned here. Beyond the exam, these rules are used constantly in data analysis for unit conversion and data scaling.
Combining Independent Random Variables Normal Probability Distributions Sampling Distributions for Means