Parameters for a Binomial Distribution — AP Statistics Study Guide
For: AP Statistics candidates sitting AP Statistics.
Covers: Identifying and interpreting the binomial parameters (number of independent trials) and (probability of success), verifying binomial conditions, and calculating the mean, variance, and standard deviation of a binomial random variable.
You should already know: Basic probability rules for independent events, definition of a discrete random variable, how to calculate expected value of a discrete random variable.
A note on the practice questions: All worked questions in the "Practice Questions" section below are original problems written by us in the AP Statistics style for educational use. They are not reproductions of past College Board / Cambridge / IB papers and may differ in wording, numerical values, or context. Use them to practise the technique; cross-check with official mark schemes for grading conventions.
1. What Is Parameters for a Binomial Distribution?
A binomial distribution is a discrete probability distribution that models the number of successes in a fixed sequence of independent yes/no (success/failure) trials. The parameters of a binomial distribution are the two numerical values that completely define the shape, center, and spread of the distribution—no additional information is needed to calculate any probability, expected value, or measure of spread for the distribution. According to the AP Statistics Course and Exam Description (CED), this topic is part of Unit 4: Probability, Random Variables, and Probability Distributions, which accounts for 10–15% of the total AP exam score. Parameter identification, interpretation, and mean/spread calculation appear on both multiple-choice (MCQ) and free-response (FRQ) sections, and this topic is a foundational step for later inference topics involving proportions. Synonyms sometimes used for parameters include "distribution constants" or "population characteristics", but on the AP exam you will always explicitly work with (number of trials) and (per-trial success probability).
2. Identifying Binomial Parameters and
To correctly identify and , you first must confirm the four binomial conditions, often remembered by the acronym BINS: 1. Binary outcomes: every trial has only two possible outcomes (success or failure, where success is just the outcome you are counting). 2. Independent trials: the outcome of one trial does not change the probability of outcomes for other trials. 3. Number of trials is fixed: you set the number of trials before starting to count successes; you do not stop after reaching a fixed number of successes. 4. Same probability of success: the probability of success is identical for every trial. If you are sampling without replacement from a finite population, you must confirm the 10% condition (your sample size is less than 10% of the total population) to justify independent trials and constant . Once BINS is confirmed, is the fixed total number of independent trials, and is the probability of success on any single trial. A common misconception is that "success" must be a desirable outcome—this is not true; success is just the outcome you are counting.
Worked Example
A wildlife biologist tags 25 sea turtles caught at random from a large breeding population of over 500 turtles. Historical data shows 12% of the turtles in this population carry a specific parasite. Let = the number of tagged turtles that carry the parasite. Identify and , and confirm BINS conditions for a binomial distribution.
- Check BINS conditions: (1) Binary: each turtle either carries the parasite (success, our outcome of interest) or does not (failure). (2) Independent: the sample of 25 is less than 10% of the 500+ total population, so the 10% condition is met, and independence is reasonable. (3) Fixed number of trials: 25 turtles were selected before testing, so the number of trials is fixed. (4) Constant probability: 12% of the population carries the parasite, so is constant for all trials. All BINS conditions are satisfied.
- is the total number of trials, so .
- is the probability of success (a turtle carries the parasite) on a single trial, so .
Exam tip: AP graders require explicit confirmation of the 10% condition for sampling without replacement on FRQ—always add this check to earn full credit.
3. Mean (Expected Value) of a Binomial Distribution
The mean (also called expected value) of a binomial random variable is the long-run average number of successes you would expect over many repetitions of the same -trial experiment. The formula for the mean of a binomial distribution can be derived from the general definition of expected value for a discrete random variable, and it simplifies to a very straightforward expression: Intuition for this formula matches common sense: if you have 100 trials with a 50% chance of success per trial, you expect 50 successes on average, which is exactly . On the AP exam, you will almost always be asked to interpret the mean in context, not just compute the numerical value. A good interpretation references the long-run average over repeated samples or experiments.
Worked Example
A coffee shop finds that 28% of all customers buy a pastry with their coffee. On a given weekday, the coffee shop serves 96 independent customers. Find the expected number of customers who buy a pastry with their coffee, and interpret your result in context.
- Confirm binomial conditions: Binary (customer buys a pastry = success, no pastry = failure), independent orders given, fixed , constant . All conditions are met.
- Apply the binomial mean formula: .
- Interpretation: If we repeatedly observe many samples of 96 customers, the average number of customers who buy a pastry with their coffee is approximately 26.9.
Exam tip: Always include context in interpretations of parameters—an answer with only a numerical value will not earn full credit on FRQ.
4. Variance and Standard Deviation of a Binomial Distribution
Variance measures the spread of the binomial distribution of , and standard deviation is the square root of variance, measured in the same units as (number of successes). Like the mean, these values simplify to formulas based only on the parameters and : Intuition: Spread increases as the number of trials increases, because more trials lead to more possible variation in the number of successes. Spread is maximized when (maximum uncertainty about the outcome) and approaches 0 when approaches 0 or 1 (very little uncertainty about the number of successes). Standard deviation is interpreted as the typical deviation of the number of successes from the mean across repeated experiments.
Worked Example
Continuing the coffee shop example from the previous section: 28% of 96 independent customers buy a pastry. Calculate the standard deviation of the number of customers who buy a pastry, and interpret it in context.
- Bins conditions are already confirmed, so we can use the binomial standard deviation formula.
- Plug in , , : .
- Interpretation: The number of customers who buy a pastry in repeated samples of 96 customers typically varies by about 4.4 customers from the mean of 26.9.
Exam tip: Double-check whether the question asks for variance or standard deviation—stopping at the variance value when standard deviation is requested is one of the most common points of deduction on this topic.
5. Common Pitfalls (and how to avoid them)
- Wrong move: Calling the number of successes instead of the number of trials. Why: Students confuse the value of the random variable (which counts successes) with the fixed parameter . Correct move: Always ask "How many trials did we run?" that answer is ; the number of successes is the value of , not .
- Wrong move: Flipping and because "success" is an undesirable outcome like a defect or disease. Why: The word "success" sounds positive, so students incorrectly use the probability of the good outcome as . Correct move: Always label the outcome you are counting first; is the probability of that outcome, regardless of whether it is good or bad.
- Wrong move: Reporting the expected number of successes as instead of . Why: Students confuse the per-trial probability of success with the expected total number of successes across trials. Correct move: If the question asks for the expected number of successes, always multiply by the number of trials .
- Wrong move: Forgetting the 10% condition when sampling without replacement from a small population. Why: Students assume independence is always true without checking, even when sampling more than 10% of the population. Correct move: Whenever sampling without replacement, explicitly check that the sample size is less than 10% of the population before using binomial formulas.
- Wrong move: Using binomial parameters for a geometric distribution, where the number of successes is fixed and the number of trials is variable. Why: Students confuse binomial and geometric settings which both have binary independent trials. Correct move: Always check that the number of trials is fixed before using binomial parameters; if the number of successes is fixed, use geometric parameters instead.
- Wrong move: Interpreting the expected value as "the number of successes we expect in one trial" when the question asks for the expected number across trials. Why: Students misread the question's unit of interest. Correct move: Underline what the question is asking to count before starting your calculation.
6. Practice Questions (AP Statistics Style)
Question 1 (Multiple Choice)
A local bakery notes that 22% of all customers who enter will buy a loaf of sourdough bread. The owner counts the number of customers who buy sourdough out of the next 80 customers who enter the bakery, and assumes each customer's decision is independent. What are the mean and standard deviation of the number of sourdough buyers, respectively? A) , B) , C) , D) ,
Worked Solution: This is a binomial setting with trials and probability of success (sourdough buyer). Calculate the mean: , which eliminates options A and D. Next calculate the standard deviation: . Option B gives the variance, not the standard deviation, so the correct answer is C.
Question 2 (Free Response)
A university registrar finds that 48% of all enrolled undergraduate students will take a summer course at the university. The university has 820 enrolled undergraduate students in the current junior class, and we can assume enrollment decisions for summer courses are independent. (a) Identify the parameters and for the binomial distribution of the number of juniors who take a summer course, and confirm that binomial conditions are met. (b) Calculate the mean and standard deviation of the number of juniors who take a summer course. (c) Interpret the standard deviation in context.
Worked Solution: (a) Confirm BINS conditions: 1. Binary: each junior either takes a summer course (success) or does not (failure). 2. Independent: decisions are independent as given. 3. Fixed number of trials: 820 juniors in the class, so is fixed. 4. Constant probability: for all juniors. The 820 juniors are the entire population of the class, so no 10% condition is needed. Parameters are and . (b) Mean: . Standard deviation: . (c) Interpretation: In repeated samples of 820 juniors, the number of juniors who take a summer course typically varies by about 14.3 students from the mean of 393.6.
Question 3 (Application / Real-World Style)
A plant biologist is studying flower color in pea plants. Each offspring plant has a 75% probability of producing purple flowers, independent of other offspring. The biologist grows 240 offspring plants for an experiment. What is the expected number of plants that produce purple flowers, and what is the typical deviation from this expected value? Interpret your results in context.
Worked Solution: This is a binomial setting with trials (plants) and probability of success (purple flowers). Expected number of purple-flowered plants is . The typical deviation (standard deviation) is . Interpretation: If this experiment was repeated many times with 240 offspring plants each, the average number of purple-flowered plants would be 180, and the actual number of purple-flowered plants in a single experiment typically differs from 180 by about 6.7 plants.
7. Quick Reference Cheatsheet
| Category | Formula / Definition | Notes |
|---|---|---|
| Binomial Parameter | Must be fixed before counting successes; only for binomial settings | |
| Binomial Parameter | Success = outcome being counted, does not need to be desirable | |
| 10% Condition | Required for sampling without replacement to justify independent trials/constant ; = population size | |
| Mean (Expected Value) | Measured in units of number of successes | |
| Variance | Measured in squared units of number of successes | |
| Standard Deviation | Measures typical deviation from the mean, in units of number of successes |
8. What's Next
Mastery of binomial parameters is an essential prerequisite for all subsequent work involving binomial distributions, starting with calculating binomial probabilities for specific numbers of successes, which you will study immediately after this topic. Without correctly identifying and and calculating the correct mean and spread, you cannot correctly compute probabilities or conduct valid inference for binomial data. Later in the course, binomial parameters form the foundation for inference on a population proportion, one of the most heavily tested inference topics on the AP exam: when you construct a confidence interval for a proportion or conduct a significance test for , you rely on the binomial distribution to check the Normal approximation condition, which depends on and .
Binomial Probability Calculations Normal Approximation of Binomial Distributions Geometric Distribution Parameters Inference for One Population Proportion