统计学 · CED 第4单元：概率、随机变量与概率分布 · 阅读约 14 分钟 · 更新于 2026-05-11

几何分布 — AP 统计学

AP 统计学 · CED 第4单元：概率、随机变量与概率分布 · 14 min read

1. 什么是几何分布？ ★★☆☆☆ ⏱ 3 min

几何分布是一种离散概率分布，用于模拟一系列重复伯努利（两种结果）试验中，获得第一次成功所需的独立试验次数。它常被称为“等待时间分布”，因为我们测量的是为等待第一次成功需要进行多少次试验。

与固定试验次数、统计成功次数的二项分布不同，几何分布反转了这一框架：它固定了每次试验的成功概率，将试验次数作为感兴趣的随机变量。AP统计学统一使用“偏移”约定，即我们从1开始计数试验，这与官方CED定义一致。

2. 几何分布场景的条件 ★★☆☆☆ ⏱ 4 min

在你使用几何分布计算概率或期望值之前，你必须确认场景满足所有四个要求条件，可以缩写为**BITS**：

**B**：每次试验有两种可能结果：每次试验的结果要么是“成功”（我们等待的结果），要么是“失败”（另一种结果）。
**I**：试验独立：一次试验的结果不会改变其他任何试验的成功概率。
**T**：等待第一次成功：试验次数不提前固定；我们测量的值是获得第一次成功所需的试验次数。
**S**：成功概率恒定：每次试验的成功概率$p$都相同。

📐 Worked Example

一家咖啡店推出促销活动，15%的咖啡杯内含有免费甜甜券。顾客每次买一杯咖啡，直到获得优惠券为止。顾客购买的咖啡杯数能否用几何分布建模？检查所有条件。

检查两种结果条件：每杯咖啡要么有优惠券（成功），要么没有（失败）。只有两种结果，因此条件B满足。
检查独立性：优惠券随机分布，因此一杯有优惠券不改变另一杯有优惠券的概率。条件I满足。
检查我们统计的内容：我们统计直到获得第一张优惠券为止的咖啡杯数（试验次数），因此试验次数不提前固定。条件T满足。
检查概率恒定：所有咖啡杯获得优惠券的概率都是15%，因此$p=0.15$是恒定的。条件S满足。
结论：所有条件都满足，因此该场景可以用几何分布建模。

Exam tip: 当被要求为某个场景选择合适的分布时，一定要先回答“我们是统计直到成功的试验次数，还是固定试验中的成功次数？”——这个问题能立刻排除50%的错误选项。

3. 几何概率计算（PMF和CDF） ★★★☆☆ ⏱ 4 min

确认场景满足几何条件后，你可以用两个核心公式计算概率：概率质量函数（PMF）计算第一次成功恰好出现在某一次试验的概率，累积分布函数（CDF）计算第一次成功出现在某一次试验及之前的概率。

要得到第一次成功*恰好*出现在第$k$次试验的概率，说明你前$k-1$次连续失败，然后第$k$次成功。由于试验独立，我们将概率相乘：

P(X = k) = (1-p)^{k-1}p

for $k = 1, 2, 3, ...$

对于累积概率，第一次成功出现在第$k$次试验*及之前*的概率等于1减去前$k$次试验全失败的概率，这给出了一个方便的捷径：

P(X \leq k) = 1 - (1-p)^k

我们可以整理得到第一次成功出现在第$k$次试验*之后*的概率：

P(X > k) = (1-p)^k

这个捷径能在考试中节省大量时间，你不需要对多个单独概率求和。

Exam tip: 如果题目要求$P(X < k)$，一定要调整截断值得到正确指数：$P(X < k) = P(X \leq k-1) = 1 - (1-p)^{k-1}$，避免选择题中常见的偏移1错误。

4. 几何随机变量的均值和标准差 ★★★☆☆ ⏱ 3 min

几何分布的均值（期望值）和标准差公式简单直观。期望值即获得第一次成功所需试验次数的长期平均值，公式为：

E(X) = \mu_X = \frac{1}{p}

这符合直觉：如果成功概率是1/10，你平均预计要等待10次试验才能得到第一次成功。成功概率越低，期望试验次数越高，这与公式一致。

The variance of $X$ is $\text{Var}(X) = \frac{1-p}{p^2}$, so the standard deviation (a measure of the spread of the distribution) is:

\sigma_X = \frac{\sqrt{1-p}}{p}

所有几何分布都是右偏的：概率最高的位置始终在$k=1$，概率随$k$增大而减小。在FRQ题中，几乎总是要求你结合背景解释期望值，解释需要将其与多次重复的长期平均值联系起来。

Exam tip: 在FRQ题中解释期望值时，一定要包含“平均而言”和“多次重复”这两个表述，才能拿到解释部分的全部分数。

5. AP风格练习题 ★★★★☆ ⏱ 4 min

📐 Worked Example

A street artist sells hand-painted portraits, and has a 15% chance of making a sale to any random passerby who stops to look at their work. Assume each passerby is independent. Let $X$ be the number of passersby who stop before the artist makes their first sale of the day.<br>(a) Verify that $X$ can be modeled with a geometric distribution.<br>(b) Calculate $P(X > 6)$ and interpret this probability in context.<br>(c) Find the expected value of $X$ and interpret it in context.

(a) Check the four BITS conditions: 1. Two outcomes: each passerby either buys a portrait (success) or does not (failure), so B is satisfied. 2. Independent: the problem states passersby are independent, so I is satisfied. 3. We count passersby (trials) until the first sale, so the number of trials is not fixed, T is satisfied. 4. Probability of sale is 15% for all passersby, so S is satisfied. All conditions are met.
(b) Use the geometric shortcut for $P(X > k)$:
$P(X > 6) = (0.85)^6 \approx 0.377$
Interpretation: There is about a 37.7% chance that the artist will not make a sale to the first 6 passersby who stop.
(c) Calculate expected value:
$E(X) = 1/p = 1/0.15 \approx 6.67$
Interpretation: Over many days where the artist waits for the first sale of the day, the average number of passersby who stop before the first sale is about 6.67.

📐 Worked Example

A geneticist is studying a recessive trait in pea plants. Each offspring plant has a 25% chance of expressing the recessive trait, independent of other offspring. The geneticist is growing plants one at a time until they get 1 plant that expresses the recessive trait for an experiment. Each plant costs \$1.20 in supplies. What is the expected total cost of the experiment? What is the probability the geneticist gets the desired plant within the first 3 plants grown?

Let $X$ be the number of plants grown until the first recessive trait plant is obtained. $X$ is a geometric random variable with $p=0.25$.
Calculate the expected number of plants:
$E(X) = 1/0.25 = 4$
Multiply by cost per plant to get expected total cost:
$4 * 1.20 = 4.80$
The expected total cost is \$4.80.
Calculate the probability of getting the plant within the first 3 plants:
$P(X \leq 3) = 1 - (0.75)^3 = 1 - 0.4219 = 0.5781$
There is a 57.8% chance the geneticist will get the desired plant within the first 3 plants grown.

Common Pitfalls

Why: Confusion between different geometric distribution conventions used in different textbooks; AP exclusively uses the shifted (trial-counting) convention.

Why: Both use Bernoulli trials, so students forget to check what is being counted and whether the number of trials is fixed.

Why: Off-by-one error from misinterpreting the inequality cutoff.

Why: Confusion between probability of success per trial and expected number of trials until first success.

Why: Students forget that independence is violated when sampling without replacement from small populations, just like in binomial settings.

Quick Reference Cheatsheet

← 返回章节主页

某道题卡住了？
拍照或粘贴题目 — 小欧（我们的 AI 学习助手）会一步步讲解并配示意图。
免费试用小欧 →

几何分布 — AP 统计学

1. 什么是几何分布？ ★★☆☆☆ ⏱ 3 min

2. 几何分布场景的条件 ★★☆☆☆ ⏱ 4 min

3. 几何概率计算（PMF和CDF） ★★★☆☆ ⏱ 4 min

4. 几何随机变量的均值和标准差 ★★★☆☆ ⏱ 3 min

5. AP风格练习题 ★★★★☆ ⏱ 4 min

Common Pitfalls

Quick Reference Cheatsheet

更多学习指南