15 Equally Likely Outcomes and Counting Rules

For a sample space $\Omega$ with finitely many possible outcomes, assuming equally likely outcomes corresponds to a probabiliy measure $\text{P}$ which satisfies \[ \text{P}(A) = \frac{|A|}{|\Omega|} = \frac{\text{number of outcomes in $A$}}{\text{number of outcomes in $\Omega$}} \qquad{\text{when outcomes are equally likely}} \]
Computing probabilities in the equally likely case reduces to just counting outcomes.
But remember:
- In most situations, outcomes are not equally likely
- Even if the sample space outcomes are equally likely, the possible values of related random variables are usually not.
However, rules for counting outcomes are useful even in situations where the outcomes are not equally likely

Example 15.1 Suppose that to send an internet packet from the east coast of the US to the west coast, a packet must go through a major east-coast city (Boston, New York, Washington DC, or Atlanta), then a major midwest city (Chicago, St. Louis, or New Orleans), and then a major west-coast city (San Francisco or Los Angeles). How many possible routes are there?

Multiplication principle for counting. Suppose that stage 1 of a process can be completed in any one of $n_1$ ways. Further, suppose that for each way of completing the stage 1, stage 2 can be completed in any one of $n_2$ ways. Then the two-stage process can be completed in any one of $n_1\times n_2$ ways.
This rule extends naturally to a $\ell$-stage process, which can then be completed in any one of $n_1\times n_2\times n_3\times\cdots\times n_\ell$ ways.
In the multiplication principle it is not important whether there is a “first” or “second” stage. What is important is that there are distinct stages, each with its own number of “choices”.

Example 15.2 Suppose the board of directors of a corporation has identified 5 candidates — Ariana, Beyonce, Cardi, Drake, Elvis — for three distinct executive positions: chief executive officer (CEO), chief financial officer (CFO), and chief operating officer (COO). In the interest of fairness, the board assigns 3 of the 5 candidates to the positions completely at random. No individual can hold more than one of the positions.

When calculating probabilities below, consider the sample space of all possible executive teams.

How many executive teams are possible?
What is the probability that Ariana is CEO, Beyonce is CFO, and Cardi is COO?
What is the probability that Ariana is CEO and Beyonce is CFO?
What is the probability that Ariana is CEO?

Number of “ordered” arrangements. The number of “ordered” arrangements of $k$ items, selected without replacement from a set of $n$ distinct items is \[ n(n-1)(n-2)\cdots(n-k+1) = \frac{n!}{(n-k)!} \]
Recall the factorial notation: $m!=m(m-1)(m-2)\cdots (3)(2)(1)$. For example, $5!=5\times4\times3\times2\times1=120$. By definition, 0!=1.

Example 15.3 Your boss is forming a committee of 3 people for a new project team, and 5 people — Ariana, Beyonce, Cardi, Drake, Elvis— have volunteered to be on the committee. In the interest of fairness, 3 of the 5 people will be selected uniformly at random to form the committee.

How many possible committees consist of Ariana, Beyonce, Cardi? How many executive teams from the previous example consisted of Ariana, Beyonce, Cardi? How is this example different from the executive team example?
How many different possible committees of 3 people can be formed from the 5 volunteers?

The following is the relationship between “ordered” and “unordered” counting. \[\begin{align*} & \quad \left(\text{number of ``ordered'' selections of $k$ from $n$}\right)\\ = &\quad \left(\text{number of ``unordered'' selections of $k$ from $n$}\right)\\ &\qquad \times\left(\text{number of ways of arranging the $k$ items in order}\right). \end{align*}\]
“Ordered” and “unordered” are somewhat misnomers. It is not important whether there is a “first”, “second”, “third”, etc. What is important is whether or not there are distinct stages, each with its own number of “choices”.
- In Example 15.2, it doesn’t matter if we pick the CEO first and the CFO second or the CFO first and the CEO second; what does matter is that choosing the CEO is a distinct stage from choosing the CFO.
Number of permutations. The number of ways of arranging $k$ items in order is \[ k\times (k-1)\times (k-2)\times\cdots\times 3\times 2\times1 = k! \]
Number of combinations. The number of ways to choose $k$ items without replacement from a group of $n$ distinct items where “order” does not matter, denoted $\binom{n}{k}$, is \[ \binom{n}{k} = \frac{n(n-1)(n-2)\cdots(n-k+1)}{k!} = \frac{n!}{k!(n-k)!} \]
The quantity on the right is just a compact way of representing the quantity in the middle. But since factorials can be very large, it’s best to use the quantity in the middle to compute. In R: choose(n, k). In Python: math.comb(n, k)
The symbol $\binom{n}{k}$ is by definition equal to the quantity in the middle above. It is read as “$n$ choose $k$” and is referred to as a binomial coefficient.

Example 15.4 Continuous Example 15.3. Your boss is forming a committee of 3 people for a new project team, and 5 people — Ariana, Beyonce, Cardi, Drake, Elvis— have volunteered to be on the committee. In the interest of fairness, 3 of the 5 people will be selected uniformly at random to form the committee.

Find the probability that the committee consists of Ariana, Beyonce, and Cardi.
Find the probability that Ariana and Beyonce are on the committee.
Find the probability that Ariana is on the committee.

Example 15.5 A standard deck of 52 cards contains 4 suits (hearts, diamonds, spades, clubs) each consisting of 13 different face values (2 through 10, jack, queen, king, ace). The deck is well shuffled and a hand of 5 cards is dealt (without replacement as is usual in practice).

How many possible hands are there?
What is the probability the hand contains exactly 4 hearts?
What is the probability the hand contains 3 aces and 2 kings?
What is the probability the hand is a full house (3 cards of one face value and 2 of another)?

Example 15.6 Capture-recapture sampling is a technique often used to estimate the size of a population. Suppose you want to estimate $N$, the number of monarch butterflies in Pismo Beach. Assume that $N$ is a fixed but unknown number¹; the population size doesn’t change over time. You first capture a sample of $N_1$ butterflies, selected randomly without replacement, and tag them and release them. At a later date, you then capture a second sample of $n$ butterflies, selected randomly from the $N$ butterflies without replacement. Let $X$ be the number of butterflies in the second sample that have tags (because they were also caught in the first sample). (Assume that the tagging has no effect on behavior, so that selection in the first sample is independent of selection in the second sample.)

In practice, $N$ is unknown and the goal is to estimate it based on what we observe for $X$; we’ll see later how to do this later. For now, we’ll start with a simpler, but unrealistic, example where $N$ is known. Assume there are $N=52$ butterflies, $N_1 = 13$ are tagged in the first sample, and $n=5$ is the size of the second sample.

What are the possible values of $X$?
Describe in detail how you could use simulation to approximate the distribution of $X$.
Find $\text{P}(X = 0)$ in two ways. Interpret the value in context.
Find the probability that in the second sample the first butterfly selected is tagged but the rest are not.
Find the probability that in the second sample the first four butterflies selected are not tagged but the fifth is.
Find $\text{P}(X = 1)$ in two ways. Interpret the value in context.
Find $\text{P}(X = 2)$ in two ways. Interpret the value in context.
Suggest a formula for determining the distribution of $X$.
Suggest a simple shortcut formula for $\text{E}(X)$.
Use the distribution of $X$ to compute $\text{E}(X)$. Does it match with the shortcut formula? Intepret the value in context.

Consider a population of size $N=N_1+N_0$ (e.g., a box with $N$ tickets)
- Each individual in the population can be classified as “success” (1) or “failure” (0)
- $N_1$ individuals in the population are “success” (1)
- $N_0$ individuals in the population are “failure” (0)
Randomly select a sample of $n$ individuals from a population with $N_1$ successes and $N_0$ failures without replacement and let $X$ be the number of individuals in the selected sample that are successes. The distribution of $X$ is defined to be the Hypergeometric($N_1$, $N_0$, $n$) distribution.
- If successes are labeled as 1 and failures as 0, the random variable $X$ which counts the number of successes in the sample is equal to the sum of the 1/0 labels of the individuals in the sample.
A discrete random variable $X$ has a Hypergeometric distribution with with parameters $n, N_0, N_1$, all nonnegative integers—with $N = N_0+N_1$ and $p=N_1/N$—if its distribution satisfies² \[\begin{align*} \text{P}(X=x) & = \frac{\binom{N_1}{x}\binom{N_0}{n-x}}{\binom{N_0+N_1}{n}},\quad x = 0, 1, \ldots, n;\, x \le N_1;\, x \ge n - N_0 \\ \end{align*}\]
If $X$ has a Hypergeometric($n$, $N_1$, $N_0$) distribution \[\begin{align*} \text{E}(X) & = np\\ \text{Var}(X) & = np(1-p)\left(\frac{N-n}{N-1}\right) \end{align*}\]

Another approach—a Bayesian approach—is to treat the unknown number $N$ as a random variable with a distribution that quantifies its uncertainty.↩︎
We must have $X\le N_1$ since there can’t be more successes in the sample than there are in the population. Similarly, we must have $X\ge n - N_0$, that is $n-X\le N_0$, since there can’t be more failures in the sample than there are in the population. Often the population sizes $N_0$ and $N_1$ are large relative to the sample size $n$ in which case $X$ simply takes values $0, 1,\ldots, n$.↩︎