Conditional Probability

Lecture 12

Dr. Greg Chism

University of Arizona
INFO 511 - Spring 2025

Conditional probability

The probability an event will occur given that another event has already occurred is a conditional probability. The conditional probability of event \(A\) given event \(B\) is:

\[P(A | B) = \frac{P(A \text{ and } B)}{P(B)}\]

Conditional probabilities

\[P(A | B) = \frac{P(A \text{ and } B)}{P(B)}\]

Examples come up all the time in the real world:

  • Given that it rained yesterday, what is the probability that it will rain today?
  • Given that a mammogram comes back positive, what is the probability that a woman has breast cancer?
  • Given that I’ve already watched six episodes of How I Met Your Mother tonight, what is the probability that I’ll get any work done this evening?

Coffee and mortality


Did not die

Died
Does not drink coffee 5438 1039
Drinks coffee occasionally 29712 4440
Drinks coffee regularly 24934 3601

Three probabilities


Did not die

Died
Does not drink coffee 5438 1039
Drinks coffee occasionally 29712 4440
Drinks coffee regularly 24934 3601

Define events \(A\) = died and \(B\) = non-coffee drinker. Calculate the following for a randomly selected person in the cohort:

  • Marginal probability: \(P(A)\), \(P(B)\)

  • Joint probability: \(P(A \text{ and } B)\)

  • Conditional probability: \(P(A | B)\), \(P(B | A)\)

Independence

The multiplicative rule

We can write the definition of condition probability

\[P(A | B) = \frac{P(A \text{ and } B)}{P(B)}\]


Using the equation above, we get…

\[P(B) \times P(A | B) = P(A \text{ and } B)\]

What does the multiplicative rule mean in plain English?

Defining independence

Events \(A\) and \(B\) are said to be independent when

\[P(A | B) = P(A) \hspace{10mm} \textbf{OR} \hspace{10mm} P(B | A) = P(B)\]


In other words, knowing that one event has occurred doesn’t cause us to “adjust” the probability we assign to another event.

Checking independence

We can use the multiplicative rule to see if two events are independent.

If events \(A\) and \(B\) are independent, then

\[P(A \text{ and } B) = P(A) \times P(B)\]

Independent vs. disjoint events

Since for two independent events \(P(A|B) = P(A)\) and \(P(B|A) = P(B)\), knowing that one event has occurred tells us nothing more about the probability of the other occurring.

For two disjoint events \(A\) and \(B\), knowing that one has occurred tells us that the other definitely has not occurred: \(P(A \text{ and } B) = 0\).

Disjoint events are not independent!

Checking independence


Did not die

Died
Does not drink coffee 5438 1039
Drinks coffee occasionally 29712 4440
Drinks coffee regularly 24934 3601


Are dying and abstaining from coffee independent events? How might we check?

Bayes’ Rule

An example

In an introductory statistics course, 50% of students were first years, 30% were sophomores, and 20% were upperclassmen.

80% of the first years didn’t get enough sleep, 40% of the sophomores didn’t get enough sleep, and 10% of the upperclassmen didn’t get enough sleep.

What is the probability that a randomly selected student in this class didn’t get enough sleep?

Bayes’ Rule

As we saw before, the two conditional probabilities \(P(A | B)\) and \(P(B | A)\) are not the same. But are they related in some way?

Yes they are (!) using Bayes’ rule:

Bayes’ rule:

\[\begin{align}P(A | B) &= \frac{P(A \text{ and } B)}{P(B)}\\[10pt] &= \frac{P(B | A)P(A)}{P(B)} \end{align}\]

Bayes’ Rule (continued)

Putting together a few rules of probability…

\[\begin{align}P(A | B) &= \frac{P(A \text{ and } B)}{P(B)}\\[10pt] &= \frac{P(B | A)P(A)}{P(B)}\\[15pt] &= \frac{P(B | A)P(A)}{P(B | A)P(A) + P(B | A^c)P(A^c)}\end{align}\]

Let’s took at an example to see how this works.

Diagnostic Testing

Definitions

Suppose we’re interested in the performance of a diagnostic test. Let \(D\) be the event that a patient has the disease, and let \(T\) be the event that the test is positive for that disease.

  • Prevalence: \(P(D)\)
  • Sensitivity: \(P(T | D)\)
  • Specificity: \(P(T^c | D^c)\)
  • Positive predictive value: \(P(D | T)\)
  • Negative predictive value: \(P(D^c | T^c)\)

What do these probabilities mean in plain English?

Rapid self-administered COVID-19 tests

For a Abbott BinaxNOW COVID-19 Rapid antigen tests,

  • Sensitivity, \(P(T | D)\), is 64.2% in symptomatic individuals

  • Specificity, \(P(T^c | D^c)\), is 99.8%

  • From CDC statistics in 2021, with 8.7% prevalence from Pima County, Arizona persons aged ≥10 years.

Suppose a randomly selected American aged 13+ has a positive test result. What is the probability they have COVID-19?

Using Bayes’ Rule

\[\begin{align*} P(D | T) &= \frac{P(D \text{ and } T)}{P(T)}\\ &= \frac{P(T | D)P(D)}{P(T)}\\[5pt] &= \frac{P(T | D)P(D)}{P(T | D)P(D) + P(T | D^c)P(D^c)}\\[5pt] &= \frac{P(T | D)P(D)}{P(T | D)P(D) + (1 - P(T^c | D^c))(1 - P(D))} \end{align*}\]


What does all of this mean? Let’s take a look!

ae-08

Given:

  • Prevalence: \(P(D)\) = 8.7% = 0.087
  • Sensitivity: \(P(T | D)\) = 64.2% = 0.642
  • Specificity: \(P(T^c | D^c)\) = 99.8% = 0.998

Work through ae-08 then move on to the discussion questions

A discussion

Think about the following questions:

  • Is this calculation surprising?
  • What is the explanation?
  • Was this calculation actually reasonable to perform?
  • What if we tested in a different population, such as high-risk individuals?
  • What if we were to test a random individual in a county where the prevalence of COVID-19 is approximately 25%?