1. Different hypothesis Test for the mammography experiment

In the last lecture, we covered the basics of hypothesis testing with the HIP mammography study as our example. The study's aim is to determine whether offering mammographies for breast cancer detection reduces the rate of death due to breast cancer. There are 31000 individuals in each of the treatment and control groups; only those in the treatment groups are offered mammographies.

We recap the elements of the hypothesis testing framework. In the mammography study, they are:

Throughout the hypothesis test, we focused on the observed death rate in the treatment group as the variable, and compare it to \pi = 0.00203, the observed death rate in the control group. The question below examines the validity of this approach.

2. Hypergeometric probability distribution

The hypergeometric distribution is a discrete distribution based on the following probability problem:

“Suppose there are N balls in a bowl, K of which are red and the remaining N-K of which are blue. From the bowl, nballs are drawn without replacement. What is the probability that among the n balls drawn, exactly x are red?"

The solution to this problem is given by the following pmf:

\displaystyle \displaystyle \mathbb {P}(X = x) \displaystyle = \frac{\left(\text {Number of ways to choose } x \text { out of } K \text { red balls} \right) \cdot \left(\text {Number of ways to choose } n-x \text { out of } N-K \text { blue balls } \right)}{\text {Number of ways to choose } n \text { balls out of} N}
\displaystyle = \frac{\dbinom {K}{x}\dbinom {N-K}{n-x}}{\dbinom {N}{n}}.

This pmf defines the hypergeometric distribution \text {Hypergeometric}(N, K, n) with the three parameters: