Answer:
a) 100% probability that 50 or more had high blood-lead levels in the sample taken a decade ago.
b) 0% probability that 50 or more had high blood-lead levels in the sample taken now.
Explanation:
For each children, there are only two possible outcomes. Either they have high levels of lead blood, or they do not. This means that we use the binomial probability distribution to solve this problem.
However, we are working with samples that are considerably big. So i am going to aproximate this binomial distribution to the normal.
Binomial probability distribution
Probability of exactly x sucesses on n repeated trials, with p probability.
Can be approximated to a normal distribution, using the expected value and the standard deviation.
The expected value of the binomial distribution is:
![E(X) = np](https://img.qammunity.org/2020/formulas/mathematics/college/3bk54jyet3zf9d1ttkfgkk3hm6rop2x236.png)
The standard deviation of the binomial distribution is:
![√(V(X)) = √(np(1-p))](https://img.qammunity.org/2020/formulas/mathematics/college/2sa1f9xfwifu1mdu4sxvyrexjj48j2tfk4.png)
Normal probability distribution
Problems of normally distributed samples can be solved using the z-score formula.
In a set with mean
and standard deviation
, the zscore of a measure X is given by:
![Z = (X - \mu)/(\sigma)](https://img.qammunity.org/2020/formulas/mathematics/middle-school/ijf8wrxup4oiph7gw8zex0r9316mpsigqy.png)
The Z-score measures how many standard deviations the measure is from the mean. After finding the Z-score, we look at the z-score table and find the p-value associated with this z-score. This p-value is the probability that the value of the measure is smaller than X, that is, the percentile of X. Subtracting 1 by the pvalue, we get the probability that the value of the measure is greater than X.
When we are approximating a binomial distribution to a normal one, we have that
,
.
(a) In a random sample of 216 children taken more than a decade ago, what is the probability that 50 or more had high blood-lead levels?
Here we have
.
84% of children at risk. This means that
![p = 0.84](https://img.qammunity.org/2020/formulas/mathematics/high-school/q3686gkwx630x5qs9n2c5wxv5xlg532eu8.png)
So we have
![\mu = E(X) = np = 216*0.84 = 181.44](https://img.qammunity.org/2020/formulas/mathematics/high-school/dxd0xgg4vsx5s1fvo419vzopb7pduzdamj.png)
![\sigma = √(V(X)) = √(np(1-p)) = √(216*0.84*0.16) = 5.39](https://img.qammunity.org/2020/formulas/mathematics/high-school/k8qw12phkzedg7gvrzjx3fmceue4v1gpmi.png)
The probability is 1 subtracted by the pvalue of Z when
. So
![Z = (X - \mu)/(\sigma)](https://img.qammunity.org/2020/formulas/mathematics/middle-school/ijf8wrxup4oiph7gw8zex0r9316mpsigqy.png)
![Z = (50-181.44)/(5.39)](https://img.qammunity.org/2020/formulas/mathematics/high-school/kuu07ofmdqoxjad816aml9bo07kb2pcjvg.png)
![Z = -24.38](https://img.qammunity.org/2020/formulas/mathematics/high-school/nimqfh4lxe7fh5rrnk8vlsxp9m556u3u2h.png)
has a pvalue of 0. This means that there was a 100% probability that 50 or more had high blood-lead levels in the sample taken a decade ago.
(b) In a random sample of 216 children taken now, what is the probability that 50 or more have high blood-lead levels?
Here we have
.
8% of children at risk. This means that
![p = 0.08](https://img.qammunity.org/2020/formulas/mathematics/college/d9rwnh306xy12qyju9dstzwul36bqjocwh.png)
So we have
![\mu = E(X) = np = 216*0.o8 = 17.28](https://img.qammunity.org/2020/formulas/mathematics/high-school/uuvngen33kuqarpsccggoi9yp6wpazm4ah.png)
![\sigma = √(V(X)) = √(np(1-p)) = √(216*0.08*0.92) = 3.99](https://img.qammunity.org/2020/formulas/mathematics/high-school/lja0hkf7vgtcjoziiihr3aijm6y8zmou02.png)
The probability is 1 subtracted by the pvalue of Z when
. So
![Z = (X - \mu)/(\sigma)](https://img.qammunity.org/2020/formulas/mathematics/middle-school/ijf8wrxup4oiph7gw8zex0r9316mpsigqy.png)
![Z = (50-17.28)/(3.99)](https://img.qammunity.org/2020/formulas/mathematics/high-school/ciwit9gcyll0hean1jz59zv5q7hzm0svin.png)
![Z = -8.20](https://img.qammunity.org/2020/formulas/mathematics/high-school/mdqtfo79kdkzidcz2nnqzqm0z3krrk4rjd.png)
has a pvalue of 1. This means that there is a 0% probability that 50 or more had high blood-lead levels in the sample taken now.