Answer:
90% confidence interval for the difference in true proportion of the two groups is (-0.0717, 0.0517).
Explanation:
Before building the confidence interval, we need to understand the central limit theorem and subtraction of normal variables.
Central Limit Theorem
The Central Limit Theorem estabilishes that, for a normally distributed random variable X, with mean
and standard deviation
, the sampling distribution of the sample means with size n can be approximated to a normal distribution with mean
and standard deviation
.
For a skewed variable, the Central Limit Theorem can also be applied, as long as n is at least 30.
For a proportion p in a sample of size n, the sampling distribution of the sample proportion will be approximately normal with mean
and standard deviation
![s = \sqrt{(p(1-p))/(n)}](https://img.qammunity.org/2022/formulas/mathematics/college/21siyq2l0d9z8pcii2ysmig6q1uk55fvwj.png)
Subtraction between normal variables:
When two normal variables are subtracted, the mean is the difference of the means, while the standard deviation is the square root of the sum of the variances.
First group: Sample of 163, 13% has a second episode.
This means that:
![p_1 = 0.13, s_1 = \sqrt{(0.13*0.87)/(163)} = 0.026](https://img.qammunity.org/2022/formulas/mathematics/college/jg6nssntgn4dr1l8wixtbvs87uhmnyomdc.png)
Second group: Sample of 160, 14% has a second episode
This means that:
![p_2 = 0.14, s_2 = \sqrt{(0.14*0.86)/(160)} = 0.027](https://img.qammunity.org/2022/formulas/mathematics/college/n95ljvsy4dyh9llle53wbz75wvwx2s4g29.png)
Distribution of the difference:
![p = p_1 - p_2 = 0.13 - 0.14 = -0.01](https://img.qammunity.org/2022/formulas/mathematics/college/g7lgibdg6u8m82w12whz0u7qssmg2azsyc.png)
![s = √(s_1^2+s_2^2) = √(0.026^2+0.027^2) = 0.0375](https://img.qammunity.org/2022/formulas/mathematics/college/ze56dq7yfrckjxk0roxh181wecjdh2kn40.png)
Confidence interval:
The confidence interval is:
![p \pm zs](https://img.qammunity.org/2022/formulas/mathematics/college/wj9caku600g3pv821d3qu5mork79nvmhtt.png)
In which
z is the zscore that has a pvalue of
.
90% confidence level
So
, z is the value of Z that has a pvalue of
, so
.
Lower bound:
![p - 1.645s = -0.01 - 1.645*0.0375 = -0.0717](https://img.qammunity.org/2022/formulas/mathematics/college/a96uemwet66n1zsvt0zbbpyonyvr9g6zl4.png)
Upper bound:
![p + 1.645s = -0.01 + 1.645*0.0375 = 0.0517](https://img.qammunity.org/2022/formulas/mathematics/college/igo4ckyy9xjvs8li6uassm75e68fxhxcin.png)
90% confidence interval for the difference in true proportion of the two groups is (-0.0717, 0.0517).