Confidence Interval WS
Transcription
Confidence Interval WS
CONFIDENCE INTERVAL WORKSHEET MTH 3210 SPRING 2015 1. Supplementary Notes Confidence intervals are the first example we have of inferential statistics in this course. Recall that the objective of inferential statistics is to make a correct statement about a population parameter based on information from a single sample. For this quiz, the population parameter being estimated is the unknown population mean, µ. We do not know µ because we do not have a census; that is, we do not have all the data in our entire population. In any situation where the entire population is unattainable, especially in situations where only a relatively small sample is realistically attainable, we must use a sample from our population to build a precise interval estimate of µ with a prescribed reliability called the confidence level. There are 2 types of confidence intervals for the unknown mean, µ, of a single population. Recipes for each of these confidence intervals follow. For both of these confidence intervals, it is crucially important that the sample collected is a simple random sample, that is, every experimental unit in the population has the same chance of being selected for the sample. The foundational theorem that the following confidence intervals are built upon is the Central Limit Theorem. In order to use the Central Limit Theorem reliably we must collect a simple random sample and we must assume that the size of the sample is greater than 30 or that the underlying population that we are working with is normally distributed. 1.1. Confidence interval for µ when σ is known. For this test, we assume that the population standard deviation, σ, is known. If σ is not known (as is usually the case), then we must use the next confidence interval, below. Precisely, we assume that • We have a simple random sample • The sample size is at least 30 or the underlying population has a normal distribution • The population standard deviation, σ, is already known. A 100(1 − α)% confidence interval for the population mean µ of the population is given by, σ σ x¯ − zα/2 √ , x¯ + zα/2 √ n n Here n is the sample size, x is the sample mean, and zα/2 is the z-value under the normal curve for which the area under the curve, to the right of z, is α/2. The zα/2 values can be inferred from Table II in the book. 1 2 CONFIDENCE INTERVAL WORKSHEET MTH 3210 SPRING 2015 1.2. Confidence interval for µ when σ is unknown. For this test, we do not need to know the population standard deviation σ. The cost of not knowing σ is that we now need to use the t-distribution, which depends on the number of degrees of freedom, df ; in this case, df = n − 1, where n is our sample size. Precisely, we assume that • We have a simple random sample • The sample size is at least 30 or the underlying population has a normal distribution A 100(1 − α)% confidence interval for the population mean µ of the population is given by, s s (¯ x − tα/2 √ , x¯ + tα/2 √ ) n n Here n is the sample size, x is the sample mean, s is the sample standard deviation and tα/2 is the t-value under the t-distribution (with n − 1 degrees of freedom) curve for which the area under the curve, to the right of t, is α/2. Such values can be inferred from Table IV in the book. 2. Practice Problems 2.1. Practice Problem. For these exercises, σ is known. Find a 100(1 − α)% confidence interval for µ in each case. Assume that the underlying population is normally distributed. (1) x¯ = 20, σ = 3, n = 36, α = .05 (2) x¯ = 25, σ = 3, n = 36, α = .01 (3) x¯ = 30, σ = 3, n = 25, α = .1 2.2. Practice Problem. For these exercises, σ is NOT known. Find a 100(1 − α)% confidence interval for µ in each case. Assume that the underlying population is normally distributed. (1) x¯ = 35, s = 3.76, n = 25, α = .1 (2) x¯ = 50, s = 5.02, n = 16, α = .01 (3) x¯ = 50, s = 3.71, n = 12, α = .05 2.3. Practice Problem. A simple random sample of 36 St Bernard dog weights yields a sample mean of 193 pounds. It is known that the standard deviation of the population of all St Bernard dog weights is 18 pounds. Find a confidence interval for the true mean weight of all St Bernard dogs with a 99% confidence level. 2.4. Practice Problem. Eleven regions in the Congolese rain forest are randomly sampled. In each region rainfall was monitored for one year, and the following total yearly rainfalls, in centimeters, were reported: {276, 255, 255, 297, 213, 241, 269, 262, 145, 185, 209} CONFIDENCE INTERVAL WORKSHEET MTH 3210 SPRING 2015 3 Assume that yearly rainfalls within the Congolese rain forest are distributed normally. Find a confidence interval for the true mean rainfall in the Congolese rain forest with a 95% confidence level. 3. Quiz Problems 3.1. Quiz Problem. For these exercises, σ is known. Find a 100(1−α)% confidence interval for µ in each case. Assume that the underlying population is normally distributed. (1) x¯ = 1.05, σ = 0.25, n = 9, α = .05 (2) x¯ = 1990, σ = 120, n = 36, α = .01 (3) x¯ = 31.3, σ = 2.67, n = 16, α = .1 3.2. Quiz Problem. (1) x¯ = 43.1, s = 6.8, n = 26, α = .01 (2) x¯ = −10, s = 1.34, n = 9, α = .1 (3) x¯ = 3600, s = 94, n = 40, α = .05 3.3. Quiz Problem. A simple random sample of 21 chihuahua dog weights yields a sample mean of 5.6 pounds. It is known that the standard deviation of the population of all chihuahua weights is 1.8 pounds. Find a confidence interval for the true mean weight of all chihuahuas with a 95% confidence level. 3.4. Quiz Problem. Fifteen regions in an Amazon rain forest are randomly sampled. The total yearly rainfalls, in centimeters, for these regions were reported as follows: {181, 232, 230, 158, 242, 197, 228, 302, 242, 194, 200, 186, 146, 194, 202} Assume that yearly rainfalls within the Amazon rain forest are distributed normally. Find a confidence interval for the true mean rainfall in the Amazon rain forest with a 95% confidence level. Solution: Since we are not given the population standard deviation, σ, we need to compute the sample standard deviation and use the latter of our two confidence interval recipes. We find that x¯ = 208.9 and s = 38.51. Since we can assume that yearly rainfalls within the Amazon rain forest are distributed normally, we can apply the t confidence interval method despite the small sample size. Also, df = n − 1 = 14 Therefore, a 95% confidence interval for the true mean rainfall in the Congolese rain forest is given by, 38.51 38.51 208.9 ± t.025,14 √ = 208.9 ± 2.144 = [187.582, 230.218] 3.873 15 Appendix A. Solutions Practice Problems A.1. Solution to Practice Problem 2.1. (1) 20 ± z.025 √336 = 20 ± 1.96 63 = [19.02, 20.98] (2) 25 ± z.005 √336 = 25 ± 2.58 63 = [23.71, 26.29] (3) 30 ± z.05 √325 = 30 ± 1.64 35 = [29.01, 30.98] 4 CONFIDENCE INTERVAL WORKSHEET MTH 3210 SPRING 2015 A.2. Solution to Practice Problem 2.2. 3.76 (1) 35 ± t.05,24 √ = 35 ± 1.711 3.76 = [33.71, 36.29] 5 25 5.02 5.02 (2) 50 ± t.005,15 √16 = 50 ± 2.95 4 = [46.3, 53.7] 3.71 3.71 (3) 50 ± t.025,11 √ = 50 ± 2.2 √ = [47.64, 52.36] 12 12 A.3. Solution to Practice Problem 2.3. Since we are given the population standard deviation, σ = 18, we us the first of our two confidence interval recipes. Therefore, a 99% confidence interval for the true mean weight of all St Bernard dogs is given by, 18 193 ± z.005 √ = [185.27, 200.72] 36 A.4. Solution to Practice Problem 2.4. Since we are not given the population standard deviation, σ, we need to compute the sample standard deviation and use the latter of our two confidence interval recipes. We find that x¯ = 237 and s = 44.68. Since we can assume that yearly rainfalls within the Congolese rain forest are distributed normally, we can apply the t confidence interval method despite the small sample size. Also, df = n − 1 = 10 Therefore, a 95% confidence interval for the true mean rainfall in the Congolese rain forest is given by, 44.68 = [206.98, 267.04] 237 ± t.025,10 √ 11 Appendix B. Solutions to Quiz Problems Solution to 3.1: .25 = [0.887, 1.213] (1) 1.05 ± z.025 √ = 1.05 ± 1.96 .25 3 9 120 120 (2) 1990 ± z.005 √36 = 1990 ± 2.58 6 = [1938.4, 2041.6] 2.67 (3) 31.3 ± z.05 √ = 31.3 ± 1.64 2.67 = [30.21, 32.395] 4 16 Solution to 3.2: 6.8 (1) 43.1 ± t.025,25 √6.8 = 43.1 ± 2.79 5.099 = [39.38, 46.82] 26 1.34 1.34 (2) −10 ± t.005,8 √9 = −10 ± 1.86 3 = [−10.83, −9.17] 94 (3) 3600 ± t.05,39 √9440 = 3600 ± 2.02 6.325 = [3570, 3630] Solution to 3.3: Since we are given the population standard deviation, σ = 1.8, we us the first of our two confidence interval recipes. Therefore, a 95% confidence interval for the true mean weight of all chihuahuas is given by, 1.8 1.8 5.6 ± z.025 √ = 5.6 ± 1.96 = [4.83, 6.38] 4.58 21 Solution to 3.4: Since we are not given the population standard deviation, σ, we need to compute the sample standard deviation and use the latter of our two confidence interval recipes. We find that x¯ = 208.9 and s = 38.51. Since we can assume that yearly rainfalls within the Amazon rain forest are distributed normally, we can apply the t confidence interval CONFIDENCE INTERVAL WORKSHEET MTH 3210 SPRING 2015 5 method despite the small sample size. Also, df = n − 1 = 14 Therefore, a 95% confidence interval for the true mean rainfall in the Congolese rain forest is given by, 38.51 38.51 208.9 ± t.025,14 √ = [187.582, 230.218] = 208.9 ± 2.144 3.873 15