Objectives Qualls - 6. Confidence Interval of Population Proportion Lecture 6
Transcription
Objectives Qualls - 6. Confidence Interval of Population Proportion Lecture 6
Qualls - 6. Confidence Interval of Population Proportion Objectives Lecture 6 At the end of this section you should be able to answer questions concerning point and interval estimates of a population proportion, and determining the requisite sample size for a given confidence level. Confidence Intervals and Sample Sizes Part 2: Proportions Specifically, you should understand: • the difference between a point estimate and an interval estimate • how to calculate a confidence interval for a population proportion • how to determine the requisite sample size given a desired margin of error and confidence level. DePaul University Bill Qualls 1 2 Point Estimate of a Population Proportion • We used the following example when we introduced the binomial distribution: Assume my free-throw average is 40%. If I throw 3 free-throws, what is the probability that I will miss all three? Hit 1? Hit 2? Hit 3? Confidence Intervals about a Population Proportion • In our problems dealing with binomial probabilities, we have been given p. But what if we would like to estimate p with a known level of confidence? 3 4 Point Estimate of a Population Proportion Point Estimate of a Population Variance • A point estimate is a single value used to • Given that the variance for a binomial approximate a population parameter. distribution is defined as σ²=npq, where q = 1-p, and that p-hat is the best estimate for the population • The best point estimate ("p-hat") of a population parameter p, it follows then that the best estimate proportion (p) is the sample proportion. for the population variance is: successes pˆ = trials σˆ 2 = npˆ qˆ where qˆ = 1 − pˆ • The use of a carat symbol (^) over a letter is read as "hat", and indicates it is an estimated value. 5 Updated 3/15/2014 6 Qualls - 6. Confidence Interval of Population Proportion Interval Estimates The Problem with Point Estimates • If I sink 4 free-throws out of 10, then my point estimate for p is .4. • We can, however, assign a level of confidence to an interval estimate. • Likewise, if I sink 40 free-throws out of 100, then my point estimate for p is still .4. • We would intuitively have more confidence in the second statistic than in the first. • If you were asked to come up with a 95% confidence interval for the first case (4 free-throws out of 10), you might say you were 95% confident that the true proportion is between .3 and .5. • But these are both point estimates, and the problem with a point estimate is that we cannot assign any statistical level of confidence to it. • But in the second case (40 free-throws out of 100), you might say you were 95% confident that the true proportion is between .35 and .45. (Numbers used above are "guesses" only, for illustrative purposes.) 7 CI for Population Proportion 8 90% Confidence Interval • The formula for the confidence interval (CI) for a population proportion is usually shown as: p = pˆ ± zα / 2 pˆ qˆ n • Some texts prefer the notation: p = pˆ ± E where E is the margin of error and is calculated as: E = zα / 2 pˆ qˆ n • These formulas require np ≥ 15 and nq ≥ 15 (or else the distribution is too skewed; not normal.) 9 95% Confidence Interval 10 99% Confidence Interval 11 Updated 3/15/2014 12 Qualls - 6. Confidence Interval of Population Proportion Together Calculating Confidence Intervals • I attempt 100 free throws, and make a basket 40 times. Calculate a 95% confidence interval for my true free throw percentage. • Solution: pˆ qˆ n (.4)(.6) = .4 ± 1.96 100 = .4 ± .096 p = pˆ ± zα / 2 = [.304, .496] 13 14 Interpretation Interpretation So what does it mean? A miss like this will occur 5% of the time. Wrong: We are 95% confident that the true population proportion is between .304 and .496. Correct: If the sampling process were repeated many times, and the interval calculated each time, 95% of those intervals would capture the true population proportion. 15 16 Using the TI-83 Plus Together • Press [STAT] [TESTS] [1-PropZInt] In a survey of 1002 people, 701 said that they voted in a recent presidential election (based on data from ICR Research Group). Voting records show that 61% of eligible voters actually did vote. a. Find a 99% confidence interval estimate of the proportion of people who say that they voted. b. Are the survey results consistent with the actual voter turnout of 61%? Why or why not? • These are always "z", never "t". • Careful! Don't choose 1-PropZTest (yet). (Source: Triola, Page 333, Section 7-2, #34) 17 Updated 3/15/2014 18 Qualls - 6. Confidence Interval of Population Proportion Together Margin of Error Assume that a sample is used to estimate the population proportion p. Find the margin of error E that corresponds to the given statistics and confidence level: n = 1200, x = 800, 99% confidence. Given a confidence interval of [0.25, 0.39]. • What is p-hat? (Answer: 0.32) • What is the margin of error? (Answer: 0.07) E E 0.25 0.39 • What is the margin of error for the previous problem? (Source: Triola, Page 333, Section 7-2, #18) 20 19 Together Find the margin of error: Determining the Proper Sample Size 21 22 Sample Size Together • How large does sample need to be to get an estimate of p, with an acceptable margin of error? • My earlier attempts indicate that my free throw percentage is around 40%. But I would like a more narrow confidence interval than the ±9.6% I got with n=100. How many free throws should I attempt in order to get a 95% confidence interval with a 3% margin of error? E = zα / 2 pˆ qˆ [z ]2 pˆ qˆ → solve for n → n = α / 2 2 n E • In the above formula, E might be, for example, .03 for a 3% margin of error. • If no prior estimate of p is known then use .5 as .5 will always give you the maximum sample size. 23 Updated 3/15/2014 24 Qualls - 6. Confidence Interval of Population Proportion What about Population Size? Together "Many people incorrectly believe that the sample size should be some percentage of the population, but (the above formula) shows that the population size is irrelevant. (In reality, the population size is sometimes used, but only in cases in which we sample without replacement from a relatively small population.) Polls commonly use sample sizes in the range of 1000 to 2000 and, even though such polls may involve a very small percentage of the total population, they can provide results that are quite good." (Triola, page 330) Use the given data to find the minimum sample size required to estimate a population proportion or percentage. Margin of error: four percentage points; confidence level: 95%; no prior estimate of p-hat is available. 25 Together 26 Effect of Sample Size on C.I. Width Toyota provides an option of a sunroof and side air bag package for its Corolla model. This package costs $1400 ($1159 invoice price). Assume that prior to offering this option package, Toyota wants to determine the percentage of Corolla buyers who would pay $1400 extra for the sunroof and side air bags. How many Corolla buyers must be surveyed if we want to be 95% confident that the sample percentage is within four percentage points of the true percentage for all Corolla buyers? (Source: Triola, Page 333, Section 7-2, #44) Do parts of this problem sound familiar ? ? ? 27 Gut check Estimate of margin of error given sample size: E≈ 1 n Estimate of sample size for given margin of error: n≈ 1 E2 29 Updated 3/15/2014 28