One Sample Tests of Hypothesis Problems and Answers for Chapter 6
Transcription
One Sample Tests of Hypothesis Problems and Answers for Chapter 6
Problems and Answers for Chapter 6 One Sample Tests of Hypothesis 1. A plant breeder has been selecting Crispin apple trees for many years in an attempt to decrease variability in fruit size. When he began selecting, the variance was 400 g2 for mature fruit. A random sample of 30 apples from his most recent crop had a variance of 300 g2 . Has the breeder been successful in making Crispin apples less variable? What is the expected value of the test statistic here, if H0 is true? Find the P value of this test. 1. No. The hypotheses are H0 : σ 2 ≥ 400 g2 versus Ha : σ 2 < 400 g2 . E(χ2 ) = n − 1 = 29 and the actual χ2 = 21.75, so 0.10 < P < 0.25. The chi-square statistic is not significantly less than expected under H0 . 2. As part of a benthic community survey of Lady Elliot Island, sixteen sea stars, Linckia laevigata, were collected and their longest arm was measured to the nearest tenth of a centimeter. Assume normality for this population. 10.3 12.1 11.0 9.4 10.5 11.3 10.0 12.0 11.3 11.5 14.5 9.3 13.0 10.1 12.1 7.6 (a) For these data calculate the sample mean and variance. (b) Calculate the 95% confidence interval for the population mean. In your own words explain what this confidence interval represents. (c) Use a t test with α = 0.05 to determine whether the mean is significantly different from 12 cm. Discuss the relationship between the confidence interval in (b) above and this two-tailed t test. (d ) Calculate the 95% confidence interval for the population variance. Again, in your own words, explain what this confidence interval means. (e) Use a χ2 test with α = 0.05 to determine whether the variance is significantly different from 5.0 cm2 . Discuss the relationship between the confidence interval in (d ) above and this two-tailed χ2 test. Problems and Answers for Chapter 6. One Sample Tests of Hypothesis 40 2. (a) X = 11.00 cm, s2 = 2.644 cm2 , and n = 16. (b) C(10.13 cm ≤ µ ≤ 11.87 cm) = 0.95. I am 95% confident that the parametric mean µ for arm length lies between 10.13 cm and 11.87 cm (c) The hypotheses are H0 : µ = 12 cm versus Ha : µ = 12 cm. t = −2.46 and the c.v. = ±2.131. Since −2.46 < −2.131, reject H0 The confidence interval in (b) uses ±2.131 √sn to generate limits. Algebraically, the two-tailed test of hypothesis at α and the confidence interval at 1 − α are equivalent. (d ) C(1.442 cm2 ≤ σ 2 ≤ 6.335 cm2 ) = 0.95. I am 95% confident that the parametric variance, σ 2 , for arm length lies between 1.442 cm2 and 6.335 cm2 . (e) The hypotheses are H0 : σ 2 = 5.0 cm2 versus Ha : σ 2 = 5.0 cm2 . Calculate χ2 = 7.932. The c.v.’s = 6.26, 27.5. Since 6.26 < 7.932 < 27.5, accept H0 . The variance cannot be shown to be significantly different from 5.0 cm2 . Again algebraically, the two-tailed test of hypothesis at α and the confidence interval at 1 − α are equivalent. The sample variance of 5.0 cm2 lies within the confidence limits at 95%. 3. In healthy adult males the sciatic nerve conduction velocity values are normally distributed with a mean of 65 cm/msec and a standard deviation of 5 cm/msec. The conduction velocities of 16 subjects admitted to the poison control center of a metropolitan hospital with a diagnosis of methylmercury poisoning had a mean conduction velocity of 55 cm/msec and a variance of 49 (cm/msec)2 . (a) Do these data provide sufficient evidence to indicate that the conduction velocities are significantly slower in the poisoned individuals? (b) Do these data provide sufficient evidence to indicate that poisoned individuals are more variable in their sciatic nerve conduction velocities than normal individuals? 3. (a) The hypotheses are H0 : µ ≥ 65 cm/msec versus Ha : µ < 65 cm/msec. Using α = 0.05, the c.v. = −1.753. But t = −5.71. Since −5.71 < −1.753, reject H0 . Or note that the P value < 0.0005. The average conduction velocity in poisoned individuals is significantly slower than in normal individuals. (b) The hypotheses are H0 : σ 2 ≤ 25 (cm/msec)2 versus Ha : σ 2 > 25 (cm/msec)2 . Using α = 0.05, the c.v. = 25.0. But χ2 = 29.4 (and 0.01 < P value < 0.025). Since 29.4 > 25.0, reject H0 . Conduction velocities are significantly more variable in individuals who have methylmercury poisoning. 4. A television documentary on overeating claimed that adult American males are on average 15 lb overweight. To test this claim, 25 randomly selected adult males were examined, and the average excess weight was found to be 18 lb with a standard deviation of 10 lb. Do these data give us reason to believe the claim of 15 lb is incorrect? What type of error could you have made here? Explain. 4. The hypotheses are H0 : µ = 15 lb versus Ha : µ = 15 lb. Use α = 0.05 with c.v.’s = ±2.064. Calculate t = 1.5 (and 0.10 < P value < 0.20). Since −2.064 < 1.5 < 2.064, accept H0 . The sample mean is not sufficiently different from the claim of 15 lb to reject H0 . By accepting H0 we may be making a Type II Error—accepting a false null hypothesis. Probability of this type of error is unknowable. Problems and Answers for Chapter 6. One Sample Tests of Hypothesis 41 5. Before purchasing a large corn farm you wish to determine its profitability by analyzing its productivity. One claim made by the present owner is that the fields are very uniform in their yield per acre. In fact, it is claimed that the standard deviation in bushels per acre of corn yield is 5. A random sample of 25 one-acre plots yields an average of 140 bushels/acre and a standard deviation of 6.5 bushels/acre. Are the fields significantly more variable than is claimed? What type of error could you have made here? Explain. 5. The hypotheses are H0 : σ 2 ≤ 25 (bushels/acre)2 versus Ha : σ 2 > 25 (bushels/acre)2 . Here n = 25, and s = 6.5 bushels/acre. Using α = 0.05, the c.v. = 36.4. But χ2 = 40.56. Since 40.56 > 36.4, reject H0 . Evidence suggests the yields are more variable than claimed. Type I error is possible here. 0.01 < P value < 0.025. 6. The data below represent the systolic blood pressures (in mmHg) of 14 patients undergoing drug therapy for hypertension. Assuming normality of systolic blood pressures, on the basis of these data can you conclude that the mean is significantly less than 165 mmHg? 183 194 152 163 178 114 157 178 194 152 163 118 144 158 6. The hypotheses are H0 : µ ≥ 165 mmHg versus Ha : µ < 165 mmHg. t = −0.671 and the P value = 0.2550. So accept H0 ; we cannot conclude the mean systolic blood pressure is significantly less than 165 mmHg. 7. Analysis of the amniotic fluid from a random sample of 16 pregnant women early in gestation yielded the following measurements of uric acid present (in mg/100 ml). Do these data provide evidence to indicate that the population variance is greater than 0.92 (mg/100 ml)2 ? 4.2 4.7 3.3 4.7 3.5 3.0 3.1 4.8 3.8 5.5 3.3 4.0 3.5 3.6 4.0 7.6 7. The hypotheses are H0 : σ 2 ≤ 0.92 (mg/100 ml)2 versus Ha : σ 2 > 0.92 (mg/100 ml)2 . s2 = 1.15 (mg/100 ml)2 . χ2 = 21.67 and the P value = 0.117 > α = 0.05. Accept H0 , there is not sufficient evidence to claim the variance is greater than 0.92 (mg/100 ml)2 8. In one population of patients, the mean hemoglobin level is 13.5 g/100 ml of whole blood. The standard deviation of this variable is 1.4 g/100 ml of whole blood. A series of nine tests on a patient revealed a sample standard deviation of 2.1 g/100 ml of whole blood. Is the hemoglobin level in this patient significantly more variable than normal? 8. The hypotheses are H0 : σ 2 ≤ 1.4 (g/100 ml])2 versus Ha : σ 2 > 1.4 (g/100 ml)2 . Here s = 2.1 g/100 ml and n = 9. Testing at α = 0.05, the c.v. = 15.5. But χ2 = 18.0. Since 18.0 > 15.5, reject H0 . The patient’s hemoglobin level is significantly more variable than normal. 9. A geneticist, studying flower color in Mirabilis jalapa, the four-o’clock flower, crossed pink and white individuals together and predicted 50% white-flowered offspring and 50% pinkflowered offspring. Of the 100 seeds collected and germinated, 60 resulted in pink-flowered plants and 40 resulted in white-flowered plants. Was this result significantly different from the expectation? (Use a normal approximation here.) Problems and Answers for Chapter 6. One Sample Tests of Hypothesis 42 9. The hypotheses are H0 : p = 0.50 versus Ha : p = 0.50. From the Z distribution, the c.v. = 60 ±1.960. pˆ = 100 = 0.60. 0.10 0.60 − 0.50 = Z= = 2.0. 0.05 0.50(1−0.50) 100 Since 2.0 > 1.960, reject H0 . The result is significantly different from the expectation. 10. Recently there have been concerns about the effects of phthalates on the development of the male reproductive system. Phthalates are common ingredients in many plastics. In a pilot study a researcher gave pregnant rats daily doses of 750 mg/kg of body weight of DEHP (di-2ethylhexyl phthalate) throughout the period when their pups’ sexual organs were developing. The newly born male rat pups were sacrificed and their seminal vesicles were dissected and weighed. Below are the weights for the eight males (in mg). 1710 1630 1580 1670 1350 1650 1600 1650 If untreated newborn males have a mean of 1700 mg, can you say that rats exposed to DHEP in utero have a significantly lower weight? 10. The hypotheses are H0 : µ ≥ 1700 mg versus Ha : µ < 1700 mg. For a t test, the c.v. = −1.895. Calculate that t = −2.44. Since −2.44 < −1.895, reject H0 . Exposure to DEHP significantly decreases seminal vesicle weight. 11. Suppose that for years an investigator has used a stock of inbred rats whose weights have σ = 30 g. He is considering switching to a cheaper source of supply of rats, except he suspects that the new rats will show greater variability in weight. Measurement of a random sample of 20 rats from the cheaper source yielded a corrected sum of squares of 30,400 and a standard deviation of 40 g. Statistically confirm or refute the investigator’s suspicions. 11. The hypotheses are H0 : σ 2 ≤ 900g2 versus Ha : σ 2 > 900g2 . At α = 0.05, the c.v. = 30.1. χ2 = 33.78, so reject H0 . There is statistical evidence that the new rats are more variable than the old stock. 12. (a) The ancient Greeks used the term “golden rectangle” to describe those rectangles whose √ 1+ 5 height-to-width ratio is 1 : 2 ≈ 0.618034. They found such rectangles especially pleasing to the eye and often used them in architectural design (e.g., the Parthenon). Other artists and artisans, including musicians, have made explicit use of this ratio in their work. The ratio, unbelievably, arises in biological growth processes. For example, there are claims that the ratio of navel height-to-total height is this same golden ratio. The data below are breadth-to-length ratios of beaded rectangles used by the Shoshoni Indians to decorate leather goods. Is it reasonable to assume that they are implicitly using golden rectangles? Use the sign test here. (HSDS, #150) 0.693 0.749 0.654 0.670 0.662 0.672 0.615 0.606 0.690 0.628 0.668 0.611 0.606 0.609 0.601 0.553 0.570 0.844 0.576 0.933 (b) Find a 95% confidence interval for the median height-to-width ratio. Problems and Answers for Chapter 6. One Sample Tests of Hypothesis 43 12. (a) The hypotheses are H0 : M = 0.618034 versus Ha : M = 0.618034. Use the sign test. S− = 9, S+ = 11, so the test statistic is S− = 9 for the two-tailed test. F (9) = 0.4119 from Table C.1. Since 0.4119 > 0.025 accept H0 . Shoshoni Indians appear to be using the golden rectangle in their bead work. (b) A 95% confidence interval is [0.606, 0.672]. 13. The wall skink, Cryptoblepharus virgatus, is a common insectivore in the Brisbane area. They can be seen sunning themselves on vertical surfaces such as the brick walls of buildings. They can also be heard scurrying into the leaf litter as one walks along a path. Suppose we suspect that their median length (including tail) is 7 cm. Test the hypothesis H0 : M = 7 cm versus Ha : M = 7 cm at the 0.05 level. The data consist of a random sample of 66 skinks that were captured and measured: 41 have length greater than 7 cm, 23 are shorter than 7 cm, and 2 have length equal to 7 cm. 13. The hypotheses are H0 : M = 7 cm versus Ha : M = 7 cm. There are n = 66 observations with 41 larger than 7 cm, 23 shorter than 7 cm, 2 equal to 7 cm. Use the sign test with n = 64 usable observations. S− = 23, S+ = 41, so the test statistic for a two-tailed test is S− = 23. Test with α = 0.05. Use a normal approximation with µ = F (23) = P (S ≤ 23) = P 23.5 − 32 Z≤ 4 n 2 = 32 and σ = 64 4 . = P (Z ≤ −2.13) = 0.0166. Since 0.0166 < 0.025, reject H0 . Evidence suggests the median is different (larger) than 7 cm. 14. (a) The angle that a spider’s web makes with a line vertical to the earth’s surface varies by species. For example, for Isoxya cicatricosa this angle has been shown to have a median of 28.12◦ . The angles for ten webs were observed. Based on these data can you reject the hypothesis that I. cicatricosa constructed these webs? (HSDS, #194) 25◦ 12◦ 31◦ 26◦ 17◦ 15◦ 24◦ 10◦ 16◦ 12◦ (b) Find a 95% confidence interval for the median angle. 14. (a) The hypotheses are H0 : M = 28.12◦ versus Ha : M = 28.12◦ . Test with α = 0.05. S− = 9, S+ = 1, so the test statistic for a two-tailed test is S+ = 1. F (1) = 0.0107 from Table C.1. Since 0.0107 < 0.025, reject H0 . The webs don’t appear to be made by I. cicatricosa. (b) A 95% confidence interval is [12, 26]. 15. These data give pulse rates (in beats/minute) of 39 Peruvian Indians. Construct a 95% confidence interval for the median pulse rate. (HSDS, #345) 88 68 72 64 √ 76 68 88 60 84 72 80 60 64 76 76 72 60 72 64 92 64 52 72 80 60 72 60 72 64 64 76 64 68 60 88 68 74 56 72 n = 12.88 ≈ 13. A 95% confidence interval is [64, 72]. More conservatively, with 15. d = n−1−1.96 2 d = 12, the interval is [64, 74]. Problems and Answers for Chapter 6. One Sample Tests of Hypothesis 44 16. A Secchi disc (see figure below) is a simple but effective device to measure water transparency. The disc is attached to a rope and slowly lowered through the water until it just disappears from view. This depth is noted. It is lowered some more, then slowly raised until the disc is just observed. This depth is also noted. The Secchi depth is the average of these two depths. The depth is proportional to water transparency and inversely proportional to water turbidity, but it is also influenced by cloudiness, position of the sun, roughness of the water, and observer bias. Plankton biomass and suspended mud particles are the main contributors to water turbidity. In Seneca Lake (New York), the depth is primarily a function of plankton biomass. In 1997, the reported median Secchi depth at a station at the north end of the lake was 8.0 m. The following depths were recorded on 16 similar dates in 1999. The question is whether there has been a change in Secchi depth since 1997. Use a sign test. (Based on data reported in John Halfman, http://people.hws.edu/Halfman/Zebras/Zebras.htm, July, 2000.) ............. ....... .................................................................. ........................................... .... ...................................................... ... .................................................................................................................. ........................................................................ .. .. .................................................................. ..................................................... ....... ............................... ......... .... 9.0 9.0 10.5 8.0 9.2 7.0 8.3 6.0 7.1 4.7 7.8 6.0 7.9 6.5 8.3 7.4 16. The hypotheses are H0 : M = 8 m versus Ha : M = 8 m. Notice that n = 15 since one of the observations equals 8.0 m. The test statistic is the minimum of S+ = 6 and S− = 9. F (6) = 0.3036 and since this is a two-tailed test, the P value is 0.6072. So we accept H0 . There does not appear to be a change in the median Secchi depth from 1997. 17. The fasting blood glucose levels of nine women who had been using oral contraceptives for at least six months prior to sampling are listed below. It is thought that the mean fasting blood glucose level in women not using contraceptives is 3.8 mmol/l. Test whether there is evidence that the use of contraceptives raises glucose levels. Assume symmetry. 4.71 3.61 4.92 4.85 4.66 3.92 5.10 3.55 4.78 17. The hypotheses are H0 : M ≤ 3.8 mmol/L versus Ha : M > 3.8 mmol/L. Use the Wilcoxon signed-rank test with n = 9. The test statistic is W− = 5 (see table below). F (5) = 0.0195 so reject H0 . Data support the hypothesis that median fasting blood glucose level exceeds 3.8 mmol/L. Xi − 3.8 Signed rank 0.91 5 −0.19 −2 1.12 8 1.05 7 0.86 4 0.12 1 1.30 9 −0.25 −3 0.98 6 18. The carpet python, Morelia spilota variegata, is widely distributed and is common in rainforests, wet or dry eucalypt forests, and pastoral and urban areas. They locate warm-blooded prey by means of heat-sensory pits located along the lower jaw that can detect temperature changes of less than one-thirtieth of a degree (◦ C). Their mean length is approximately 250 cm. Because of an extended drought from 1994 through early 1996 at Lamington National Park, it was thought that the mean lengths of these snakes would be smaller than 250 cm. A survey was conducted and the lengths (in cm) below were recorded. Clearly state the null and alternative hypotheses. Assuming that lengths of snakes are symmetrically distributed about the mean, carry out an appropriate test of the hypothesis at the 0.05 level. Interpret the result. Problems and Answers for Chapter 6. One Sample Tests of Hypothesis 269 221 204 258 211 198 276 223 45 207 242 18. The hypotheses are H0 : M ≥ 250 cm versus Ha : M < 250 cm. Use the Wilcoxon signed rank test with n = 10 and α = 0.05. The test statistic is W+ = 8.5 (see below). The corresponding P value from Table C.6 is between 0.0244 and 0.0322. So P < 0.05 = α, reject H0 . The median is significantly less than 250 cm. Xi Xi − 250 |Xi − 250| Signed rank 269 19 19 3 221 −29 29 −6 204 −46 46 −9 258 8 8 1.5 211 −39 39 −7 198 −52 52 −10 276 26 26 4 223 −27 27 −5 207 −43 43 −8 242 −8 8 −1.5 19. The following measurements of wind velocities (in km/hr) during 20 consecutive 15 minute intervals (starting at 1430) were monitored during a thunderstorm on 9 July 2000 in Geneva, NY. Test whether the median wind velocity significantly exceeded 30 km/hr. (Based on data from Robert Seem, New York State Agricultural Experiment Station, August, 2000, Personal communication.) 24.6 38.5 27.5 36.7 39.6 34.4 42.5 33.3 41.8 33.3 47.2 36.7 52.3 33.3 44.7 29.3 37.3 28.6 42.5 24.1 19. The hypotheses are H0 : M ≤ 30 km/hr versus Ha : M > 30 km/hr. Use the sign test with n = 20. The test statistic is S− = 5. F (5) = 0.0059. Since 0.0059 < 0.05, reject H0 . Data support the hypothesis that wind velocity exceeded 30 km/hr. 20. Create a table giving the density and cumulative distribution function for W+ for a sample size of n = 4. Use the same techniques that were used for a sample size of n = 5 in Section 6.6. 20. The density and cumulative distribution function for W+ for n = 4 are W P (W = w) P (W ≤ w) 0 1 2 3 4 5 6 7 8 9 10 1 16 1 16 1 16 2 16 1 16 3 16 2 16 5 16 2 16 7 16 2 16 9 16 2 16 11 16 2 16 13 16 1 16 14 16 1 16 15 16 1 16 16 16 which are derived from 42 = 16 possible signed rankings below. Problems and Answers for Chapter 6. One Sample Tests of Hypothesis Ranking −1 +1 −1 −1 −1 +1 +1 +1 −1 −1 −1 +1 +1 +1 −1 +1 −2 −2 +2 −2 −2 +2 −2 −2 +2 +2 −2 +2 +2 −2 +2 +2 46 W+ −3 −3 −3 +3 −3 −3 +3 −3 +3 −3 +3 +3 −3 +3 +3 +3 −4 −4 −4 −4 +4 −4 −4 +4 −4 +4 +4 −4 +4 +4 +4 +4 0 1 2 3 4 3 4 5 5 6 7 6 7 8 9 10 21. Bornstein and Bornstein (see “The pace of life,” Nature, 1976, 259: 556–558) proposed a correlation between the population of a city or town and the pace at which people walk in that location. For a particular sized town their data predict a time of 13.9 s to walk 50 ft. To investigate this prediction the paces of 16 randomly chosen people were measured. Making no special assumptions about the distribution of individual paces, set up the appropriate hypotheses and determine whether or not the null hypothesis should be rejected at α = 0.05. 8.4 11.6 9.8 11.9 10.3 12.4 10.6 12.5 10.8 13.9 11.0 14.0 11.1 15.4 11.5 15.7 21. The hypotheses are H0 : M = 13.9 s versus Ha : M = 13.9 s. Use the sign test with n = 15. S− = 12 and S+ = 3. F (3) = 0.0176. Since 0.0176 < 0.025, reject H0 . Data indicate a significant difference from 13.9 s. 22. Redo Problem 4 in Chapter 1 as a test of hypothesis question. 22. The hypotheses are H0 : µ ≤ 12.5 yr versus Ha : µ > 12.5 yr. Use a t test with α = 0.05. Calculate that X = 14.75 yr, n = 10, and s = 0.84 yr. The c.v. = 1.833. t= X −µ √s n = 14.75 − 12.5 0.84 √ 10 = 2.25 = 8.33. 0.27 Since 8.33 > 1.833, reject H0 . Menarche is significantly later in world-class swimmers. 23. Redo Problem 6 in Chapter 1 as a test of hypothesis question. Problems and Answers for Chapter 6. One Sample Tests of Hypothesis 47 23. The hypotheses are H0 : µ ≤ 24 hours versus Ha : µ > 24 hours. For a t test with α = 0.05. Calculate that X = 14.75 yr, n = 15, and s2 = 0.849 hr2 . The c.v. = 1.761. t= X −µ √s n = 25.87 − 24 0.92 √ 15 = 1.87 = 7.79. 0.24 Since 7.79 > 1.761, reject H0 . The average day for bunkered people is significantly longer than 24 hours. 24. (a) The fasting blood glucose (FBG) levels of a population of women is thought to have a standard deviation of 0.75 mmol/l. A study is being designed to determine whether FBG levels are increased in women who have taken oral contraceptives for at least six months. How large a sample is required to detect a 0.50 mmol/l increase if α = 0.05 and the power is 0.95? (b) The researchers also wish to determine whether oral contraceptives have any effect on cholesterol levels. If the standard deviation for cholesterol is thought to be 0.80 mmol/l, how large a sample is required to detect a 0.30 mmol/l difference if α = 0.05 and the power is 0.95? 24. (a) Assuming σ = 0.75 mmol/l, σ 0.75 = µ0 − µ 0.50 = 1.5 Utilize Table 6.1 to find the minimum required sample size of 25 for this one-sided test. (b) Assuming σ = 0.80 mmol/l, σ 0.80 = µ0 − µ 0.30 = 2.67. To be conservative, round this value up to 2.70. Utilize Table 6.1 to find the minimum required sample size of 95 for this two-sided test. 25. A machine used to produce individual serving packages of ketchup is normally run at 75 units/minute and at that speed the σ = 0.12 g. The production manager is considering raising the production speed to the maximum of 100 units/minute, but believes the servings will be more variable at that production rate. Measurement of a random sample of 25 serving packages produced at the higher rate yielded a standard deviation of 0.15 g. Statistically confirm or refute the production manager’s suspicions. 25. H0 : σ 2 = 0.0144 g2 versus Ha : σ 2 > 0.0144 g2 . At α = 0.05, the c.v. = 36.4. χ2 = 37.5, so reject H0 . There is statistical evidence that running the machine at its maximum capacity will cause the serving packages to be more variable.