Hypothesis Tests – 1 sample Means 17. TV safety. The manufacturer

Transcription

Hypothesis Tests – 1 sample Means 17. TV safety. The manufacturer
Hypothesis Tests – 1 sample Means
17. TV safety. The manufacturer of a metal stand for home
TV sets must be sure that its product will not fail under
the weight of the TV. Since some larger sets weigh
nearly 300 pounds, the company's safety inspectors
have set a standard of ensuring that the stands can
support an average of at least 500 pounds. Their
inspectors regularly subject a random sample of the
stands to increasing weight until they fail. They test the
hypothesis H0:  = 500 against HA:  < 500, using the
level of significance  = .01. If the sample of stands
fail to pass this safety test, the inspectors will not
certify the product for sale to the general public.
a) Is this an upper-tail or lower-tail test? In the
context of the problem, why do you think this is
important?
b) Explain what will happen if the inspectors commit
a Type I error.
c) Explain what will happen if the inspectors commit
a Type II error.
18. Catheters. During an angiogram, heart problems can
be examined via a small tube (a catheter) threaded into
the heart from a vein in the patient's leg. It's important
that the company who manufactures the catheter
maintain a diameter of 2.00 mm. (The standard
deviation is quite small.) Each day, quality control
personnel make several measurements to test H0:  =
2.00 against HA:   2.00 at a significance level of  =
.05. If they discover a problem, they will stop the
manufacturing process until it is corrected.
a) Is this a one-sided or two-sided test? In the context
of the problem, why do you think this is important?
b) Explain in this context what happens if the quality
control people commit a Type I error.
c) Explain in this context what happens if the quality
control people commit a Type II error.
19. TV safety revisited. The manufacturer of the metal TV
stands in Exercise 17 is thinking of revising its safety
test.
a) If the company's lawyers are worried about being
sued for selling an unsafe product, should they
increase or decrease the value of ? Explain.
b) In this context, what is meant by the power of the
test?
c) If the company wants to increase the power of the
test, what options does it have? Explain the
advantages and disadvantages of each option.
20. Catheters again. The catheter company in Exercise 18
is reviewing its testing procedure.
a) Suppose the significance level is changed to  =
.01. Will the probability of Type II error increase,
decrease, or remain the same?
b) What is meant by the power of the test the
company conducts?
c) Suppose the manufacturing process is slipping out
of proper adjustment. As the actual mean diameter
of the catheters produced gets farther and farther
above the desired 2.00 mm, will the power of the
quality control test increase, decrease, or remain
the same?
d) What could they do to improve the power of the
test?
21. Marriage. In 1960, census results indicated that the
age at which American men first married had a mean
of 23.3 years. It is widely suspected that young people
today are waiting longer to get married. We want to
find out if the mean age of first marriage has increased
during the past 40 years.
a) Write appropriate hypotheses.
b) We plan to test our hypothesis by selecting a
random sample of 40 men who married for the first
time last year. Do you think the necessary
assumptions for inference are satisfied? Explain.
c) Describe the approximate sampling distribution
model for the mean age in such samples.
d) The men in our sample married at an average age
of 24.2 years with a standard deviation of 5.3
years. What's the P-value for this result?
e) Explain (in context) what this P-value means.
f) What's your conclusion?
22. Fuel economy. A company with a large fleet of cars
hopes to keep gasoline costs down, and sets a goal of
attaining a fleet average of at least 26 miles per gallon.
To see if the goal is being met, they check the gasoline
usage for 50 company trips chosen at random, finding
a mean of 25.02 mpg and a standard deviation of 4.83
mpg. Is this strong evidence that they have failed to
attain their fuel economy goal?
a) Write appropriate hypotheses.
b) Are the necessary assumptions to perform
inference satisfied?
c) Describe the sampling distribution model of mean
fuel economy for samples like this.
d) Find the P-value.
e) Explain what the P-value means in this context.
f) State an appropriate conclusion.
25. Cars. One of the important factors in auto safety is the
weight of the vehicle. Insurance companies are
interested in knowing the average weight of cars
currently licensed in the United States; they believe it
is 3000 pounds. To see if that estimate is correct, they
checked a random sample of 91 cars. For that group the
mean weight was 2919 pounds, with a standard
deviation of 531.5 pounds. Is this strong evidence that
the mean weight of all cars is not 3000 pounds?
26. Portable phones. A manufacturer claims that a new
design for a portable phone has increased the range to
150 feet, allowing many customers to use the phone
throughout their homes and yards. An independent
testing laboratory found that a random sample of 44 of
these phones worked over an average distance of 142
feet, with a standard deviation of 12 feet. Is there
evidence that the manufacturer's claim is false?
27. Chips ahoy. In 1998, as an advertising campaign, the
Nabisco Company announced a "1000 Chips
Challenge," claiming that every 18-ounce bag of their
Chips Ahoy cookies contained 1000 chocolate chips.
Dedicated Statistics students at the Air Force Academy
(no kidding) purchased some randomly selected bags
of cookies, and counted the chocolate chips. Some of
their data are given below. (Chance, 12, no. 1 [1999])
1219
1244
1214
1258
1087
1356
1200
1132
1419
1191
1121
1270
1325
1295
1345
1135
a) Check the assumptions and conditions for
inference. Comment on any concerns you have.
b) Create a 95% confidence interval for the average
number of chips in bags of Chips Ahoy cookies.
c) What does this evidence say about Nabisco's
claim? Use your confidence interval to test an
appropriate hypothesis and state your conclusion.
28. Yogurt. Consumer Reports tested 14 brands of vanilla
yogurt and found the following numbers of calories per
serving:
160 130 200 170 220 190 230
80 120 120 180 100 140 170
a) Check the assumptions and conditions for
inference.
b) Create a 95% confidence interval for the average
calorie content of vanilla yogurt.
c) A diet guide claims that you will get 120 calories
from a serving of vanilla yogurt. What does this
evidence indicate? Use your confidence interval to
test an appropriate hypothesis and state your
conclusion.
29. Maze. Psychology experiments sometimes involve
testing the ability of rats to navigate mazes. The mazes
are classified according to difficulty, as measured by
the mean length of time it takes rats to find the food at
the end. One researcher needs a maze that will take rats
an aver age of about one minute to solve. He tests one
maze on several rats, collecting the data at the right.
a) Plot the data. Do you think the conditions for
inference are satisfied? Explain.
b) Test the hypothesis that the mean completion time
for this maze is 60 seconds. What is your
conclusion?
c) Eliminate the outlier, and test the hypothesis again.
What is your conclusion?
d) Do you think this maze meets the "one-minute
average" requirement? Explain.
Time
(sec)
38.4
57.6
46.2
55.5
62.5
49.5
38.0
40.9
62.8
44.3
33.9
93.8
50.4
47.9
35.0
69.2
52.8
46.2
60.1
56.3
55.1
30. Braking. A tire manufacturer is considering a newly
designed tread pattern for its all-weather tires. Tests
have indicated that these tires will provide better gas
mileage and longer treadlife. The last remaining test is
for braking effectiveness. The company hopes the tire
will allow a car traveling at 60 mph to come to a
complete stop within an average of 125 feet after the
brakes are applied. They will adopt the new tread
pattern unless there is strong evidence that the tires do
not meet this objective. The distances (in feet) for 10
stops on a test track were
129,128,130,132,135,123,102,125,128, and 130.
Should the company adopt the new tread pattern? Test
an appropriate hypothesis and state your conclusion.
Explain how you dealt with the outlier, and why you
made the recommendation you did.
Hypothesis Tests – 1 sample Means
Answers:
17. a) Lower-tail. We want to show it will not hold 500
pounds (or more) easily.
b) Reject that the stand can hold at least 500 lbs, when
it can. In other words, They will decide the stands are
unsafe, when they are in fact safe
c) Fail to reject that the stand can only hold at least
500 lbs when it can not. In other words, they will
decide the stands are safe, when they're not.
18. a) Two-sided. If they're too big, they won't fit through
the vein. If they're too small, they probably won't
work well.
b) The catheters are rejected when in fact the
diameters are fine, and the manufacturing process is
needlessly stopped.
c) Catheters that do not meet specifications are
allowed to be produced and sold.
19. a) Increase . This means a larger chance of declaring
the stands safe, if they are not.
b) The probability of correctly detecting that the
stands are not capable of holding more than 500
pounds.
c) Decrease the standard deviation—probably costly.
Increase the sample size—takes more time for testing
and is costly. Increase —more Type I errors.
Increase the "design load" to be well above 500
pounds—again, costly.
20. a) Increase
b) The probability of correctly detecting deviations
from 2 mm in diameter.
c) Increase
d) Increase the sample size or increase a.
21. a) H0:  = 23.3; HA:  > 23.3
b) We have a random sample that comprises less than
10% of the population. Population may not be
normally distributed, as it would be easier to have a
few much older men at their first marriage than some
very young men. However, with a sample size of 40, y
should be approximately Normal. We should check
the histogram for severity of skewness and possible
outliers.
c) t39(23.3, s/ 40 )
d) 0.1414
e) If the average age at first marriage is still 23.3
years, there is a 14.5% chance of getting a sample
mean of 24.2 years or older simply from natural
sampling variation.
f) We have not shown that the average age at first
marriage has increased from the mean of 23.3 years.
22. a) H0:  = 26; HA:  < 26
b) We have a representative sample, less than 10% of
all trips, and a large enough sample that skewness
should not be a problem.
c) t49(25.02, 4.83/ 50 )
d) 0.0789
e) If the average fuel economy is 26 mpg, the chance
of obtaining a sample mean of 25.02 or less by natural
sampling variation is 8%.
f) Since 0.05 < P < 0.10, there is some evidence that
the company may not be achieving the fuel economy
goal.
25. No. Based on this large random sample, t = -1.45; Pvalue = 0.1494. Because the P-value is high, we fail to
reject H0. These data do not provide evidence to
support the claim that the average weight of all cars is
not 3000 pounds. We retain the null hypothesis that
the mean is 3000 pounds.
26. Yes. Based on this large random sample, t = -4.42, Pvalue = 3.29 X 10-5. Because the P-value is low, we
reject H0. These data show that the range of these
phones is not 150 feet; in fact, it is less.
27. a) Random sample; less than 10% of all Chips Ahoy
bags; the nearly Normal condition seems reasonable
from a Normal probability plot. The histogram is
roughly unimodal and symmetric with no outliers.
b) (1187.9, 1288.4) chips
c) Based on this sample, the mean number of chips in
an 18-ounce bag is between 1187.9 and 1288.4, with
95% confidence. The mean number of chips is clearly
greater than 1000. However, if the claim is about
individual bags, then it's not necessarily true. If the
mean is 1188 and the SD deviation is near 94, then
2.5% of the bags will have fewer than 1000 chips
using the Normal model. If in fact the mean is 1288,
the proportion below 1000 will be less than 0.3%, but
the claim is still false.
28. a) Random sample; this is less than 10% of all vanilla
yogurt brands. Nearly Normal condition is reasonable
by examining a Normal probability plot. The
histogram is roughly unimodal (although somewhat
uniform) and symmetric with no outliers.
b) (132.0, 183.7) calories
c) Based on this sample, the mean number of calories
in a serving of vanilla yogurt is between 132 and
183.7, with 95% confidence. We conclude that the
diet guide's claim of 120 calories is too low.
29. a) The Normal probability plot is relatively straight,
with one outlier at 93.8 sec. Without the outlier, the
conditions seem to be met. The histogram is roughly
unimodal and symmetric with no other outliers.
b) t = -2.63, P-value = 0.0160. With the outlier
included, we might conclude that the mean
completion time for the maze is not 60 seconds; in
fact, it is less.
c) t = -4.46, P-value = 0.0003. Because the P-value is
so small, we reject H0. Without the outlier, we
conclude that the average completion time for the
maze is not 60 seconds; in fact, it is less. The outlier
here did not change the conclusion.
d) The maze does not meet the "one-minute average"
requirement. Both tests rejected a null hypothesis of a
mean of 60 seconds.
30. The data value of 102 feet is an outlier. When this is
removed, the Normal probability plot is relatively
straight. The Nearly Normal Condition seems
satisfied. With the outlier removed, the histogram is
roughly unimodal and symmetric with no other
outliers. H0:  = 125; HA:  > 125 feet. With the
outlier eliminated, y = 128.89, t = 3.29, P-value =
0.01. With a P-value this low, we reject H0. There is
strong evidence to suggest that the tires will not bring
the car to a complete stop within 125 feet. On the
basis of these data, the company should not adopt the
new tread pattern. Only 2 out of the 10 data values
were less than the desired 125 feet, and 1 of these was
an outlier.