Transcript Chapter 10
Chapter 10
Chi-Square Tests and the FDistribution
Larson/Farber 4th ed
1
Chapter Outline
•
•
•
•
10.1 Goodness of Fit
10.2 Independence
10.3 Comparing Two Variances
10.4 Analysis of Variance
Larson/Farber 4th ed
2
Section 10.1
Goodness of Fit
Larson/Farber 4th ed
3
Section 10.1 Objectives
• Use the chi-square distribution to test whether a
frequency distribution fits a claimed distribution
Larson/Farber 4th ed
4
Multinomial Experiments
Multinomial experiment
• A probability experiment consisting of a fixed
number of trials in which there are more than two
possible outcomes for each independent trial.
• A binomial experiment had only two possible
outcomes.
• The probability for each outcome is fixed and each
outcome is classified into categories.
Larson/Farber 4th ed
5
Multinomial Experiments
Example:
• A radio station claims that the distribution of music
preferences for listeners in the broadcast region is as
shown below.
Distribution of music Preferences
Classical
4% Oldies
2%
Country
36% Pop
18%
Gospel
11% Rock
29%
Each outcome is
classified into
categories.
Larson/Farber 4th ed
The probability for
each possible
outcome is fixed.
6
Chi-Square Goodness-of-Fit Test
Chi-Square Goodness-of-Fit Test
• Used to test whether a frequency distribution fits an
expected distribution.
• The null hypothesis states that the frequency
distribution fits the specified distribution.
• The alternative hypothesis states that the frequency
distribution does not fit the specified distribution.
Larson/Farber 4th ed
7
Chi-Square Goodness-of-Fit Test
Example:
• To test the radio station’s claim, the executive can
perform a chi-square goodness-of-fit test using the
following hypotheses.
H0: The distribution of music preferences in the
broadcast region is 4% classical, 36% country,
11% gospel, 2% oldies, 18% pop, and 29% rock.
(claim)
Ha: The distribution of music preferences differs from
the claimed or expected distribution.
Larson/Farber 4th ed
8
Chi-Square Goodness-of-Fit Test
• To calculate the test statistic for the chi-square
goodness-of-fit test, the observed frequencies and the
expected frequencies are used.
• The observed frequency O of a category is the
frequency for the category observed in the sample
data.
Larson/Farber 4th ed
9
Chi-Square Goodness-of-Fit Test
• The expected frequency E of a category is the
calculated frequency for the category.
Expected frequencies are obtained assuming the
specified (or hypothesized) distribution. The
expected frequency for the ith category is
Ei = npi
where n is the number of trials (the sample size)
and pi is the assumed probability of the ith
category.
Larson/Farber 4th ed
10
Example: Finding Observed and
Expected Frequencies
A marketing executive randomly
selects 500 radio music listeners
from the broadcast region and asks
each whether he or she prefers
classical, country, gospel, oldies,
pop, or rock music. The results are
shown at the right. Find the
observed frequencies and the
expected frequencies for each type
of music.
Larson/Farber 4th ed
Survey results
(n = 500)
Classical
8
Country
210
Gospel
72
Oldies
10
Pop
75
Rock
125
11
Solution: Finding Observed and
Expected Frequencies
Observed frequency: The number of radio music
listeners naming a particular type of music
Survey results
(n = 500)
Classical
8
Country
210
Gospel
72
Oldies
10
Pop
75
Rock
125
Larson/Farber 4th ed
observed frequency
12
Solution: Finding Observed and
Expected Frequencies
Expected Frequency: Ei = npi
Type of
music
Classical
Country
Gospel
Oldies
Pop
Rock
Larson/Farber 4th ed
% of
listeners
4%
36%
11%
2%
18%
29%
Observed
frequency
8
210
72
10
75
125
n = 500
Expected
frequency
500(0.04) = 20
500(0.36) = 180
500(0.11) = 55
500(0.02) = 10
500(0.18) = 90
500(0.29) = 145
13
Chi-Square Goodness-of-Fit Test
For the chi-square goodness-of-fit test to be used, the
following must be true.
1. The observed frequencies must be obtained by using
a random sample.
2. Each expected frequency must be greater than or
equal to 5.
Larson/Farber 4th ed
14
Chi-Square Goodness-of-Fit Test
• If these conditions are satisfied, then the sampling
distribution for the goodness-of-fit test is approximated
by a chi-square distribution with k – 1 degrees of
freedom, where k is the number of categories.
• The test statistic for the chi-square goodness-of-fit test is
2
(
O
E
)
2
E
The test is always
a right-tailed test.
where O represents the observed frequency of each
category and E represents the expected frequency of each
category.
Larson/Farber 4th ed
15
Chi-Square Goodness-of-Fit Test
In Words
1. Identify the claim. State the
null and alternative
hypotheses.
In Symbols
State H0 and Ha.
2. Specify the level of
significance.
Identify .
3. Identify the degrees of
freedom.
d.f. = k – 1
4. Determine the critical
value.
Use Table 6 in
Appendix B.
Larson/Farber 4th ed
16
Chi-Square Goodness-of-Fit Test
In Words
In Symbols
5. Determine the rejection region.
6. Calculate the test statistic.
7. Make a decision to reject or fail
to reject the null hypothesis.
(O E)2
E
2
If χ2 is in the
rejection region,
reject H0.
Otherwise, fail to
reject H0.
8. Interpret the decision in the
context of the original claim.
Larson/Farber 4th ed
17
Example: Performing a Goodness of Fit
Test
Use the music preference data to perform a chi-square
goodness-of-fit test to test whether the distributions are
different. Use α = 0.01.
Distribution of
music preferences
Classical
4%
Country
36%
Gospel
11%
Oldies
2%
Pop
18%
Rock
29%
Larson/Farber 4th ed
Survey results
(n = 500)
Classical
8
Country
210
Gospel
72
Oldies
10
Pop
75
Rock
125
18
Solution: Performing a Goodness of Fit
Test
• H0: music preference is 4% classical, 36% country,
11% gospel, 2% oldies, 18% pop, and 29% rock
• Ha: music preference differs from the claimed or
expected distribution
• Test Statistic:
• α = 0.01
• d.f. = 6 – 1 = 5
• Rejection Region
• Decision:
0.01
0
Larson/Farber 4th ed
15.086
• Conclusion:
χ2
19
Solution: Performing a Goodness of Fit
Test
Type of
music
Classical
Country
Gospel
Oldies
Pop
Rock
Observed
frequency
8
210
72
10
75
125
Expected
frequency
20
180
55
10
90
145
2
(
O
E
)
2
E
(8 20)2 (210 180)2 (72 55) 2 (10 10) 2 (75 90) 2 (125 145) 2
20
180
55
10
90
145
22.713
Larson/Farber 4th ed
20
Solution: Performing a Goodness of Fit
Test
• H0: music preference is 4% classical, 36% country,
11% gospel, 2% oldies, 18% pop, and 29% rock
• Ha: music preference differs from the claimed or
expected distribution
• Test Statistic:
• α = 0.01
χ2 = 22.713
• d.f. = 6 – 1 = 5
• Rejection Region
• Decision: Reject H0
0.01
0
Larson/Farber 4th ed
χ2
15.086
22.713
There is enough evidence to
conclude that the distribution
of music preferences differs
from the claimed distribution.
21
Example: Performing a Goodness of Fit
Test
The manufacturer of M&M’s candies claims that the
number of different-colored candies in bags of dark
chocolate M&M’s is uniformly distributed. To test this
claim, you randomly select a bag that contains 500 dark
chocolate M&M’s. The results are shown in the table on
the next slide. Using α = 0.10, perform a chi-square
goodness-of-fit test to test the claimed or expected
distribution. What can you conclude? (Adapted from
Mars Incorporated)
Larson/Farber 4th ed
22
Example: Performing a Goodness of Fit
Test
Color
Brown
Yellow
Red
Blue
Orange
Green
Frequency
80
95
88
83
76
78
n = 500
Larson/Farber 4th ed
Solution:
• The claim is that the
distribution is uniform, so
the expected frequencies of
the colors are equal.
• To find each expected
frequency, divide the sample
size by the number of colors.
• E = 500/6 ≈ 83.3
23
Solution: Performing a Goodness of Fit
Test
• H0: Distribution of different-colored candies in bags
of dark chocolate M&Ms is uniform
• Ha: Distribution of different-colored candies in bags
of dark chocolate M&Ms is not uniform
• Test Statistic:
• α = 0.10
• d.f. = 6 – 1 = 5
• Rejection Region
• Decision:
0.10
0
Larson/Farber 4th ed
9.236
• Conclusion:
χ2
24
Solution: Performing a Goodness of Fit
Test
2
(
O
E
)
2
E
Color
Brown
Yellow
Red
Blue
Orange
Green
Observed
frequency
80
95
88
83
76
78
Expected
frequency
83.3
83.3
83.3
83.3
83.3
83.3
(80 83.3)2 (95 83.3)2 (88 83.3)2 (83 83.3)2 (76 83.3) 2 (78 83.3)2
83.3
83.3
83.3
83.3
83.3
83.3
3.016
Larson/Farber 4th ed
25
Solution: Performing a Goodness of Fit
Test
• H0: Distribution of different-colored candies in bags
of dark chocolate M&Ms is uniform
• Ha: Distribution of different-colored candies in bags
of dark chocolate M&Ms is not uniform
• Test Statistic:
• α = 0.01
χ2 = 3.016
• d.f. = 6 – 1 = 5
• Rejection Region
• Decision: Fail to Reject H0
0.10
0
3.016
Larson/Farber 4th ed
9.236
χ2
There is not enough evidence
to dispute the claim that the
distribution is uniform.
26
Section 10.1 Summary
• Used the chi-square distribution to test whether a
frequency distribution fits a claimed distribution
Larson/Farber 4th ed
27