The binomial distributions

"I gather, young man, that youwish to be a Member of Parliament. The first lesson you must learnis, when I call for statistics about the rate of infant mortality,what I want is proof that fewer babies died when I was Prime Ministerthan when anyone else was prime minister. That is a politicalstatistic."

Winston Churchill

13.2 INFERENCE FOR TWO-WAY TABLES (Pages 717 - 738)

OVERVIEW: Sometimes one wants tocompare proportions from two or more groups. One can use thechi-square test to do this. Basically, the null hypothesis statesthat all the population proportions are equal. The alternatehypothesis is that they are not, which means that there appears to bea significant difference between at least two of the populationproportions.

If the null hypothesis is true, the expected countfor a cell in a two-way table is

expected count = (row total)(column total)/(table total).
The x² statistic is
x² = sum[(observed count -expected count)²/(expected count)].
The degrees offreedom is (number of rows - 1)(number ofcolumns - 1).
You can use x² when all expected cell counts are at least 1, and no morethan 20% are less than 5.

Example:

The table displays the number of students passedand failed by each of three instructors who teach IntroductoryStatistics at a specific university.

	Instructor #1	Instructor #2	Instructor #3	TOTALS
Passed	50	47	56	153
Failed	5	14	8	27
TOTALS	55	61	64	180

Suppose we wish to examine the null hypothesis,H₀, that theproportions of students failed by the three instructors are the same.The expected frequencies under H₀ are displayed in the lowerright of each observed count cell. Remember that expected count =(row total)(column total)/(table total). For instance,

46.75 = (55)(153)/180.

Note also that all expected counts are more than5, and that degrees of freedom = (2-1)(3-1) = 2.

	Instructor #1	Instructor #2	Instructor #3	TOTALS
Passed	50_46.75	47_51.85	56_54.40	153
Failed	5_8.25	14_9.15	8_9.60	27
TOTALS	55	61	64	180

The calculation of x² yields

x² = (50-46.75)²/46.75 +(47-51.85)²/51.85 + (56-54.40)²/54.40 +(5-8.25)²/8.25 + (14-9.15)²/9.15 +(8-9.60)²/9.60 = 4.84

Using the x² table with 2 degrees of freedom, we cannot rejectH₀ at the 5%level of significance, since the critical region is x² > 5.99, and our calculated valueis not in this region. At the 10% level of significance, the criticalregion is x² > 4.61. We wouldreject H₀ atthis level.

Using the TI-83 to calculate a P-value, we havex²cdf(4.84,1E99,2) = .0889216175, or about 8.9%. Note that this"jives" with rejection of H₀ at the 10% level ofsignificance, and non-rejection at the 5% level.

===========================

On page 735, the authors note that the chi-squaretest and the z-test "agree" when you have a 2-by-2 table situation.In this situation, the chi-square statistic is the square of thez-statistic, and the P-value for x² is the same for the two-sided P-value for z. What followsis an attempt to illustrate this fact using data from Example 12.13(page 683). Here is the data in tabular from with expected valuescalculated and displayed.

	Gemifibozoil	Placebo	TOTALS
Heart attack	56_70.36	84_69.64	140
No heart attack	1995_1980.64	1946_1960.36	3941
TOTALS	2051	2030	4081

On page 685, the calculated z-statistics is shownto be z = -2.47.

The calculated x² is

x² = (56-70.36)²/70.36 +(84-69.64)²/69.64 + (1995-1980.64)²/1980.64 +(1946-1960.36)²/1960.36 = 6.101.

Note that (-2.47)² = 6.101, illustrating that,in this situation, x² = z².

The degrees of freedom here is (2-1)(2-1) = 1. TheP-value for x² is x²cdf(6.101,1E99,1) = .0135105406, or about 1.35%.

On page 685, the authors give a 1-sided P-value(for the z-statistic) of .0068. Note that 2(.0068) = .0136, or about1.36%, further illustrating what the authors are saying in theircomparison of x² and z on page 735.

RETURN TO TEXTBOOK HOME PAGE