Sanderson M. Smith

Home | About Sanderson Smith | Writings and Reflections | Algebra 2 | AP Statistics | Statistics/Finance | Forum

Example of Problem Involving

INFERENCE ON SLOPE OF REGRESSION LINE

(Problem #6 on 2001 AP Statistics Examination)

This is an analysis of Problem #6 in Section II on the 2001 Advanced Placement Statistics Examination.

**The College Board does not allow a reproduction of the problem
statement here. If you do not have before you a statement of the
actual problem, you can get it from the College Board site**

**http://www.collegeboard.org/ap/statistics/frq01/index.html**

**=============================================**

**Comment:** This problem involves reading generic computer
output. There is no need to input the actual data values provided
into a calculator.

**Here is a solution:**

(a) One way to compare the data in the two groups would be to construct back-to-back dot plots
As indicated by the display above, those who did complete the Ph.D. program tended to have a higher undergraduate GPA in statistics and mathematics than those who did not. Here is a "big 5" summary, illustrating that, with the exception of MIN values, the values for those who "completed" are greater than the corresponding values for those who "did not complete."
(b) Using the computer output, the regression equation for those who completed the program is
The premise states that all assumptions for inference were reasonable. We can therefore run a hypothesis test (t test) on b, the true slope of the regression line.
The computer output shows t = -5.90 with a two-sided
P-value of 0.0000. In other words, H (c) We are interested in the point (3.5, 14). For those who completed the Ph.D. program, the calculated regression line is
For those who did not complete the program, the calculated regression line is
Since (3.5,14) is much closer to the model for those who completed the program, it is reasonable to think that an applicant with GAP = 3.5 and a mean number of credit hours = 14.0 will successfully complete the program. |

**Additional notes relating to this problem:**

The required statistical value of t = -5.90 is provided in the
computer output. Note that it can be calculated by the formula t =
b/SE_{b} = -2.7555/0.4668 = -5.90.

One could respond to (b) by constructing a confidence interval for b. For instance, using the provided data, the 95% confidence interval is

-2.7555 plus/minus 2.201(0.4668) = (-3.783, -1.728)

Since this interval does not contain 0, there is evidence to
reject H_{0} at the 5% level of significance.

Previous Page | Print This Page

Copyright © 2003-2009 Sanderson Smith