STAT177 Homework sheet #9 Solutions

 

1)     Here is the appropriate output:

Regression model:

 

Vacation = 0.1878 + 0.0434Income

2)     Here is the appropriate output:

Percentage of variation in Vacation explained by the model: 93.55%

3)     Here is the appropriate output:

Ho: the model is not significant

Ha: the model is significant

Reject Ho if test statistic > 5.987

Test statistic = 86.999

 

P-value = 0

 

 

Reject Ho

 

 

Conclude the model is significant

4)     Note that we input 75 and not 75000 in the input box since the variable is coded in thousands. Here is the appropriate output:

Predicting Vacation

 

Income = 75

Prediction:

3.4428

We then multiply 3.4428 by 1000 to get 3442.80 which rounds to 3400.

5)     Here is the appropriate output:

95% confidence interval for Vacation

Lower limit:

3.1788

 

 

Upper limit:

3.7067

 

 

As with question 4, we multiply the limits by 1000 to get 3178.8 and 3706.7 which round to 3200 and 3700 respectively.

6)     Here is the appropriate output:

95% prediction interval for Vacation

Lower limit:

2.9296

 

Upper limit:

3.956

 

Once again, we multiply the limits by 1000 to get 2929.6 and 3956 which round to 2900 and 4000 respectively.

7)     Here is the output of the model:

Regression model:

 

 

 

 

 

grade = 24.3923 + 3.8911study + 2.9052sleep - 3.8384gender - 2.6087income

Percentage of variation in grade explained by the model: 70.16%

Adjusted for the number of variables: 62.2%

 

 

 

Ho: the model is not significant

 

 

 

 

Ha: the model is significant

 

 

 

 

Reject Ho if test statistic > 3.056

 

 

 

 

Test statistic = 8.816

 

 

 

 

 

P-value = 0.001

 

 

 

 

 

Reject Ho

 

 

 

 

 

 

Conclude the model is significant

 

 

 

 

 

 

 

 

 

 

 

 

 

95% Confidence Interval

 

 

 

Coefficient

Lower limit

Upper limit

P-value

VIF

Comment

Intercept

24.3923

-1.171

49.9556

0.0601

 

 

study

3.8911

2.338

5.4441

0.0001

1.085

Keep this variable

sleep

2.9052

0.2589

5.5515

0.0335

1.07

Keep this variable

gender

-3.8384

-13.5456

5.8688

0.4126

1.0481

Drop this variable

income

-2.6087

-12.8851

7.6676

0.5964

1.0797

Drop this variable

From the output we see the model is grade = 24.3923 + 3.8911*study + 2.9052*sleep – 3.8384*gender – 2.6087*income

8)     The percentage of variation in grade explained by the model is 70.16%.

9)     Ho: B1 = B2 = B3 = B4 = 0

Ha: not all coefficients equal 0

Reject Ho if test statistic > 3.056

Test statistic = 8.816. As well, p-value = 0.1%

Reject Ho and conclude the model is significant.

10) From the t test section, we see the p-value for study is 0.01%, that of sleep is 3.35%, that of gender is 41.26% and that of income is 59.64%. Since study and sleep are the only variables with p-values less than 5%, these are the only variables that are significant.

11) The highest VIF is 1.085. Since all the VIF are less than 10, there are no collinearity problems.

12) Here is the output of the second model:

Regression model:

 

 

 

 

 

grade = 20.0198 + 3.9576study + 3.0188sleep

 

 

 

Percentage of variation in grade explained by the model: 68.02%

Adjusted for the number of variables: 64.26%

 

 

 

Ho: the model is not significant

 

 

 

 

Ha: the model is significant

 

 

 

 

Reject Ho if test statistic > 3.592

 

 

 

 

Test statistic = 18.078

 

 

 

 

 

P-value = 0

 

 

 

 

 

Reject Ho

 

 

 

 

 

 

Conclude the model is significant

 

 

 

 

 

 

 

 

 

 

 

 

 

95% Confidence Interval

 

 

 

Coefficient

Lower limit

Upper limit

P-value

VIF

Comment

Intercept

20.0198

-3.0083

43.048

0.0842

 

 

study

3.9576

2.507

5.4083

0

1.0217

Keep this variable

sleep

3.0188

0.5298

5.5079

0.0203

1.0217

Keep this variable

 

 

 

 

 

 

 

Predicting grade

 

 

 

 

 

study = 10

Prediction:

80.728

 

 

 

 

sleep = 7

95% confidence interval for grade

 

 

 

 

Lower limit:

75.5609

 

 

 

 

 

Upper limit:

85.8951

 

 

 

 

 

95% prediction interval for grade

 

 

 

 

Lower limit:

59.7745

 

 

 

 

 

Upper limit:

101.6814

 

 

 

 

From the output, we see the model is grade = 20.0198 + 3.9576*study + 3.0188*sleep

13) From the output, the average grade based on 10 hours of study and 7 hours of sleep is 80.728 which rounds to 81.

14) The output for the 95% confidence interval ranges from 75. 5609 to 85.8951 which round to 76 and 86 respectively.

15) The output for the 95% prediction interval ranges from 59.7745 to 101.6814. The lower limit rounds to 60. However, since you can’t have a grade higher than 100, the upper limit of the prediction interval is 100.

16) We construct a table:

 

Adj. r2

ANOVA p-value

t-test – are all significant?

(yes/no)

Model #1

62.2%

0.001

No

Model #2

64.26%

0

Yes

Best

#2

#2

#2

We choose model #2 since:

§       it has the highest adjusted r2

§       it has the lowest ANOVA p-value

§       all its variables are significant

§       it has no collinearity problems