MGMT2263

Worksheet #6 Solutions

 

We begin with the KPK output:

SUMMARY OUTPUT

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Regression Statistics

 

 

 

 

 

 

Multiple R

0.8376

 

 

 

 

 

 

R Square

0.7016

 

 

 

 

 

 

Adjusted R Square

0.6220

 

 

 

 

 

 

Standard Error

9.8975

 

 

 

 

 

 

Observations

20

 

 

 

 

 

 

 

 

 

 

 

 

 

 

ANOVA

 

 

 

 

 

 

 

 

df

SS

MS

F

Significance F

 

 

Regression

4

3454.6016

863.6504

8.8164

0.0007

 

 

Residual

15

1469.3984

97.9599

 

 

 

 

Total

19

4924.0000

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Coefficients

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

VIF

Intercept

24.3923

11.9934

2.0338

0.0601

-1.1710

49.9556

 

study

3.8911

0.7286

5.3403

0.0001

2.3380

5.4441

1.0850

sleep

2.9052

1.2416

2.3399

0.0335

0.2589

5.5515

1.0700

gender

-3.8384

4.5543

-0.8428

0.4126

-13.5456

5.8688

1.0481

income

-2.6087

4.8213

-0.5411

0.5964

-12.8851

7.6676

1.0797

 

1)     From the output, grade = 24.3923 + 3.8911(study) + 2.9052(sleep) – 3.8384(gender) – 2.6087(income)

2)     The percentage of variation in grade explained by the model is 70.16%.

3)     Ho: B1 = B2 = B3 = B4 = 0

Ha: not all coefficients equal 0

Reject Ho if test statistic > 3.06

Test statistic = 8.8164.

Reject Ho and conclude the model is significant.

4)     From the t test section, we see the p-value for study is 0.01%, that of sleep is 3.35%, that of gender is 41.26% and that of income is 59.64%. Since study and sleep are the only variables with p-values less than 5%, these are the only variables that are significant.

5)     The highest VIF is 1.085. Since all the VIF are less than 10, there are no collinearity problems.

 

Here is the KPK output for the second model:

SUMMARY OUTPUT

 

 

 

 

 

 

 

 

 

 

 

 

 

Regression Statistics

 

 

 

 

 

Multiple R

0.8247

 

 

 

 

 

R Square

0.6802

 

 

 

 

 

Adjusted R Square

0.6426

 

 

 

 

 

Standard Error

9.6247

 

 

 

 

 

Observations

20

 

 

 

 

 

 

 

 

 

 

 

 

ANOVA

 

 

 

 

 

 

 

df

SS

MS

F

Significance F

 

Regression

2

3349.2104

1674.6052

18.0775

6.18997E-05

 

Residual

17

1574.7896

92.6347

 

 

 

Total

19

4924

 

 

 

 

 

 

 

 

 

 

 

 

Coefficients

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

Intercept

20.0198

10.9148

1.8342

0.0842

-3.0083

43.0480

study

3.9576

0.6876

5.7560

0.0000

2.5070

5.4083

sleep

3.0188

1.1798

2.5589

0.0203

0.5298

5.5079

6)     From the output, grade = 20.0198 + 3.9576(study) + 3.0188(sleep)

7)     The KPK output gives 80.728 which rounds to 81.

8)     The 95% confidence interval given by KPK ranges from 76 to 86 after rounding.

9)     The 95% prediction interval given by KPK ranges from 60 to 102. Since the upper limit cannot be more than 100, the appropriate upper limit is 100.

10)  Here are the results:

 

Adj. r2

ANOVA p-value

t tests (all variables significant?)

#1

62.2%

0.0007

no

#2

64.26%

6.19 x 10-5

yes

Best

#2

#2

#2

Model #2 is best since it has the highest adjusted r2, the lowest ANOVA p-value and all its variables are significant.

11)  Ho: age does not depend on gender

Ha: age does depend on gender

Degrees of freedom = 1 x 2 = 2; Reject Ho if test statistic > 5.9915

Here is the crosstab of expected values:

22.09

9.82

13.09

31.91

14.18

18.91

Test statistic = 0.165 + 0.068 + 0.091 + 0.114 + 0.047 + 0.063 = 0.548.

Do not reject Ho.        

Conclude that the client’s age does not depend on gender.

12)  Cramer’s V = sqrt(0.548/110) = 0.0706 = 7.06%

13)  When we work out the expected values for the 20+ category, this is what we get:

working

semi-retired

retired

1.33

3.62

5.05

As we see, 2 of the 3 expected values are less than 5. Thus, we need to collapse this category with the 10 to 19 category.

14)  Here is the new crosstab of observed values:

 

< 10

10 or more

Total

Working

10

4

14

Semi-retired

13

25

38

Retired

15

38

53

Total

38

67

105

Ho: hours worked does not depend on work status

Ha: hours worked does depend on work status

Degrees of freedom = 2 x 1 = 2; Reject Ho if test statistic > 5.9915.

Here is the crosstab of expected values:

5.067

8.933

13.752

24.248

19.181

33.819

Test statistic = 4.803 + 2.724 + 0.041 + 0.023 + 0.911 + 0.517 = 9.019

Reject Ho.

Conclude hours worked depends on work status

15)  Cramer’s V = sqrt(9.019/105) = 0.2931 = 29.31%

16)  Ho: the data is normally distributed

Ha: it is not

Reject Ho if test statistic > 0.285

The mean of the data is 278.125 and the standard deviation is 436.8551. This spreadsheet summarizes the results:

Value

Z score

F(z)

S(z)

S'(z)

Max diff

50

-0.52

0.3015

0.1250

0

0.3015

75

-0.46

0.3228

0.2500

0.125

0.1978

100

-0.41

0.3409

0.3750

0.25

0.0909

120

-0.36

0.3594

0.5000

0.375

0.1406

140

-0.32

0.3745

0.6250

0.5

0.2505

150

-0.29

0.3859

0.7500

0.625

0.3641

240

-0.09

0.4641

0.8750

0.75

0.4109

1350

2.45

0.9929

1.0000

0.875

0.1179