MGMT2263

Worksheet #6

 

You are given the following data of shipping distance and the time it takes for a shipment to travel that distance by courier.

Distance in km

825

215

1070

550

480

920

1350

325

670

1215

Time in days

3.5

1

4

2

1

3

4.5

1.5

3

5

 

1)     Construct a model in which the shipping time depends on the distance. State what the model is. (time = 0.1181 + 0.0036*distance)

2)     What percentage of the variation in shipping time is explained by the model? (90.05%)

3)     Test the hypothesis that the model is significant at the 5% level of significance. (F = 72.3959; conclude model is significant)

4)     Construct a 95% confidence interval of the slope coefficient. If this interval were used to test the hypothesis in question 3, why would the same conclusion be reached? (0.0026 < B1 < 0.0046; hypothesized coefficient of zero is not in the interval)

5)     In theory, the residuals are normally distributed with a mean of zero and a common variance. What is the estimate of that variance? (0.2304)

6)     Test the hypothesis that there is a significant direct relationship between the variables at a 5% level of significance. (t =  8.51; conclude there is a significant direct relationship)

7)     If the distance is 500 km, how many days should you expect the shipping time to be? Round to 2 decimals. (1.91 days)

8)     If the distance is 500 km, what is the range of the average shipping time for 95% of the time? (1.48 to 2.34 days)

9)     For a particular shipment that travels 500 km, what is the range of the shipping time for 95% of the time? (0.72 to 3.1 days)

 

Suppose you are given the following information:

X = size of home (in thousands of square feet)

Y = home price (in thousands of dollars)

size

1.82

1.59

1.57

1.81

2.01

1.57

1.87

1.82

1.59

1.95

price

173.1

160

164.6

183.5

194.8

166

178.7

181.5

160.5

196.5

 

10)  Construct a model in which the home price depends on the home size (price = 43.4702 + 75.2556*size)

11)  What percentage of the variation in price is explained by the model? (88.52%)

12)  Test the hypothesis that the model is significant at the 5% level of significance. (F = 61.6842; conclude model is significant)

13)  Construct a 95% confidence interval of the slope coefficient. If this interval were used to test the hypothesis in question 12, why would the same conclusion be reached? (53.1597 < B1 < 97.3515; hypothesized coefficient of zero is not in the interval)

14)  If a home has 1500 square feet, what would you expect the price to be? Round to the nearest hundred. ($156,400)

15)  If the square footage is 1500 square feet, what is the range of the average home price for 95% of the time? Round to the nearest hundred. ($149,600 to $163,100)

16)  If a particular home with 1500 square feet is put up for sale, what is the range of the selling price of this home for 95% of the time? Round to the nearest hundred. ($143,400 to $169,300)

 

An educational researcher wanted to see what factors influenced school grades and examined the average number of hours students studied for a test, average number of hours of sleep per night, gender and household income level. For the purposes of doing regression, male=0 and female=1 and low income = 0 and medium/high income = 1. Twenty subjects were randomly chosen. A model using all variables was constructed. These are the results:

SUMMARY OUTPUT

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Regression Statistics

 

 

 

 

 

 

Multiple R

0.8376

 

 

 

 

 

 

R Square

0.7016

 

 

 

 

 

 

Adjusted R Square

0.6220

 

 

 

 

 

 

Standard Error

9.8975

 

 

 

 

 

 

Observations

20

 

 

 

 

 

 

 

 

 

 

 

 

 

 

ANOVA

 

 

 

 

 

 

 

 

df

SS

MS

F

Significance F

 

 

Regression

4

3454.6016

863.6504

8.8164

0.0007

 

 

Residual

15

1469.3984

97.9599

 

 

 

 

Total

19

4924.0000

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Coefficients

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

VIF

Intercept

24.3923

11.9934

2.0338

0.0601

-1.1710

49.9556

 

study

3.8911

0.7286

5.3403

0.0001

2.3380

5.4441

1.0850

sleep

2.9052

1.2416

2.3399

0.0335

0.2589

5.5515

1.0700

gender

-3.8384

4.5543

-0.8428

0.4126

-13.5456

5.8688

1.0481

income

-2.6087

4.8213

-0.5411

0.5964

-12.8851

7.6676

1.0797

17)  Construct a model with grade as the dependent variable.

(grade = 24.3923 + 3.8911*study + 2.9052*sleep – 3.8384*gender – 2.6087*income)

18)  What percentage of the variation in grade is explained by the model? (70.16%)

19)  Is the model significant? Test at a 5% level of significance. (F = 8.8164; yes, the model is significant)

20)  What individual variables are significant at a 5% level of significance? (study and sleep)

21)  Are there any collinearity problems among the independent variables? (no)

 

A second model was constructed using only study and sleep as the independent variables. Here are the results:

SUMMARY OUTPUT

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Regression Statistics

 

 

 

 

 

 

Multiple R

0.8247

 

 

 

 

 

 

R Square

0.6802

 

 

 

 

 

 

Adjusted R Square

0.6426

 

 

 

 

 

 

Standard Error

9.6247

 

 

 

 

 

 

Observations

20

 

 

 

 

 

 

 

 

 

 

 

 

 

 

ANOVA

 

 

 

 

 

 

 

 

df

SS

MS

F

Significance F

 

 

Regression

2

3349.2104

1674.6052

18.0775

6.18997E-05

 

 

Residual

17

1574.7896

92.6347

 

 

 

 

Total

19

4924

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Coefficients

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

 

Intercept

20.0198

10.9148

1.8342

0.0842

-3.0083

43.0480

 

study

3.9576

0.6876

5.7560

0.0000

2.5070

5.4083

 

sleep

3.0188

1.1798

2.5589

0.0203

0.5298

5.5079

 

 

 

 

 

 

 

 

 

X Variable 1

X Variable 2

Predicted Value

Std Error Prediction

Lower 95% Mean

Upper 95% Mean

Lower 95% Predict

Upper 95% Predict

10

7

80.7280

2.4491

75.5609

85.8951

59.7745

101.6814

 

22)  State the new model. (grade = 20.0198 + 3.9576*study + 3.0188*sleep)

23)  If someone studies for 10 hours on average and sleeps for 7 hours, what would you expect the person’s grade to be, rounding to the nearest whole number? (81)

24)  Based on 10 hours of study and 7 hours of sleep, in what grade range would you expect the average grade to fall 95% of the time, rounding to the nearest whole number? (76 to 86)

25)  For an individual who studies for 10 hours and sleeps for 7 hours, in what grade range would you expect the person’s grade to fall 95% of the time? (60 to 100 since the mark cannot be above 100)

26)  Using the criteria of adjusted r2, ANOVA p-value and t-test p-values (using a 5% level of significance), which of the two models is best? (Model #2)