STAT177 homework sheet #9
A travel agent wanted to see the relationship between a household’s annual income and how much per week they spent on vacation. From its records, it sampled 8 clients and got the following results (all in thousands of dollars):
|
Annual income |
51.3 |
69.69 |
73.58 |
71.56 |
56.79 |
33.48 |
52.39 |
41.39 |
|
Vacation amount |
2.35 |
3.01 |
3.58 |
3.28 |
2.85 |
1.81 |
2.34 |
1.82 |
1)
Construct a model in
which the amount spent on vacation depends on household annual income. State
what the model is. (vacation = 0.1878 + 0.0434*income)
2)
What percentage of the
variation in how much a household spends on vacation is explained by annual
household income? (93.55%)
3)
Is the model
significant? Test at 5%. State the p-value and conclusion. (p-value = 0;
conclude the model is significant)
4)
If a household earns
$75,000 per year, how much would you expect it to spend per week on vacation?
Round to the nearest hundred. ($3400)
5)
For households that
earns $75,000 per year, what is the range of the average amount spent per week
on vacation 95% of the time? Round the limits to the nearest hundred. ($3200 to
$3700)
6)
For a particular
household that earns $75,000 per year, what is the range of the amount this
household spends on vacation 95% of the time? Round the limits to the nearest
hundred. ($2900 to $4000)
An
educational researcher wanted to see what factors influenced school grades and
examined the average number of hours students studied for a test, average
number of hours of sleep per night, gender and household income level. For the
purposes of doing regression, male=0 and female=1 and low income = 0 and
moderate/high income = 1. Twenty subjects were randomly chosen. These were the
results:
|
grade |
study |
sleep |
gender |
income |
|
71 |
4 |
6 |
0 |
0 |
|
75 |
3 |
9 |
0 |
0 |
|
61 |
6 |
7 |
1 |
0 |
|
63 |
8 |
6 |
1 |
1 |
|
63 |
6 |
10 |
0 |
0 |
|
58 |
7 |
7 |
1 |
0 |
|
60 |
5 |
9 |
1 |
0 |
|
90 |
10 |
7 |
0 |
0 |
|
93 |
9 |
10 |
1 |
0 |
|
83 |
11 |
8 |
0 |
1 |
|
73 |
9 |
5 |
0 |
1 |
|
75 |
12 |
6 |
0 |
0 |
|
87 |
12 |
5 |
0 |
1 |
|
88 |
12 |
4 |
1 |
0 |
|
90 |
14 |
6 |
0 |
0 |
|
47 |
4 |
4 |
1 |
0 |
|
98 |
11 |
8 |
1 |
1 |
|
96 |
11 |
10 |
1 |
0 |
|
64 |
7 |
7 |
1 |
1 |
|
45 |
5 |
6 |
1 |
1 |
7)
Construct
a model with grade as the dependent variable and the other variables as the
independent variables. State what the model is (grade = 24.3923 + 3.8911*study
+ 2.9052*sleep – 3.8384*gender – 2.6087*income)
8)
What
percentage of the variation in grade is explained by the model? (70.16%)
9)
Is
the model significant? (p-value = 0.1%; yes, the model is significant)
10)
What
individual variables are significant? (study and sleep; these are the only
variables in which the p-value is less than 5%)
11)
Are
there any collinearity problems among the independent variables? (no – all the
VIF values are less than 10)
12)
Construct
a new model using only study and sleep as the independent variables. State what
the model is. (grade = 20.0198 + 3.9576*study + 3.0188*sleep)
13)
If
someone studies for 10 hours on average and sleeps for 7 hours, what would you
expect the person’s grade to be, rounding to the nearest whole number? (81)
14)
Based
on 10 hours of study and 7 hours of sleep, in what grade range would you expect
the average grade to fall 95% of the time, rounding to the nearest whole
number? (76 to 86)
15)
For
an individual who studies for 10 hours and sleeps for 7 hours, in what grade
range would you expect the person’s grade to fall 95% of the time? (60 to 100
since the mark cannot be above 100)
16)
Using
the criteria of adjusted r2, ANOVA p-value and t-test p-values
(using a 5% level of significance), which of the two models is best? (Model #2)