CORRELATION AND SIMPLE REGRESSION
Name ______________________________ Signature ___________________________
QSO 510 Quantitative Analysis for Decision Making
Summer 2012 Mid-term Exam - Instructor: Dr. Derek Kane
Instructions
1. This exam is taken in-class, you may use any notes or books that you care to. You may only use your own notes.
2. The exam only requires the use of a hand calculator. You may use notes you have taken on the
3. Answer all questions in the context of the problem. General answers are not expected.
4. You must show all steps including formulas used and all calculations done to arrive at the final answers. Incomplete solutions will receive partial credit.
5. Use at least four significant digits at all intermediate steps. Round off the final answers appropriately. Note: 0.0042 is only two significant digits as leading zeros are not considered significant. Trailing zeros are considered significant.
6. You only need to do four problems. If you do all five problems indicate which four you want me to grade.
7. You are welcome to ask questions you have on the problems. Please do not ask any questions relating to the solution of any problem.
8. You must work on the exam by yourself.
(For Instructor’s use)
|
Problem |
Points |
|
1 |
|
|
2 |
|
|
3 |
|
|
4 |
|
|
5 |
|
|
Total |
|
Problem 1 (25 points)
You have an assembly line which produces 50kg bags of flour with a standard deviation of 2.0kg.
a) Assuming the distribution of weight is normal, what is the chance any single bag’s weight is less than 49kg?
b) If you chose 81 bags at random, what would be the expected average weight of the bags in your sample? What would the standard deviation of the sample average be? What is the shape of this distribution? Give reasons for your answers .
c) If you pulled a sample of 64 bags, what is the chance that you would find the average weight was between 49.25 and 50.25kg?
Write your answers in the space below and continue on the next page.
a) The probability we are looking for is
Thus there is a 30.85% chance that the weight will be less than 49kg.
b) Because there are more than 30 items in the sample, the Central Limit Theorem applies and
The shape is Gaussian.
c) The probability is:
There is an 84% chance that the sample will weigh between 49.25 and 50.25.
Problem 2 (25 points)
We want to look at economic opportunity for women. Gallup conducted a survey of women’s employment in Greece and found that 205 of 500 women surveyed were employed
a) Compute the proportion and standard deviation of Greek women who are employed.
b) Construct a 95% confidence interval for women’s employment in Greece.
c) If you are working for CNN, how would you explain the confidence interval to your audience? Does the confidence interval seem small enough? If not, how would you make the interval smaller? Provide a clear and complete answer.
Begin your answer below and continue on the next page.
a) The proportion and standard deviation is
b) The confidence interval is
c) This is the range where we are 95% sure the true proportion lies. We should take a larger sample to get a smaller confidence interval.
Problem 3 (25 points)
According to the same Gallup poll, 46% of men in Greece are employed. Perform a 95% hypothesis test to determine whether the survey of 500 women showing 41% employment indicates that women have a lower employment rate than men in Greece.
Begin your answer below and continue on the next page.
The hypotheses are
The value of α is given as 0.05 and the number of people in the survey is n=500
The threshold of this one-sided z-test is 1.645
The proportion is so the z-statistic is
We accept the alternate hypothesis.
There is statistical evidence that women in Greece have a lower employment rate than men.
Problem 4 (25 points)
The following table shows the Housing Starts in thousands for the past five quarters.
|
Quarter |
Housing starts (1000s) |
SSXX |
SSXY |
SSYY |
|
1 |
90 |
4 |
34.8 |
302.76 |
|
2 |
123 |
1 |
-15.6 |
243.36 |
|
3 |
118 |
0 |
0 |
112.36 |
|
4 |
100 |
1 |
-7.4 |
54.76 |
|
5 |
106 |
4 |
-2.8 |
1.96 |
|
3 |
107.4 |
10 |
9 |
715.2 |
(a)
(b) Determine the regression equation using Housing Starts as the dependent variable.
(c) Compute the standard error of the estimate se.
Part (c) and (d) are not going to be on the upcoming midterm.
(d)
Begin your answer in the space below and continue on the next page
Problem 5 (25 points)
The following is a portion of the data collected to investigate the correlations between four stocks and the S&P 500 Industrial Average:
|
Date |
JP Morgan |
Walmart |
Exxon |
Apple |
S&P |
|
7/6/2012 |
33.9 |
71.08 |
84.8 |
605.88 |
1354.68 |
|
7/5/2012 |
34.38 |
71.08 |
85.57 |
609.94 |
1367.58 |
|
7/3/2012 |
35.88 |
70.75 |
86.28 |
599.41 |
1374.02 |
|
7/2/2012 |
35.98 |
69.35 |
85.34 |
592.52 |
1365.51 |
|
6/29/2012 |
35.43 |
69.72 |
85.57 |
584 |
1362.16 |
|
6/28/2012 |
35.58 |
68.3 |
83.1 |
569.05 |
1329.04 |
|
6/27/2012 |
36.48 |
68.59 |
83.2 |
574.5 |
1331.85 |
|
6/26/2012 |
35.41 |
68.58 |
82.4 |
572.03 |
1319.99 |
|
6/25/2012 |
35.03 |
68.18 |
81.24 |
570.77 |
1313.72 |
|
6/22/2012 |
35.69 |
67.3 |
82.11 |
582.1 |
1335.02 |
|
6/21/2012 |
35.22 |
67.7 |
82.11 |
577.67 |
1325.51 |
|
6/20/2012 |
36.15 |
68.52 |
84.97 |
585.74 |
1355.69 |
|
6/19/2012 |
35.09 |
67.81 |
84.48 |
587.41 |
1357.98 |
|
6/18/2012 |
34.33 |
68.12 |
83.11 |
585.78 |
1344.78 |
|
6/15/2012 |
34.74 |
67.75 |
83.22 |
574.13 |
1342.84 |
|
6/14/2012 |
34.36 |
67.63 |
82.13 |
571.53 |
1329.1 |
|
6/13/2012 |
34.02 |
67.07 |
80.63 |
572.16 |
1314.88 |
|
6/12/2012 |
33.49 |
67.72 |
81.26 |
576.16 |
1324.18 |
|
6/11/2012 |
32.55 |
67.53 |
80.27 |
571.17 |
1308.93 |
|
6/8/2012 |
33.4 |
68.22 |
80.84 |
580.32 |
1325.66 |
|
6/7/2012 |
32.54 |
65.87 |
80.69 |
571.72 |
1314.99 |
|
6/6/2012 |
32.8 |
65.93 |
80.18 |
571.46 |
1315.13 |
|
6/5/2012 |
31.73 |
65.5 |
77.6 |
562.83 |
1285.5 |
|
6/4/2012 |
30.74 |
65.99 |
77.83 |
564.29 |
1278.18 |
|
Regression Statistics |
|
|
|
|
|
|
|
Multiple R |
0.962267935 |
|
|
|
|
|
|
R Square |
0.925959578 |
|
|
|
|
|
|
Adjusted R Square |
0.925623031 |
|
|
|
|
|
|
Standard Error |
45.16258314 |
|
|
|
|
|
|
Observations |
885 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
ANOVA |
|
|
|
|
|
|
|
|
df |
SS |
MS |
F |
Significance F |
|
|
Regression |
4 |
22447261.48 |
5611815.37 |
2751.349908 |
0 |
|
|
Residual |
880 |
1794899.846 |
2039.658916 |
|
|
|
|
Total |
884 |
24242161.33 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Coefficients |
Standard Error |
t Stat |
P-value |
Lower 95% |
Upper 95% |
|
Intercept |
172.7601601 |
31.63797452 |
5.460531613 |
6.17796E-08 |
110.6654669 |
234.8548532 |
|
JP Morgan |
11.9624917 |
0.286137971 |
41.80672581 |
2.9476E-211 |
11.40089919 |
12.52408421 |
|
Walmart |
2.290929236 |
0.655364656 |
3.495655764 |
0.000496528 |
1.004669044 |
3.577189428 |
|
Exxon |
3.078491008 |
0.256273015 |
12.01254456 |
6.86108E-31 |
2.575513354 |
3.581468662 |
|
Apple |
0.669893335 |
0.026953728 |
24.85345781 |
1.0171E-103 |
0.616992241 |
0.722794428 |
Answer the following questions based on the Excel output report. Support your answers with numbers from the output report. Use level of significance = 0.05.
a) Write the estimated multiple regression equation. Note: Use actual variable names and numbers. If using symbols, define them before using in the equation.
b) Clearly explain the meaning of b1 (the coefficient of JP Morgan). Note: Use actual variable names and numbers in answering your question. b1 is the slope is not a sufficient answer.
If the price of JP Morgan increases by $1, then the S&P goes up 11.96.
c) Clearly explain the meaning of b2 (the coefficient of Walmart). Note: Use actual variable names and numbers in answering your question. b2 is the slope is not a sufficient answer.
If the price of Walmart increases by $1, then the S&P goes up 2.291.
d) Clearly explain the meaning of b3 (the coefficient of Exxon). Note: Use actual variable names and numbers in answering your question. b3 is the slope is not a sufficient answer.
If the price of Exxon increases by $1, then the S&P goes up 3.078.
e) Clearly explain the meaning of b4 (the coefficient of Apple). Note: Use actual variable names and numbers in answering your question. b4 is the slope is not a sufficient answer.
If the price of Apple increases by $1, then the S&P goes up 0.6699.
f) Is the regression equation significant? Give reasons for your answer. (Hint: The answer to this question requires test of the hypothesis: Ho: 1 = 2 = 3 = 4 = 0 vs. Ha: At least one j is not equal to zero, where j = 1, 4)
It is significant, because it has variables with p-values less than 0.05.
g) Which variables in the current equation are significant and which are not significant? Give reason for your answer. (Hint: The answer to this question requires test of hypothesis: Ho: j = 0 vs. Ha: j 0 for j = 1, 4).
All of the variables are significant.
h) Do you have an explanation for why these stocks and the S&P 500 index would be so well correlated?
The stocks are all components of the S&P 500.
Begin your answer in the space below and continue on the next page
Y
Y
ˆ
X