Statistics for Managerial Decisions Making
HW11
MGMT 650 Fall 2018 Week 11 Homework (Last updated 6/24/2018)
Excel Competencies - Regression
| Studies have shown that the frequency with which shoppers browse Internet retailers is related to the frequency with which they actually purchase products and/or services online. The following data show respondents age and answer to the question “How many minutes do you browse online retailers per week?” | ||
| Age (X) | Time (Y) | |
| 19 | 577 | |
| 49 | 401 | |
| 56 | 104 | |
| 20 | 393 | |
| 24 | 555 | |
| 21 | 601 | |
| 30 | 434 | |
| 28 | 540 | |
| 60 | 97 | |
| 58 | 349 | |
| 23 | 578 | |
| 26 | 536 | |
| 60 | 110 | |
| 57 | 221 | |
| 46 | 452 | |
| 30 | 376 | |
| 28 | 343 | |
| 44 | 487 | |
| 49 | 412 | |
| 33 | 449 | |
| 1 | Use Data > Data Analysis > Correlation to compute the correlation checking the Labels checkbox. | |
| 2 | Use the Excel function =CORREL to compute the correlation. If answers for #1 and 2 do not agree, there is an error. | |
| 3 | The strength of the correlation motivates further examination. | |
| a) Insert Scatter (X, Y) plot linked to the data on this sheet with Age on the horizontal (X) axis. | ||
| b) Add to your chart: the chart name, vertical axis label, and horizontal axis label. | ||
| c) Complete the chart by adding Trendline and checking boxes | ||
| 4 | Read directly from the chart: | |
| a) Intercept = | ||
| b) Slope = | ||
| c) R2 = | ||
| 5 | Perform Data > Data Analysis > Regression. Highlight the Y-intercept with yellow. Highlight the X variable in blue. Highlight the standard error in orange | |
| 6 | Using your highlighted cells, what is the equation of the regression line? | |
| 7 | Use Excel to predict the number of minutes spent by a 41-year old shopper. Enter = followed by the regression formula. | |
| Enter the intercept and slope into the formula by clicking on the cells in the regression output with the results. | ||
Excel Competencies -Pivot Table
| Saeko has a yarn shop and wants to expand the colors in her yarn lines. She sends you the raw data in the "Pivot Table Data" tab. | |
| This data consists of the number of yards and meters of the color names, and the color types that each name belongs to, sold during her busiest weekend last year. | |
| 8 | Use the "Pivot Table Data" tab to create a pivot table that shows Saeko the number of yards and meters that were sold in the various yarn types during the busiest weekend of her shop last year. |
| 9 | Using the pivot table that you just created, determine which of the color types sold the least amount of yards using =MIN() |
| Using the pivot table that you just created, determine which of the color types sold the largest amount of yards using =MAX() | |
| If Saeko wants to expand the colors in both the best selling and the worst selling color type, which two color types should she buy? |
Pivot Table Data
| Customer | Color Name | Color Type | Yards | Meters |
| 1 | Coriander | White | 471 | 430.7 |
| 2 | Lynx | Brown | 285 | 260.6 |
| 3 | Daffodil | Yellow | 338 | 309.1 |
| 4 | Periwinkle | Purple | 285 | 260.6 |
| 5 | Opal | Blue | 342 | 312.7 |
| 6 | Toffee | Brown | 279 | 255.1 |
| 7 | Ruby | Red | 154 | 140.8 |
| 8 | Ash | Blue | 299 | 273.4 |
| 9 | Ash | Blue | 450 | 411.5 |
| 10 | Ash | Blue | 404 | 369.4 |
| 11 | Whirlpool | Blue | 419 | 383.1 |
| 12 | Whirlpool | Blue | 370 | 338.3 |
| 13 | Verde | Green | 153 | 139.9 |
| 14 | Regal | Purple | 173 | 158.2 |
| 15 | Lynx | Brown | 396 | 362.1 |
| 16 | Toffee | Brown | 446 | 407.8 |
| 17 | Chocolate | Brown | 222 | 203.0 |
| 18 | Mist | White | 393 | 359.4 |
| 19 | Whirlpool | Blue | 185 | 169.2 |
| 20 | Alfalfa | Green | 412 | 376.7 |
| 21 | Ruby | Red | 445 | 406.9 |
| 22 | Verde | Green | 333 | 304.5 |
| 23 | Sky | White | 414 | 378.6 |
| 24 | Chocolate | Brown | 352 | 321.9 |
| 25 | Whirlpool | Blue | 222 | 203.0 |
| 26 | Black | Black | 324 | 296.3 |
| 27 | Mist | White | 274 | 250.5 |
| 28 | Alfalfa | Green | 305 | 278.9 |
| 29 | Jade | Green | 389 | 355.7 |
| 30 | Opal | Blue | 278 | 254.2 |
| 31 | Cream | White | 253 | 231.3 |
| 32 | Chocolate | Brown | 219 | 200.3 |
| 33 | Ruby | Red | 224 | 204.8 |
| 34 | Mist | White | 353 | 322.8 |
| 35 | Regal | Purple | 361 | 330.1 |
| 36 | Jade | Green | 440 | 402.3 |
| 37 | Sky | White | 493 | 450.8 |
| 38 | Periwinkle | Purple | 319 | 291.7 |
| 39 | Coriander | White | 297 | 271.6 |
| 40 | Black | Black | 462 | 422.5 |
| 41 | Ash | Blue | 182 | 166.4 |
| 42 | Chocolate | Brown | 385 | 352.0 |
| 43 | Periwinkle | Purple | 325 | 297.2 |
| 44 | Black | Black | 341 | 311.8 |
| 45 | Blush | Red | 433 | 395.9 |
| 46 | Lynx | Brown | 355 | 324.6 |
| 47 | Mist | White | 158 | 144.5 |
| 48 | Coriander | White | 364 | 332.8 |
| 49 | Cream | White | 314 | 287.1 |
| 50 | Verde | Green | 367 | 335.6 |
| 51 | Chocolate | Brown | 328 | 299.9 |
| 52 | Daffodil | Yellow | 247 | 225.9 |
| 53 | Chocolate | Brown | 479 | 438.0 |
| 54 | Daffodil | Yellow | 243 | 222.2 |
| 55 | Regal | Purple | 256 | 234.1 |
| 56 | Sapphire | Blue | 342 | 312.7 |
| 57 | Coriander | White | 176 | 160.9 |
| 58 | Black | Black | 345 | 315.5 |
| 59 | Lynx | Brown | 393 | 359.4 |
| 60 | Lynx | Brown | 329 | 300.8 |
| 61 | Periwinkle | Purple | 362 | 331.0 |
| 62 | Chocolate | Brown | 495 | 452.6 |
| 63 | Coriander | White | 349 | 319.1 |
| 64 | Verde | Green | 246 | 224.9 |
| 65 | Lynx | Brown | 441 | 403.3 |
| 66 | Alfalfa | Green | 386 | 353.0 |
| 67 | Cream | White | 175 | 160.0 |
| 68 | Daffodil | Yellow | 190 | 173.7 |
| 69 | Periwinkle | Purple | 486 | 444.4 |
| 70 | Cream | White | 336 | 307.2 |
| 71 | Toffee | Brown | 436 | 398.7 |
| 72 | Opal | Blue | 496 | 453.5 |
| 73 | Cream | White | 303 | 277.1 |
| 74 | Cream | White | 389 | 355.7 |
| 75 | Coriander | White | 415 | 379.5 |
| 76 | Mist | White | 352 | 321.9 |
| 77 | Verde | Green | 154 | 140.8 |
| 78 | Toffee | Brown | 160 | 146.3 |
| 79 | Blush | Red | 465 | 425.2 |
| 80 | Opal | Blue | 235 | 214.9 |
| 81 | Daffodil | Yellow | 248 | 226.8 |
| 82 | Mist | White | 339 | 310.0 |
| 83 | Alfalfa | Green | 436 | 398.7 |
| 84 | Jade | Green | 194 | 177.4 |
| 85 | Ruby | Red | 309 | 282.5 |
| 86 | Verde | Green | 244 | 223.1 |
| 87 | Cream | White | 293 | 267.9 |
| 88 | Lynx | Brown | 229 | 209.4 |
| 89 | Chocolate | Brown | 311 | 284.4 |
| 90 | Regal | Purple | 319 | 291.7 |
| 91 | Sky | White | 275 | 251.5 |
| 92 | Regal | Purple | 390 | 356.6 |
| 93 | Opal | Blue | 242 | 221.3 |
| 94 | Lynx | Brown | 163 | 149.0 |
| 95 | Regal | Purple | 364 | 332.8 |
| 96 | Ash | Blue | 292 | 267.0 |
| 97 | Periwinkle | Purple | 204 | 186.5 |
| 98 | Black | Black | 449 | 410.6 |
| 99 | Black | Black | 281 | 256.9 |
| 100 | Blush | Red | 162 | 148.1 |
| 101 | Regal | Purple | 182 | 166.4 |
| 102 | Sapphire | Blue | 491 | 449.0 |
| 103 | Whirlpool | Blue | 212 | 193.9 |
| 104 | Chocolate | Brown | 375 | 342.9 |
| 105 | Whirlpool | Blue | 181 | 165.5 |
| 106 | Sapphire | Blue | 288 | 263.3 |
| 107 | Chocolate | Brown | 368 | 336.5 |
| 108 | Toffee | Brown | 485 | 443.5 |
| 109 | Sapphire | Blue | 475 | 434.3 |
| 110 | Whirlpool | Blue | 187 | 171.0 |
| 111 | Black | Black | 192 | 175.6 |
| 112 | Regal | Purple | 213 | 194.8 |
| 113 | Opal | Blue | 360 | 329.2 |
| 114 | Blush | Red | 336 | 307.2 |
| 115 | Coriander | White | 379 | 346.6 |
| 116 | Whirlpool | Blue | 411 | 375.8 |
| 117 | Ash | Blue | 217 | 198.4 |
| 118 | Blush | Red | 246 | 224.9 |
| 119 | Black | Black | 191 | 174.7 |
| 120 | Ruby | Red | 191 | 174.7 |
| 121 | Mist | White | 497 | 454.5 |
| 122 | Chocolate | Brown | 246 | 224.9 |
| 123 | Opal | Blue | 439 | 401.4 |
| 124 | Toffee | Brown | 334 | 305.4 |
| 125 | Ash | Brown | 427 | 390.4 |
Cleaning Data with Outlier
| 10 | On this worksheet, make an XY scatter plot linked to the following data: |
| X | Y |
| 4.27 | 5.978 |
| 7.14 | 8.568 |
| 6.74 | 8.458 |
| 8.58 | 9.154 |
| 9.03 | 11.06 |
| 6.69 | 33.87 |
| 7.76 | 8.088 |
| 5.3 | 5.3 |
| 10.51 | 12.612 |
| 5.52 | 6.072 |
| 8.9 | 9.35 |
| 1.08 | 1.08 |
| 5.65 | 6.78 |
| 9.06 | 10.308 |
| 9.46 | 11.352 |
| 8.07 | 10.298 |
| 10.34 | 12.578 |
| 10.15 | 11.255 |
| 7.61 | 8.371 |
| 3.35 | 4.365 |
| 3.82 | 5.876 |
| 1.05 | 1.47 |
| 4.85 | 4.85 |
| 6.48 | 7.664 |
| 10.5 | 10.5 |
| 11 | Add trendline and regression equation to the plot. |
| 12 | The scatterplot reveals a point outside the point pattern. Copy the data to a new location in the worksheet. You now have 2 sets of data. |
| Data that are more tha 1.5 IQR below Q1 or more than 1.5 IQR above Q3 are considered outliers and must be investigated. | |
| It was determined that the outlying point resulted from data entry error. Remove the outlier in the copy of the data. | |
| 13 | Make a new scatterplot linked to the cleaned data without the outlier, and add trendline and regression equation label. |
| 14 | Compare the regression equations of the two plots. How did removal of the outlier affect the slope and R2? |
Learning
| 7/8/17 18:19 | |
| Highlight the correct answer or answers (#17 & 19) for each of the following questions: | |
| 15 | The correlation R measures the strength of the linear association of variables Y and X, and does not have a unit of measure, e.g. feet, acres, pounds, seconds. |
| ● True | |
| ● False | |
| 16 | Based on the correlation computed in tab "Excel Competencies", does Time tends to increase with Age? |
| ● True | |
| ● False | |
| 17 | The strength of the linear relationship between Age and the Time is |
| ● Weak | |
| ● Moderate | |
| ● Strong | |
| 18 | Highlight the 4 correct statements. Try not to mix up explanatory and response with dependent and independent. |
| ● X denotes the independent or response variable | |
| ● X denotes the independent or explanatory variable | |
| ● Y denotes the dependent or explanatory variable | |
| ● Y denotes the dependent or response variable | |
| ● x denotes an observed value of the independent variable | |
| ● x denotes an observed value of the dependent variable | |
| ● denotes the mean value of observations of the response variable | |
| 19 | The best fitting line minimizes the vertical distances from the points to the line. Hence, |
| the Y coordinate of a point on the best fitting line provides an estimate or prediction of Y at the value of the corresponding X coordinate. | |
| This process is called regression (to move backward) because | |
| ● The estimate of Y will be closer to the mean in standard deviations than X is. | |
| ● The estimate of Y will be farther from the mean in standard deviations than X is. | |
| 20 | Based on the data in "Excel Competencies", can Y be predicted for a person who is 80? |
| ● No | |
| ● Yes | |
| 21 | The "intercept" and "slope" completely define the best fitting line. |
| The intercept is the vertical distance from the origin (where the X and Y axes intersect) up or down to the line. It sets the elevation of the line. | |
| ● TRUE | |
| ● FALSE | |
| 22 | As a positively signed slope increases |
| ● The whole line moves up without rotating | |
| ● The whole line moves down without rotating | |
| ● The line rotates clockwise becoming less steep | |
| ● The line rotates counterclockwise becoming more steep | |
| 23 | Based on the regression output in "Excel Competencies", when Age increases by 1 year, Time decreases by |
| ● 0.97 | |
| ● 32.10 | |
| ● 750.02 | |
| ● 8.3966 | |
| 24 | R2 measures the fit of the line to the points. As R2 increases |
| ● The scatter about the line increases and the amount of the variation of Y explained by X decreases | |
| ● The scatter about the line decreases and the amount of the variation of Y explained by X increases | |
| ● The scatter about the line increases and the amount of the variation of Y explained by X increases | |
| ● The scatter about the line decreases and the amount of the variation of Y explained by X decreases | |
| 25 | The Standard Error is a standard deviation measuring the scatter of the points about the regression line. |
| ● True | |
| ● False | |