Your final project entails systematic extraction of decision-aiding insights from a dataset (SampleDataSet.xlsx) provided to you in the Doc Sharing area. The goal of this project is to provide you with hands-on experience in conducting and interpreting different types of statistical analysis. The focus of your analysis will be on marketing strategies and analysis-related topics. At times, you will be expected to conduct additional research on topics that are not adequately covered in your text, for example, data due diligence.
In this section, you will conduct correlation and regression analyses using the provided SampleDataSet.xlsx.
• Correlation: Compute a correlation matrix that includes all continuous variables. Identify all individual correlations that are significant at the 95 percent level.
• Regression: Build a multiple regression model to explain the variability in the median school year. Describe the goodness of fit of your model and summarize your findings. Select at least four to seven similar independent variables from the remaining forty-nine measures and justify your selection.
Submit your response in Microsoft Excel.
Cite any sources using the APA format on a separate page.
Corrections to Assignment 4:
Instructions
For the correlation test, pick out 6 or 7 variables and then compute the correlation coefficients using the method in data analysis-excel. You need to make sure you describe how everything is calculated an document your results. Make sure you clean up the data first--no blanks, no zeros where there should be data, such as the age data. If you want to include variables such as Married vs single, then you will have to assign values to these, such as married=1 and single =0. The t-test is described below. Make sure median schooling is one of the variables. Do the correlation test between median schooling and the other 5 or 6 variables.
t test for no correlation
t = r * ((n-2)^.5)/(1 - r^2)^.5
r = sample correlation coefficient
n= sample size
Ho: rho = 0
H1: rho not=0
Set Alpha: critical region = .05
2 1/2% in each tail
In the regression analysis, again pick out the most significant independent variables to test their effect on the median schooling. Make sure you explain how regression is performed and explain all test stastics from the regression, including which variables are the most important in explaining the median education level. Again, make sure also that you clean up your data before doing the correlation analysis.
11 years ago