Data Analysis (Sociology and Statistics)

profileuamokz103

The spreadsheet contains data from the General Social Survey. The codebook lists the variables and explains what each means and how each is coded. You will focus on INCOME06 and CLASS. Note: 4 codes are used for different forms of missing values: IAP = inapplicable, DK = don’t know, NA = not available, and REFUSED. For CLASS, there’s also a code for NO CLASS. 

  • Create 2 tables to display the distributions of 2 variables: INCOME06 and CLASS. Since INCOME06 has a large number of categories (25, plus missing values), you should recode this variable into a smaller number of categories - 5-6 at most. Choose wisely - this will be graded on the reasonableness of your categorization scheme.

  • Create another table (a cross tab) to show the bivariate association between these 2 variables. Again, you will recode INCOME06 into a smaller number of categories (a maximum of 5-6).

  • In your report, describe the level of measurement for each variable (nominal, ordinal, interval, ratio). 

  • In your report, describe the central tendency of each variable. Be sure to use measures that are appropriate for each variable, given its level of measurement.

  • In your report, describe the association you observe between the two variables (the direction and your assessment of its strength).

  • 10 years ago
  • 20
Answer(1)

Purchase the answer to view it

blurred-text
  • attachment
    solution.docx
  • attachment
    solution.xlsx