Using SPSS Syntax

profilehw929
CodingChallenge3s18.docx

Coding Challenge 3: Using SPSS Syntax (Yuk!)

Due April 27, 2018

The data for Data Analysis Assignment 4 were generated with the following SPSS syntax code:

* Encoding: UTF-8.

DATA LIST FIXED/

make (A17) price 19-23 mpg 25-26 rep78 28 hdroom 30-32 (F,1)

trunk 34-35 weight 37-40 length 42-44 turn 46-47

displ 49-51 gratio 53-56 (F,2) foreign 58.

BEGIN DATA.

AMC Concord 4099 22 3 2.5 11 2930 186 40 121 3.58 0

AMC Pacer 4749 17 3 3.0 11 3350 173 40 258 2.53 0

AMC Spirit 3799 22 . 3.0 12 2640 168 35 121 3.08 0

Audi 5000 9690 17 5 3.0 15 2830 189 37 131 3.20 1

Audi Fox 6295 23 3 2.5 11 2070 174 36 97 3.70 1

BMW 320i 9735 25 4 2.5 12 2650 177 34 121 3.64 1

Lines removed to save space

Toyota Corona 5719 18 5 2.0 11 2670 175 36 134 3.05 1

Volvo 260 11995 17 5 2.5 14 3170 193 37 163 2.98 1

VW Dasher 7140 23 4 2.5 12 2160 172 36 97 3.74 1

VW Diesel 5397 41 5 3.0 15 2040 155 35 90 3.78 1

VW Rabbit 4697 25 4 3.0 15 1930 155 35 89 3.78 1

VW Scirocco 6850 25 4 2.0 16 1990 156 36 97 3.78 1

END DATA.

SAVE OUTFILE = ‘\\Client\H$\Desktop\auto.sav'.

Instructions:

1. Generate a SPSS data file called “faithful.sav” using the Old Faithful data, found on the Week 9 Module on Canvas. Use the code above to guide you. It should be easier to generate the Old Faithful data than it was to generate the auto data. The data file must be generated using SPSS code like the above. Do not simply load a .csv file into SPSS.

2. Load the data “faithful.sav” into SPSS data editor. Take a screen shot.

3. Create a histogram of the Old Faithful waiting times in SPSS. Take a screen shot. Describe the histogram.

4. We have seen that the Central Limit Theorem is valid for unimodal and symmetric data sets even when the sample sizes are small (like 5 or 10). Will the CLT work for data like the Old Faithful waiting times? Create a simulation, using SPSS (either SPSS syntax, SPSS menus, or the Excel kluge discussed in class), to determine whether the CLT is valid for data like the Old Faithful waiting times. Use sample sizes of 5, 10, 30, 50, 70, and 100. Check not only histograms, but also the means and standard deviations. Pretend that the original Old Faithful data is the population (so its mean is the population mean and its SD is the population SD).

5. Deliverable: ONE word or PDF document containing the following

a. Code to create the data set

b. The screen shot of the SPSS Data View containing the Old Faithful data.

c. Histogram of the Old Faithful waiting times (appropriately labeled)

d. SPSS Syntax Code OR Instructions using SPSS menus to simulate the Central Limit Theorem. If you use Excel, upload your Excel file with your answer.

e. Histograms of the sample means of samples of size 5, 10, 30, 50, 70, and 100 – all on one page in a 3 row by 2 column format. All histograms should be appropriately labeled.

f. For each histogram, an explanation of the information displayed in the graph, including the numerical mean and SD for each histogram.

g. Two to three sentences explaining whether the CLT “works” for data like the Old Faithful data. I will not be picky about how many sentences you use. The point is BE BRIEF!

Points will be subtracted for submission of multiple files, except if you have an Excel file for the simulation.

Links for Coding Challenge Help:

Guide to simulations in SPSS: http://college.cengage.com/mathematics/brase/understandable_statistics/8e/shared/tech_guide/spss-07.pdf

SPSS Tutorial on Sampling Distributions: https://www.spss-tutorials.com/sampling-distribution-what-is-it/

Instructions from a different data set: http://www.csis.ysu.edu/~chang/project/PCLTsimul2292.pdf

SPSS Sampling Basics: https://www.spss-tutorials.com/spss-sampling-basics/

If you use something other than these four documents to help you, please reference the document. I would love to know what else is out there!