Descriptive Analysis Plan
University of Maryland University College
STAT200 - Assignment #1: Descriptive Statistics Data Analysis Plan
Student (Full Name):
Class:
Instructor:
Date:
Scenario: Please write a few lines describing your scenario and the four variables (in addition to income) you have selected.
Use Table 1 to report the variables selected for this assignment. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.
Table 1. Variables Selected for the Analysis
|
Variable Name in the Data Set |
Description (See the data dictionary for describing the variables.) |
Type of Variable (Qualitative or Quantitative) |
|
Variable 1: “Income”
|
Annual household income in USD. |
Quantitative |
|
Variable 2: |
|
|
|
Variable 3: |
|
|
|
Variable 4: |
|
|
|
Variable 5: |
|
|
Reason(s) for Selecting the Variables and Expected Outcome(s):
1. 1. Variable 1: “Income” -
2. 2. Variable 2: “ “ -
3. 3. Variable 3: “ “ -
4. 4. Variable 4: “ “ -
5. 5. Variable 5: “ “ -
Data Set Description:
Proposed Data Analysis:
Measures of Central Tendency and Dispersion
Complete Table 2. Numerical Summaries of the Selected Variables and briefly explain why you choose those measurements. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.
Table 2. Numerical Summaries of the Selected Variables
|
Variable Name |
Measures of Central Tendency and Dispersion |
Rationale for Why Appropriate |
|
Variable 1: “Income”
|
· ● Number of Observations · ● Median · ● Sample Standard Deviation |
I am using median for two reasons: 1. 1. If there are any outliers or the data is not normally distributed, the median is the best measure of central tendency. 2. 2. The variable is quantitative.
I am using sample standard deviation for three reasons: 1. 1. The data is a sample from a larger data set. 2. 2. It is the most commonly used measure of dispersion. 3. 3. The variable is quantitative.
|
|
Variable 2: |
|
|
|
Variable 3: |
|
|
|
Variable 4: |
|
|
|
Variable 5: |
|
|
Graphs and/or Tables
Complete Table 3. Type of Graphs and/or Table for Selected Variables and briefly explain why you choose those graphs and/or tables. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.
Table 3. Type of Graphs and/or Tables for Selected Variables
|
Variable Name |
Graph and/or Table |
Rationale for why Appropriate? |
|
Variable 1: “Income”
|
Graph: I will use the histogram to show the normal distribution of data.
|
Histogram is one of the best plot to show the normal distribution of quantitative level data . |
|
Variable 2: |
|
|
|
Variable 3: |
|
|
|
Variable 4: |
|
|
|
Variable 5: |
|
|