MA135_Applications_StudentSampleFinalProjectPart2docx.docx

MA135 Statistical Applications Final Project

Part 2

Name:

Example Student

Do countries with higher female education levels have lower infant mortality rates?

Title:

(Type your approved research question here)

Part 2: Summary Statistics & Statistical Analysis

Summary Statistics of the Independent Variable ()

Your part 1 must be approved before you begin part 2.

1. Data selection methodology

In statistics, how the sample is obtained for data analysis is important. Explain how you obtained or selected your data. Did you choose all the data because it is comprehensive of your data sets (example: all 50 states)? Did you to delete some values because it was missing? Was your data selected because it meets the criteria? What is your sample size?

The data is from two separate sources. The dependent variable (y), under-five mortality rate (per 1000 births) for each country in 2009 is from www.childmortality.org. The independent variable (x), is mean years in school for women 25 years and above for each country in 2009 is from www.healthmetricsandevaluation.org. I deleted countries with missing data values and included only 33 countries that had both the dependent and independent variable data values. The sample size of paired data is n= 33.

2. Independent Variable (

a. Identify the Independent Variable ():

Write the name that you used to label the independent variable in Excel

Mean years of schooling (females aged 25 years and above) in 2009

:

b. Describe and explain the independent variable in a complete sentence with units and years if applicable. Use the wording from the data source.

The independent variable is the mean years of education women (25 years of age or older) received in each country in 2009.

3. Measures of Center of the Independent Variable (𝑥)

a. You may paste the summary statics table from Excel below for the independent variable . Adjust the font size if necessary. ~ optional~

b. Describe and determine the mean, median, and midrange of the independent variable. Include units. Carry one more decimal place than is present in the original set of values. {Chapter 3}

Mean:

Median:

Midrange:

The middle value when the list is arranged in order.

8.4 years

Total of all values divided by n

7.66 years

Add the highest value plus the lowest value and divide that total by 2.

6.8 years

4. Measures of Variation of the Independent Variable (

a. Determine the range, standard deviation, and variance of the independent variable. Include units. Carry one more decimal place than is present in the original set of values. {Chapter 4}

Range:

Standard Deviation:

Variance:

12.40 years

3.40 years

11.53 years squared

5. Measures of Relative Standing of the Independent Variable ()

a. Identify the measures of the independent variable. Include units .

Minimum:

Median:

Maximum:

0.6 years

8.4 years

13.0 years

b. Determine the data point that has the minimum and maximum values in your independent variables column.

With Minimum Value:

With Maximum Value:

The United States has the maximum (13.0) mean years of schooling for women 25 years and above in 2009.

The country, Chad, has the minimum (0.6) mean years of schooling for women 25 years and above in 2009.

6. Analysis

Describe the independent variable in a short paragraph based on your analysis of the data. Bonus: Include graphs.

The range shows a substantial difference of 12.4 years for mean years of schooling for women age 25 and older between the countries with minimum and maximum values. 50% of the countries have mean years of schooling for women that is less than 8.4 years which implies that half of the countries have mean years of schooling for women below 9th grade high school. Countries that have mean years of schooling for women age 25 and older that is less than 0.86 years are considered unusual as they are more than two standard deviations below the mean. These nations have mean years of schooling below first grade in elementary school. The highest frequency for mean years of schooling for women age 25 and above in each country is between 9 and 10 years.

Chart, histogram  Description automatically generated

Summary Statistics of the Dependent Variable (y)

1. Dependent (response) Variable (y)

a. Identify the dependent variable (y):

Write the name that you used to label the dependent variable.

Under age five mortality (per 1000 births) in 2009

b. Describe and explain the dependent variable in a complete sentence with units and years if applicable. Use the wording from the data source (website).

The dependent variable is the number of deaths of children between birth and age five expressed per 1000 births in each country in 2009.

2. Measures of Center of the Dependent Variable (

a. You may paste the summary statics table below for the dependent variable . Adjust the font size if necessary. ~ optional~

b. Determine the mean, median, and midrange of the dependent variable. Include units. Carry one more decimal place than is present in the original set of values. {Chapter 3}

Mean:

Median:

Midrange:

23.7 deaths per 1000 children under 5

41.98 deaths per 1000 children under 5

95.0 deaths per 1000 children under 5

3. Measures of Variation of the Dependent Variable (

a. Determine the range, standard deviation, and variance of the dependent variable. Include units. Carry one more decimal place than is present in the original set of values. {Chapter 4}

Range:

Standard Deviation:

Variance:

Highest value – Lowest value = 187.38 – 2.6 = 184.8 deaths per 1000 children under 5

42.39 deaths per 1000 children under 5

1797.09 deaths per 1000 children under 5 squared

4. Measures of Relative Standing of the Dependent Variable (y)

a. Identify each value below. Include units.

Minimum

Median:

Maximum:

23.7 deaths per 1000 children under 5

2.6 deaths per 1000 children under 5

187.3 deaths per 1000 children under 5

b. Determine the data point that has the minimum and maximum values in your dependent variables column.

With Minimum Value:

With Maximum Value:

In 2009, Angola, a country in Africa, has the highest number of deaths per 1000 children under age 5.

The country, Luxembourg, in Europe, has the lowest number of deaths per 1000 children under age 5.

c. Separate the data for your dependent variable into two groups based on a designation of your independent variable – You may arrange the groups by location, age, alternating data, etc…per your choice. For example, Group one can be for any 15 values that correspond to the lower levels of the independent variable and Group two can be for the other 15 values that correspond to the higher levels of the independent variable. {Chapter 4}

Note: This work is from a different data set with fewer data items than 30. This is an example as if 12 countries were chosen from Europe and 8 from Asia, for example.

This is only an example of what to do with your data computationally.

Group One

Group Two

N = 12

N = 8

M = Mean = = = 15.75

M = Mean: = =12.5

SS = = ∑ ( = 70.22

SS = = ∑ = 78

5. Analysis

a. Describe the dependent variable in a short paragraph (at least three sentences) based on your analysis of the data. Bonus: Include graphs

The range of 184.8 indicates that there is a substantial gap in child mortality rates between the country with the lowest and highest child mortality rates per 1000 children. For the countries selected, the mean number of deaths per 1000 children under age 5 was 41.98 deaths. 50% of countries have child mortality rates greater than 23.7 (deaths per 1000 children under age 5). Countries that have a child mortality rate that is above 126.76 are considered unusual as they are more than two standard deviations above the mean. These nations have an unusually high number of deaths of children under age 5. The highest frequency for deaths of children under age 5 (per 1000 children) is between 0 and 10 deaths. This indicates that while it is evident that there is great disparity of health levels based on the wide range of the child mortality rate in the world, most countries have between 0 and 10 deaths per 100 children under age 5.

Chart, histogram  Description automatically generated

Hypothesis Testing

1. Hypothesis Test

Test the claim that there is a significant difference for your independent () variable between the two groups of the dependent () variable.

a. State the null and alternative hypothesis. {Chapter 8}

The null hypothesis ( ) : =

The alternative hypothesis ( ) :

b. Find the pooled variance for the two groups. {Chapter 10}

= = = = 8.23

c. Compute the standard error of the difference between the means of the two groups. {Chapter 10}

= =

= = = 1.31

d. Hypothesis Test Based on the t-Test using a significance level.

t = = = = 2.48

My decision about the null hypothesis ( ): Reject the null hypothesis. Because the test statistic is in the critical area.

e. Write a statement stating whether there is significant difference between the two groups as related to the independent and the dependent variables.

Conclusion about the claim: There is a difference between the two groups. The evidence supports the claim that the mean number of deaths for children under age 5 is significantly different in countries in Europe as compared to countries in Asia.

f. Write 2 or more paragraphs stating your conclusions, observations, analysis, and evaluation. Reflect on your work too.

For this project I was able to……

The observations show that….

It is noticeable that……

One analysis may surmise that women with more education probably have better knowledge of how to take care of an infant or child care. A woman with access to more education may decide to get married and have children later in their lives and that access to education and age maturity may in turn decrease the number of early childhood deaths. Furthermore, educated women would make better informed reproductive and healthcare decision.

An evaluation can indicate that the result of better infant care can reflect lower infant mortality rates.

This project was…

I was surprised that…

If I had to do this project again, I would probably…..

image30.png

image4.png

image40.png

image1.png

image10.png

image2.png

image20.png

image3.png