For Math guru only

Aziz12m

Excelfiles9-9-171.rar

Home >Mathematics homework help >Statistics homework help >For Math guru only

Copy of Student_Assignment_File.11.01.2016.xlsx

Data

Salary

Compa-ratio

Midpoint

Age

Performance Rating

Service

Gender

Raise

Degree

Gender1

Grade

Copy Employee Data set to this page.

The ongoing question that the weekly assignments will focus on is: Are males and females paid the same for equal work (under the Equal Pay Act)?

Note: to simplfy the analysis, we will assume that jobs within each grade comprise equal work.

The column labels in the table mean:

ID – Employee sample number

Salary – Salary in thousands

Age – Age in years

Performance Rating – Appraisal rating (Employee evaluation score)

SERvice – Years of service

Gender: 0 = male, 1 = female

Midpoint – salary grade midpoint

Raise – percent of last raise

Grade – job/pay grade

Degree (0= BS\BA 1 = MS)

Gender1 (Male or Female)

Compa-ratio - salary divided by midpoint

Week 2

This assignment covers the material presented in weeks 1 and 2.

Six Questions

Before starting this assignment, make sure the the assignment data from the Employee Salary Data Set file is copied over to this Assignment file.

You can do this either by a copy and paste of all the columns or by opening the data file, right clicking on the Data tab, selecting Move or Copy, and copying the entire sheet to this file

(Weekly Assignment Sheet or whatever you are calling your master assignment file).

It is highly recommended that you copy the data columns (with labels) and paste them to the right so that whatever you do will not disrupt the original data values and relationships.

To Ensure full credit for each question, you need to show how you got your results. For example, Question 1 asks for several data values. If you obtain them using descriptive statistics,

then the cells should have an "=XX" formula in them, where XX is the column and row number showing the value in the descriptive statistics table. If you choose to generate each

value using fxfunctions, then each function should be located in the cell and the location of the data values should be shown.

So, Cell D31 - as an example - shoud contain something like "=T6" or "=average(T2:T26)". Having only a numerical value will not earn full credit.

The reason for this is to allow instructors to provide feedback on Excel tools if the answers are not correct - we need to see how the results were obtained.

In starting the analysis on a research question, we focus on overall descriptive statistics and seeing if differences exist. Probing into reasons and mitigating factors is a follow-up activity.

The first step in analyzing data sets is to find some summary descriptive statistics for key variables. Since the assignment problems will

focus mostly on the compa-ratios, we need to find the mean, standard deviations, and range for our groups: Males, Females, and Overall.

Sorting the compa-ratios into male and females will require you copy and paste the Compa-ratio and Gender1 columns, and then sort on Gender1.

The values for age, performance rating, and service are provided for you for future use, and - if desired - to test your approach to the compa-ratio answers

(see if you can replicate the values).

You can use either the Data Analysis Descriptive Statistics tool or the Fx =average and =stdev functions.

The range can be found using the difference between the =max and =min functions with Fx functions or from Descriptive Statistics.

Suggestion: Copy and paste the compa-ratio data to the right (Column T) and gender data in column U.

If you use Descriptive statistics, Place the output table in row 1 of a column to the right.

If you did not use Descriptive Statistics, make sure your cells show the location of the data (Example: =average(T2:T51)

Compa-ratio

Age

Perf. Rat.

Service

Overall

Mean

35.7

85.9

9.0

Standard Deviation

8.2513

11.4147

5.7177

Note - remember the data is a sample from the larger company population

Range

Female

Mean

32.5

84.2

7.9

Standard Deviation

6.9

13.6

4.9

Range

26.0

45.0

18.0

Male

Mean

38.9

87.6

10.0

Standard Deviation

8.4

8.7

6.4

Range

28.0

30.0

21.0

A key issue in comparing data sets is to see if they are distributed/shaped the same. At this point we can do this

by looking at the probabilities that males and females are distributed in the same way for a grade levels.

Empirical Probability: What is the probability for a:

Probability

a. Randomly selected person being in grade E or above?

b. Randomly selected person being a male in grade E or above?

c. Randomly selected male being in grade E or above?

d. Why are the results different?

Normal Curve based probability: For each group (overall, females, males), what are the values for each question below?:

Make sure your answer cells show the Excel function and cell location of the data used.

The probability of being in the top 1/3 of the compa-ratio distribution.

Note, we can find the cutoff value for the top 1/3 using the fx Large function: =large(range, value).

Value is the number that identifies the x-largest value. For the top 1/3 value would be the value that starts the top 1/3 of the range,

For the overall group, this would be the 50/3 or 17th (rounded), for the gender groups, it would be the 25/3 = 8th (rounded) value.

Overall

Female

Male

All of the functions below are in the fx statistical list.

How nany salaries are in the top 1/3 (rounded to nearest whole number) for each group?

Use the "=ROUND" function (found in Math or All list)

What Compa-ratio value starts the top 1/3 of the range for each group?

Use the "=LARGE" function

iii

What is the z-score for this value?

Use Excel's STANDARDIZE function

iv.

What is the normal curve probability of exceeding this score?

Use "=1-NORM.S.DIST" function

How do you interpret the relationship between the data sets? What does this suggest about our equal pay for equal work question?

Based on our sample data set, can the male and female compa-ratios in the population be equal to each other?

First, we need to determine if these two groups have equal variances, in order to decide which t-test to use.

What is the data input ranged used for this question:

Step 1:

Ho:

Ha:

Step 2:

Decision Rule:

Step 3:

Statistical test:

Why?

Step 4:

Conduct the test - place cell B77 in the output location box.

Step 5:

Conclusion and Interpretation

What is the p-value:

Is the P-value < 0.05 (for a one tail test) or 0.025 (for a two tail test)?

What is your decision: REJ or NOT reject the null?

What does this result say about our question of variance equality?

Are male and female average compa-ratios equal?

(Regardless of the outcome of the above F-test, assume equal variances for this test.)

What is the data input ranged used for this question:

Step 1:

Ho:

Ha:

Step 2:

Decision Rule:

Step 3:

Statistical test:

Why?

Step 4:

Conduct the test - place cell B109 in the output location box.

Step 5:

Conclusion and Interpretation

What is the p-value:

Is the P-value < 0.05 (for a one tail test) or 0.025 (for a two tail test)?

What is your decision: REJ or NOT reject the null?

What does your decision on rejecting the null hypothesis mean?

If the null hypothesis was rejected, calculate the effect size value:

If the effect size was calculated, what doe the result mean in terms of why the null hypothesis was rejected?

What does the result of this test tell us about our question on salary equality?

Is the Female average compa-ratio equal to or less than the midpoint value of 1.00?

This question is the same as: Does the company, pay its females - on average - at or below the grade midpoint (which is considered the market rate)?

Suggestion: Use the data column T to the right for your null hypothesis value.

What is the data input ranged used for this question:

Step 1:

Ho:

Ha:

Step 2:

Decision Rule:

Step 3:

Statistical test:

Why?

Step 4:

Conduct the test - place cell B162 in the output location box.

Step 5:

Conclusion and Interpretation

What is the p-value:

Is the P-value < 0.05 (for a one tail test) or 0.025 (for a two tail test)?

What, besides the p-value, needs to be considered with a one tail test?

Decision: Reject or do not reject Ho?

What does your decision on rejecting the null hypothesis mean?

If the null hypothesis was rejected, calculate the effect size value:

If the effect size was calculated, what doe the result mean in terms of why the null hypothesis was rejected?

What does the result of this test tell us about our question on salary equality?

Considering both the salary information in the lectures and your compa-ratio information, what conclusions can you reach about equal pay for equal work?

Why - what statistical results support this conclusion?

Week 3

ANOVA

Three Questions

Remember to show how you got your results in the appropriate cells. For questions using functions, show the input range when asked.

Group name:

One interesting question is are the average compa-ratios equal across salary ranges of 10K each.

Salary Intervals:

22-29

30-39

40-49

50-59

60-69

70-79

While compa-ratios remove the impact of grade on salaries, are they different for different pay levels,

Compa-ratio values:

that is are people at different levels paid differently relative to the midpoint? (Put data values at right.)

What is the data input ranged used for this question:

Step 1:

Ho:

Ha:

Step 2:

Decision Rule:

Step 3:

Statistical test:

Why?

Step 4:

Conduct the test - place cell b16 in the output location box.

Step 5:

Conclusions and Interpretation

What is the p-value?

Is P-value < 0.05?

What is your decision: REJ or NOT reject the null?

If the null hypothesis was rejected, what is the effect size value (eta squared)?

If calculated, what does the effect size value tell us about why the null hypothesis was rejected?

What does that decision mean in terms of our equal pay question?

If the null hypothesis in question 1 was rejected, which pairs of means differ?

Why?

Groups Compared

Diff

+/- Term

Low

High

Difference Significant?

Why?

G1 G2

G1 G3

G1 G4

G1 G5

G1 G6

G2 G3

G2 G4

G2 G5

G2 G6

G3 G4

G3 G5

G3 G6

G4 G5

G4 G6

G5 G6

Since compa is already a measure of pay for equal work, do these results impact

your conclusion on equal pay for equal work? Why or why not?

Week 4

Regression and Corellation

Five Questions

Compa-ratio

Midpoint

Age

Performance Rating

Service

Raise

Degree

Gender

Remember to show how you got your results in the appropriate cells. For questions using functions, show the input range when asked.

Create a correlation table using Compa-ratio and the other interval level variables, except for Salary.

Suggestion, place data in columns T - Y.

What range was placed in the Correlation input range box:

Place C9 in output box.

What are the statistically significant correlations related to Compa-ratio?

T =

Significant r =

Are there any surprises - correlations you though would be significant and are not, or non significant correlations you thought would be?

Why does or does not this information help answer our equal pay question?

Perform a regression analysis using compa as the dependent variable and the variables used in Q1 along with

including the dummy variables. Show the result, and interpret your findings by answering the following questions.

Suggestion: Place the dummy variables values to the right of column Y.

What range was placed in the Regression input range box:

Note: be sure to include the appropriate hypothesis statements.

Regression hypotheses

Ho:

Ha:

Coefficient hyhpotheses (one to stand for all the separate variables)

Ho:

Ha:

Place B36 in output box.

Interpretation:

For the Regression as a whole:

What is the value of the F statistic:

What is the p-value associated with this value:

Is the p-value < 0.05?

What is your decision: REJ or NOT reject the null?

What does this decision mean?

For each of the coefficients:

Midpoint

Age

Perf. Rat.

Service

Gender

Degree

What is the coefficient's p-value for each of the variables:

Is the p-value < 0.05?

Do you reject or not reject each null hypothesis:

What are the coefficients for the significant variables?

Using the intercept coefficient and only the significant variables, what is the equation?

Compa-ratio =

Is gender a significant factor in compa-ratio?

Regardless of statistical significance, who gets paid more with all other things being equal?

How do we know?

What does regression analysis show us about analyzing complex measures?

Between the lecture results and your results, what else would you like to know

before answering our question on equal pay? Why?

Between the lecture results and your results, what is your answer to the question

of equal pay for equal work for males and females? Why?

Randomized BUS308 Data - 2017.09.01-1.xlsm

Data

ID	Salary	Compa	Midpoint	Age	Performance Rating	Service	Gender	Raise	Degree	Gender1	Gr
1	61.6	1.081	57	34	85	8	0	5.7	0	M	E		The ongoing question that the weekly assignments will focus on is: Are males and females paid the same for equal work (under the Equal Pay Act)?
2	27.7	0.895	31	52	80	7	0	3.9	0	M	B		Note: to simplfy the analysis, we will assume that jobs within each grade comprise equal work.
3	35.1	1.132	31	30	75	5	1	3.6	1	F	B
4	58	1.018	57	42	100	16	0	5.5	1	M	E		The column labels in the table mean:
5	48.2	1.004	48	36	90	16	0	5.7	1	M	D		ID – Employee sample number			Salary – Salary in thousands
6	78.1	1.165	67	36	70	12	0	4.5	1	M	F		Age – Age in years			Performance Rating - Appraisal rating (employee evaluation score)
7	41.3	1.032	40	32	100	8	1	5.7	1	F	C		Service – Years of service (rounded)			Gender – 0 = male, 1 = female
8	22.1	0.962	23	32	90	9	1	5.8	1	F	A		Midpoint – salary grade midpoint			Raise – percent of last raise
9	73.9	1.103	67	49	100	10	0	4	1	M	F		Grade – job/pay grade			Degree (0= BS\BA 1 = MS)
10	23.1	1.003	23	30	80	7	1	4.7	1	F	A		Gender1 (Male or Female)			Compa - salary divided by midpoint
11	23.3	1.012	23	41	100	19	1	4.8	1	F	A
12	64.7	1.135	57	52	95	22	0	4.5	0	M	E
13	41.1	1.027	40	30	100	2	1	4.7	0	F	C
14	24	1.045	23	32	90	12	1	6	1	F	A
15	22.6	0.984	23	32	80	8	1	4.9	1	F	A
16	42.2	1.054	40	44	90	4	0	5.7	0	M	C
17	63.7	1.118	57	27	55	3	1	3	1	F	E
18	34.5	1.113	31	31	80	11	1	5.6	0	F	B
19	24.3	1.055	23	32	85	1	0	4.6	1	M	A
20	34.1	1.101	31	44	70	16	1	4.8	0	F	B
21	77.2	1.152	67	43	95	13	0	6.3	1	M	F
22	60.3	1.257	48	48	65	6	1	3.8	1	F	D
23	23.1	1.004	23	36	65	6	1	3.3	0	F	A
24	56.3	1.173	48	30	75	9	1	3.8	0	F	D
25	25	1.087	23	41	70	4	0	4	0	M	A	1.06848	0.0788733795	43.42	19.2012152393
26	23.5	1.020	23	22	95	2	1	6.2	0	F	A	1.06132	0.0841312071	46.8	19.6160436038
27	46.9	1.172	40	35	80	7	0	3.9	1	M	C				.
28	78.3	1.169	67	44	95	9	1	4.4	0	F	F
29	76.3	1.139	67	52	95	5	0	5.4	0	M	F
30	49.3	1.027	48	45	90	18	0	4.3	0	M	D
31	23.6	1.028	23	29	60	4	1	3.9	1	F	A
32	26.5	0.855	31	25	95	4	0	5.6	0	M	B		Compa	A	B	C	D	E	F
33	57.5	1.008	57	35	90	9	0	5.5	1	M	E		F mean	1.0141666667	1.1205	1.0375	1.1243333333	1.175	1.134
34	28.6	0.923	31	26	80	2	0	4.9	1	M	B		m	1.0573333333	0.892	1.0833333333	0.9995	1.0818	1.12275
35	22.6	0.984	23	23	90	4	1	5.3	0	F	A
36	23.6	1.026	23	27	75	3	1	4.3	0	F	A		F Stdev	0.0337876614	0.0311608729	0.0176776695	0.0751620472	0.0494974747	0.0212132034
37	23	0.999	23	22	95	2	1	6.2	0	F	A		m	0.0248260616	0.0190525589	0.0877971146	0.028991378	0.0607029196	0.0332603367
38	58.8	1.032	57	45	95	11	0	4.5	0	M	E
39	33.9	1.094	31	27	90	6	1	5.5	0	F	B
40	23.8	1.034	23	24	90	2	0	6.3	0	M	A
41	45.8	1.144	40	25	80	5	0	4.3	0	M	C
42	24.2	1.051	23	32	100	8	1	5.7	1	F	A
43	75.6	1.128	67	42	95	20	1	5.5	0	F	F
44	61.8	1.085	57	45	90	16	0	5.2	1	M	E
45	56.9	1.185	48	36	95	8	1	5.2	1	F	D
46	60.2	1.057	57	39	75	20	0	3.9	1	M	E
47	57.2	1.003	57	37	95	5	0	5.5	1	M	E
48	69.5	1.219	57	34	90	11	1	5.3	1	F	E
49	63	1.105	57	41	95	21	0	6.6	0	M	E
50	59.6	1.046	57	38	80	12	0	4.6	0	M	E

Sheet1

Sal	Compa	G	Mid	Age	EES	SR	G	Raise	Deg	SUMMARY OUTPUT									SUMMARY OUTPUT
24	1.045	1	23	32	90	9	1	5.8	1
24.2	1.053	1	23	30	80	7	1	4.7	1	Regression Statistics									Regression Statistics
23.4	1.018	1	23	41	100	19	1	4.8	1	Multiple R	0.7050179484								Multiple R	0.9931286935
23.4	1.017	1	23	32	90	12	1	6	1	R Square	0.4970503076								R Square	0.9863046018
22.6	0.983	1	23	32	80	8	1	4.9	1	Adjusted R Square	0.4132253589								Adjusted R Square	0.9840220355
22.9	0.995	1	23	36	65	6	1	3.3	0	Standard Error	0.0561252686								Standard Error	2.4352822665
23.1	1.003	1	23	22	95	2	1	6.2	0	Observations	50								Observations	50
23.3	1.011	1	23	29	60	4	1	3.9	1
22.7	0.985	1	23	23	90	4	1	5.3	0	ANOVA									ANOVA
23.5	1.023	1	23	27	75	3	1	4.3	0		df	SS	MS	F	Significance F					df	SS	MS	F	Significance F
23	1.002	1	23	22	95	2	1	6.2	0	Regression	7	0.1307500775	0.0186785825	5.9296225662	0.0000782906				Regression	7	17938.424611863	2562.632087409	432.1033638177	5.29906273684337E-37
24	1.042	1	23	32	100	8	1	5.7	1	Residual	42	0.1323019225	0.0031500458						Residual	42	249.085188137	5.9305997175
35.5	1.145	1	31	30	75	5	1	3.6	1	Total	49	0.263052							Total	49	18187.5098
34.7	1.119	1	31	31	80	11	1	5.6	0
35.5	1.146	1	31	44	70	16	1	4.8	0		Coefficients	Standard Error	t Stat	P-value	Lower 95%	Upper 95%	Lower 95.0%	Upper 95.0%		Coefficients	Standard Error	t Stat	P-value	Lower 95%	Upper 95%	Lower 95.0%	Upper 95.0%
35.2	1.136	1	31	27	90	6	1	5.5	0	Intercept	0.9486238772	0.0817167716	11.6086803119	0	0.7837127557	1.1135349987	0.7837127557	1.1135349987	Intercept	-4.8714544587	3.54570071	-1.3739045839	0.1767599037	-12.0269681853	2.2840592678	-12.0269681853	2.2840592678
40.4	1.01	1	40	32	100	8	1	5.7	1	Mid	0.0034995027	0.0006492568	5.3900133356	0.0000029767	0.0021892495	0.0048097559	0.0021892495	0.0048097559	Mid	1.2284155048	0.0281713308	43.6051641629	1.32019333894083E-36	1.1715634576	1.2852675521	1.1715634576	1.2852675521
42.7	1.068	1	40	30	100	2	1	4.7	0	Age	0.0005527738	0.0014459446	0.3822925256	0.7041721007	-0.0023652605	0.0034708081	-0.0023652605	0.0034708081	Age	0.0368279425	0.0627397124	0.5869957178	0.5603489282	-0.0897859231	0.1634418081	-0.0897859231	0.1634418081
53.4	1.112	1	48	48	65	6	1	3.8	1	EES	-0.0018462553	0.0010252155	-1.8008461371	0.0789105539	-0.0039152239	0.0002227133	-0.0039152239	0.0002227133	EES	-0.0821579785	0.0444842245	-1.8469014451	0.0718147225	-0.171930778	0.007614821	-0.171930778	0.007614821
51.5	1.072	1	48	30	75	9	1	3.8	0	SR	-0.0004182288	0.0018278101	-0.2288141345	0.820123898	-0.004106899	0.0032704414	-0.004106899	0.0032704414	SR	-0.0778484529	0.079308905	-0.9815852701	0.3319249969	-0.2379003029	0.0822033971	-0.2379003029	0.0822033971
49.8	1.037	1	48	36	95	8	1	5.2	1	G	0.0646649961	0.0183396697	3.5259629624	0.001034866	0.0276540443	0.101675948	0.0276540443	0.101675948	G	2.9145083112	0.7957605113	3.6625445343	0.000693549	1.3085985836	4.5204180389	1.3085985836	4.5204180389
68.3	1.198	1	57	27	55	3	1	3	1	Raise	0.0146549564	0.0139088976	1.0536389608	0.2980722322	-0.0134143354	0.0427242483	-0.0134143354	0.0427242483	Raise	0.6763294824	0.6035087689	1.1206622295	0.2687988764	-0.5416005215	1.8942594864	-0.5416005215	1.8942594864
65.4	1.148	1	57	34	90	11	1	5.3	1	Deg	0.0014675988	0.0161098249	0.0910996125	0.9278465471	-0.0310433441	0.0339785418	-0.0310433441	0.0339785418	Deg	0.0345044482	0.6990072742	0.0493620731	0.9608647532	-1.3761493419	1.4451582383	-1.3761493419	1.4451582383
78.4	1.17	1	67	44	95	9	1	4.4	0
75.9	1.133	1	67	42	95	20	1	5.5	0
24	1.044	0	23	32	85	1	0	4.6	1
23.3	1.012	0	23	41	70	4	0	4	0
24.1	1.049	0	23	24	90	2	0	6.3	0
27.5	0.887	0	31	52	80	7	0	3.9	0	t-Test: Two-Sample Assuming Equal Variances
27.1	0.875	0	31	25	95	4	0	5.6	0
27.7	0.895	0	31	26	80	2	0	4.9	1		Variable 1	Variable 2
40.8	1.019	0	40	44	90	4	0	5.7	0	Mean	1.06684	1.04836
43.9	1.097	0	40	35	80	7	0	3.9	1	Variance	0.00430164	0.00648099
41	1.025	0	40	25	80	5	0	4.3	0	Observations	25	25
48.7	1.014	0	48	36	90	16	0	5.7	1	Pooled Variance	0.005391315
49.4	1.029	0	48	45	90	18	0	4.3	0	Hypothesized Mean Difference	0
64.4	1.13	0	57	34	85	8	0	5.7	0	df	48
64.5	1.132	0	57	42	100	16	0	5.5	1	t Stat	0.8898352784
58.9	1.033	0	57	52	95	22	0	4.5	0	P(T<=t) one-tail	0.188996287
57.9	1.016	0	57	35	90	9	0	5.5	1	t Critical one-tail	1.6772241961
59	1.035	0	57	45	95	11	0	4.5	0	P(T<=t) two-tail	0.3779925741
63.3	1.111	0	57	45	90	16	0	5.2	1	t Critical two-tail	2.0106347576
56.8	0.996	0	57	39	75	20	0	3.9	1
58	1.017	0	57	37	95	5	0	5.5	1
62.4	1.094	0	57	41	95	21	0	6.6	0
63.8	1.12	0	57	38	80	12	0	4.6	0
79	1.179	0	67	36	70	12	0	4.5	1
77	1.149	0	67	49	100	10	0	4	1
74.8	1.116	0	67	43	95	13	0	6.3	1
76	1.135	0	67	52	95	5	0	5.4	0