Statistics HW

profilenehneh1
StatHW.docx

Question 1

Suppose that we have the following data on two paired samples.  Are differences statistically significant?  What is your t-value?  

T_1

T_2

92

100

102

104

80

86

96

96

92

94

90

90

84

88

102

98

98

102

86

88

Yes, 2.25

No, 3.25

Yes, 3.25

No, 1.25

Question 2

What does a multiple linear regression analysis examine?

The relationship between more than one dependent and only one independent variable  

The relationship between one or more than one dependent and only one independent variable      

The relationship between one dependent and more than one independent variables 

The relationship between more than one independent variables.

Question 3

What is the correlation for variables X and Y?

X

Y

23

4

10

3

16

5

18

7

14

6

31

12

19

6

0.80

0.79

0.91

0.5

Question 4

What does a multiple linear regression analysis examine?

The relationship between more than one dependent and only one independent variable  

The relationship between one or more than one dependent and only one independent variable      

The relationship between one dependent and more than one independent variables 

The relationship between more than one independent variables.

Question 5

What is the fundamental difference between quantitative data and qualitative data?

Quantitative data are comprised of numbers known in advance, whereas qualitative data do not contain numbers.

Qualitative data are harder to obtain than quantitative data.

Quantitative data contain variables for which the values are known in advance; however, this is not necessarily the case with qualitative data.

Qualitative data are comprised of information that must be coded by researchers, whereas this is not the case with quantitative data.

Question 6

Which of the following criteria cannot be used to determine the number of factors in an EFA?

Asking a group of researchers before the analysis

Eigenvalue rule

Scree test

Parallel analysis

Question 7

If you estimate a regression model comprised of 3 nominal variables with 2, 3, and 4 categories, respectively, how many "categorical" variables should be included in your estimation equation?

3

5

7

6

Question 8

What are some tools that we can use to determine how many segments should be retained after conducting a cluster analysis?

Little omega

The dendrogram

The variance ratio criterion (VRC)

All options provided are valid.

Question 9

Use the results from the factor analysis (pfc) below to determine the percentage of variance explained by the factor solution given the analyst retains all factors with eigenvalues greater than one.

50.29

100

86.73

97.81

Question 10

Use the data below to solve for the effect of education (Educ) on wages (Wage) holding all else constant.

Wage Educ Exper

5.1 8 21

4.95 9 42

6.67 12 1

4 12 4

7.5 12 17

13.07 13 9

4.45 10 27

19.47 12 9

13.28 16 11

8.75 12 9

11.35 12 17

11.5 12 19

6.5 8 27

6.25 9 30

19.98 9 29

7.3 12 37

8 7 44

22.2 12 26

3.65 11 16

20.55 12 33

The effect is statistically insignificant at the 10 percent level of significance.

0.91

1.48

0.95

Question 11

What makes the interpretation of conditional effects extra challenging in logistic regression?

It is not possible to model interaction effects in logistic regression

The results has to be raised by its natural logarithm

The conditional effect is dependent on the values of all X-variables

The maximum likelihood estimation makes the results unstable

Question 12

Suppose you run the follow factor analysis (pcf).  How many factors should be retained in the model.

1

2

3

4

Question 13

What does the least squares method do exactly?

Minimizes the distance between the data points        

Finds the least problematic regression line

Finds those (best) values of the intercept and slope that provide us with the smallest value of the residual sum of squares  

Finds those (best) values of the intercept and slope that provide us with the smallest value of the sum of residuals

Question 14

How do we interpret a dummy variable coefficient?

The difference between two means

the difference between two coefficients

The difference between two R-squared values

None of the provided options are valid

Question 15

What is the main difference between primary and secondary data?

Secondary data are data that have already been gathered.

Primary data are not stored in institutional databased; however, this is not the case with primary data.

Primary data are data that have already been gathered; whereas secondary data are gathered for a specific research project or task.

Secondary data are data that have already been gathered; whereas primary data are gathered for a specific research project or task.

Question 16

What is the standard deviation of a population comprised of the following values: 23, 10, 16, 18, 14, 31, and 19?

6.77

6.99

6.27

7

Question 17

What is the most severe type of missing data problem?

When data are missing completely at random.

If a data point is unrelated to the value of the variable under analysis, but depends on another variable.

When the probability that a data point is missing depends on the variable under analysis.

Missing data comprised of outliers do not pose a problem.

Question 18

Age

Subscribe

20

0

23

0

24

0

25

0

25

1

26

0

26

0

28

0

28

0

29

0

30

0

30

0

30

0

30

0

30

0

30

1

32

0

32

0

33

0

33

0

34

0

34

0

34

1

34

0

34

0

35

0

35

0

36

0

36

1

36

0

37

0

37

1

37

1

38

0

39

0

40

1

45

0

48

1

50

0

53

1

55

1

0.26

.023

0.128

0.021

Question 19

What are some procedures that we can use to validate a cluster solution?

We can assess the "stability" of a cluster solution. 

We can assess the degree to which a cluster solution results in differentiated segments.

We can use observable variables to "profile" a cluster solution.

All are valid techniques that can be used to validate a cluster solution.

Question 20

 Why is the number of dummy variables to be entered into the regression model always equal to the number of groups (g) minus 1 (g-1)?

To avoid model misspecification

To increase the R-squared value

To avoid the situation of perfect multicollinearity

The control for other variables in the model.

Question 21

Given the data below, what is the Euclidean distance between points E and F (remember, you can use the hypotenuse of a right triangle formed by points E and F in two-dimensional space)?

customer

price

loyalty

a

3

7

b

6

7

c

5

6

d

3

5

e

6

5

f

4

3

g

1

2

1.414

3.606

7.07

2.828

Question 22

For a given level of statistical significance, increasing the sample size will do what to the power of a statistical test.

decrease

the power of a statistical test will not change

increase 

It depends on the predetermined level of statistical significance.

Question 23

Suppose you run a factor analysis (pcf) to obtain the following variable loadings.  Are the results sufficient?  What should the analyst do next?

Yes, interpret the factors

No, implement an oblique factor rotation

No, estimate a new model

Yes, solve for communality as 1 - uniqueness 

Question 24

What will a factor loading in an orthogonal solution represent?  

partial correlation

correlation

multiple correlation

Eigenvalue

Question 25

What is the primary difference between a quasi-experiment and an experiment?

Experiments are comprised of randomly generated samples.

It is not possible to perform between-group analyses with quasi-experimental data.

Experiments are rare in the social sciences.

When conducting an experiment, researchers randomly assign units of analysis to treatment and control groups.

Question 26

The Logit models is estimated by way of?

Ordinary least squares

Poisson distribution

Negative binomial distribution

Maximum likelihood estimation

Question 27

Use the following sales data to determine whether mean sales varies from one generation to the next.  Are the differences statistically significant?  What is your calculated F-value?

Gen1

Gen2

Gen3

55

45

50

55

50

52

49

45

43

57

46

48

55

42

47

49

43

45

48

42

44

54

45

49

54

47

51

44

42

44

No, 12

No, 14

No, 11.92

Yes, 11.92