MIS major

Question 1

According to the instruction in class, which of the following is true:

· The correlation coefficient between two variables DOES vary greatly depending on their underlying UNITS of measurement (ex: using grams vs kilograms scales changes the correlation between weight and height, etc)

· When determining the correlation coefficient r as part of a regression analysis it DOES matter which variable is considered to be the dependent variable and which the independent

· both of the above are true

· None of the above are true

Question 2

The r (correlation) values range from -----.

· -1 to 1

· 0 to 1

· 0 to 100

· 0 to infinity

Question 3

The r-square values range from -----.

· -1 to 1

· 0 to 1

· 0 to 100

· 0 to infinity

Question 4

Assume you ran a bivariate regression model (remember bivariate means there is only one independent variable) in which # of cans of tuna fish consumed per week was the independent variable (IV) and level of hair silkiness was the dependent variable (DV); the hypothesis being eating more tuna is associated with more silky hair (because, logically, sea lions eat more fish and they're more silky feeling so why not humans, too).

The model resulted in a correlation r = .6, beta coefficient = 2.14, and a P-value = .004.

If you switched the independent and dependent variables and ran a new bivariate regression analysis (so cans of tuna = DV and hair silkiness = IV), which of the following would substantially change?

· The correlation “r” value

· The regression coefficient

· The P-value

· All of the above would substantially change

· None of the above would substantially change
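To see what happens when the two variables are switched, here is a minimal sketch in plain Python. The tuna and silkiness numbers are made-up illustrative data (an assumption, not values from the question), and `simple_regression` is a hypothetical helper written for this example. It computes r, the slope, and the slope's t statistic for a bivariate fit; the p-value is determined by that t statistic and the sample size.

```python
import math

def simple_regression(x, y):
    """Return (r, slope, t_statistic) for a bivariate least-squares fit of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    r = sxy / math.sqrt(sxx * syy)
    slope = sxy / sxx
    # t statistic of the slope; it depends only on r and n,
    # so the p-value derived from it does too
    t = r * math.sqrt((n - 2) / (1 - r ** 2))
    return r, slope, t

# Made-up illustrative data (assumption): cans of tuna per week, silkiness score
tuna = [0, 1, 2, 3, 4, 5, 6, 7]
silk = [3.0, 6.5, 4.0, 9.0, 8.5, 15.0, 11.0, 17.5]

r1, b1, t1 = simple_regression(tuna, silk)   # silkiness as DV
r2, b2, t2 = simple_regression(silk, tuna)   # tuna as DV

# Changing the units of a variable (here multiplying tuna by 1000) leaves r unchanged
r_scaled, _, _ = simple_regression([1000 * v for v in tuna], silk)

print(f"r:     {r1:.4f} vs {r2:.4f}")
print(f"slope: {b1:.4f} vs {b2:.4f}")
print(f"t:     {t1:.4f} vs {t2:.4f}")
```

In this sketch r and the t statistic come out the same in both directions, while the slope differs; the two slopes multiply to r squared.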

Question 5

A correlation of r = 0 indicates that

· X and Y don’t have a linear relationship

· X and Y are unrelated

· X and Y have a linear relationship

· None of the above

Question 6

According to the professor, if we find the P-value associated with an unstandardized regression coefficient = .05, then the associated 95% confidence interval:

· is the range of the effect across 95% of the population

· indicates we are 95% confident that the coefficient is accurate

· both of the above are true

· None of the above are true

Question 7

According to the instruction on statistical significance in class, the confidence level is the probability that a confidence interval will include the population parameter.

· True

· False

Question 8

Which of the following is true?

· If a difference in measurement is statistically significant, then it is also practically significant

· If a difference in measurement is practically significant, then it is also statistically significant

· both of the above are true

· None of the above are true

Question 9

According to the discussion on sample size and statistical significance (hint: think of the Excel sample size calculator), when calculating sample size

· You need to provide the estimated margin of error

· You need to provide the Z score (or the corresponding P- Value)

· both of the above are true

· None of the above are true

Question 10

According to the discussion on sample size and statistical significance (hint: think of the Excel sample size calculator), when the level of variance increases, the sample size needed to maintain a particular level of statistical significance

· Increase

· Decrease

· Is not affected (can remain the same)

· Can’t be determined
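The inputs to a sample size calculation can be sketched with the standard formula for estimating a mean, n = (z × σ / E)², where z is the Z score for the chosen confidence level, σ is the standard deviation, and E is the margin of error. This is an assumption about what such a calculator does, not a copy of the Excel calculator used in class; `required_sample_size` is a hypothetical helper name.

```python
import math
from statistics import NormalDist

def required_sample_size(std_dev, margin_of_error, confidence=0.95):
    """Sample size needed to estimate a mean within +/- margin_of_error."""
    # Two-sided Z score for the requested confidence level (about 1.96 for 95%)
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return math.ceil((z * std_dev / margin_of_error) ** 2)

# Illustrative inputs (assumptions): same margin of error, two levels of spread
n_low_variance = required_sample_size(std_dev=10, margin_of_error=2)
n_high_variance = required_sample_size(std_dev=20, margin_of_error=2)

print(n_low_variance, n_high_variance)
```

Under this formula, doubling the standard deviation roughly quadruples the required sample size.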

Question 11

As described in class, when it comes to parallel computation of regression analysis on very large datasets (in which the data is spread over many machines, analysis is run on each machine, and then averages are taken across all of the analyses), the resulting averages are good approximations for which of the following computations (assuming no extra data manipulation):

· regression coefficients

· 95% confidence intervals

· both of the above

· none of the above

Question 12

As described in class, when it comes to parallel computation of regression analysis on very large datasets (in which the data is spread over many machines, analysis is run on each machine, and then averages are taken across all of the analyses), the resulting P-value scores (assuming no extra data manipulation) are --------- what they would be if the analysis of the dataset was performed on a single supercomputer

· good approximations of

· larger than

· smaller than

Question 13

One of the advantages of very large datasets (big data) is that we no longer have to worry about multicollinearity among independent variables in regression analysis

· True

· False

Question 14

Multicollinearity is problematic in a typical sized dataset (ex: couple hundred observations and several variables) if high correlation (above .85 or .90) exists between:

· Two or more independent variables used in the multivariate regression model

· Any of the independent variables and the dependent variable used in the multivariate regression model

· both of the above

· none of the above

Question 15

Factor analysis can be used in which of the following?

· To identify underlying dimensions, or factors, that explain the correlations among a set of variables

· To identify a new smaller set of uncorrelated variables to replace the original set of correlated variables in subsequent multivariate analysis.

· To identify a smaller set of salient variables from a larger set for use in subsequent analysis.

· All of the above are correct circumstances.

Question 16

According to the professor, including all of the criterion variables related to the managerial decision under question (aka including every variable we are interested in that is in the database) in cluster formulation will IMPROVE the insight of cluster analysis.

· True

· False

Question 17

Which of the following is NOT one of the 3 requirements of market segmentation?

· Identifiable

· Reachable

· Sizeable

· Loyal

Question 18

The three benefits of market segmentation include

· Identifies opportunities for new product development

· Improves the strategic allocation of marketing resources

· both of the above

· none of the above

Question 19

Consumer Market Major Segment Bases include all of the following EXCEPT:

· Demographic

· Psychographic

· Teleographic

· Behavioral

Question 20

In the usage-based approach to market segmentation, every company has at least _______ segments:

· One

· Two

· Three

· Four

· Five

· Eight

Question 21

Cluster analysis does not classify variables as dependent or independent

· True

· False

Question 22

When it comes to linking methods in cluster analysis, which of these statements are true?

· each of the linkage methods can yield different results when used on the same dataset

· each linking method has its specific properties

· both of the above are true

· none of the above are true

Question 23

Which statement is NOT true about cluster analysis?

· Cluster analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the independent variables are interval in nature.

· Cluster analysis is also called classification analysis or numerical taxonomy.

· Groups or clusters are suggested by the data, not defined a priori.

· Objects in each cluster tend to be similar to each other and dissimilar to objects in the other clusters.

Question 24

When it comes to K-Means clustering, which of the following is true?

· we do not need to tell it how many clusters there are to begin with

· we can assess the best configuration by examining the dissimilarity coefficient in the K-means generated agglomeration schedule

· both of the above are true

· none of the above are true

Question 25

When it comes to K-Means clustering, which of the following is true?

· we can save the cluster membership

· using different combinations of variables can result in different cluster assignments for each observation

· both of the above are true

· none of the above are true

Question 26

When it comes to very large datasets (i.e., a form of big data) and cluster analysis, it was recommended in class to:

· just use hierarchical cluster analysis

· just use k-means cluster analysis

· use both hierarchical and k-means cluster analysis together

· never use cluster analysis

Question 27

Clustering can be performed using:

· Observable (directly measured) variables such as dollars spent on different products, etc

· Unobservable (inferred) variables measured on surveys such as attitudes and moods (ex: happiness, sadness, etc. on Likert scales)

· both of the above

· none of the above

Question 28

If you switch the dependent and independent variables in a regression analysis, which of the following will usually change:

· Regression Coefficient Value

· P-value

· both of the above change

· none of the above change

Question 29

If you switch the dependent and independent variables in a regression analysis, which of the following will usually change:

· Regression Coefficient standard error

· Correlation (r) score

· both of the above change

· none of the above change

Question 30

The 3 types of linkage in cluster analysis include:

· single linkage

· complete linkage

· both of the above are true

· none of the above are true

Question 31

If we have several hundred independent variables that might be related, which technique do we use to reduce the number of variables?

· Factor Analysis

· Cluster Analysis

· Compare Means Analysis

· Numerical Taxonomy

· Classification Analysis

Question 32

When running Factor Analysis, which of the following is recommended by the professor:

· We do not rotate the data

· We select a Promax rotation

· We select a Varimax rotation

· We select a Quadmax rotation

· We select an Expert rotation so SPSS will decide which rotation is best

Question 33

When running Factor Analysis, we select to suppress absolute values smaller than

· .01

· .05

· .1

· .4

· .95

For questions 34 to 37, assume that you ran a regression to predict which students are most likely to agree to dress up as a circus clown for $5 at a charity baseball game at the local elementary school. The results are:

Dependent Variable: Likelihood of Dressing Up as a Clown (Very Unlikely 1, 2, 3, 4, 5, 6, 7 Very Likely)

Variable	Coefficient	Standard Error of Coefficient
Intercept	2.1	.12
Blue hair color (vs all others)	1.3	.23
Age (in years)	.05	.01
Income (in $1000s)	-.5	.04
Rides a Moped to School	2.3	.07
Likes Watching Dr. Who	1.4	.89

Question 34

Is someone with red hair more or less likely to dress up as a clown than someone with blue hair?

· More likely

· Less likely

· Equal likelihood

· It is not possible to determine from the provided data

Question 35

Referring back to the regression coefficients from question 34....

Is the coefficient for “Age” statistically significant at the 95% confidence level?

· Yes

· No

· It can’t be determined from the provided information

Question 36

Referring back to the regression coefficients from question 34....

Is the coefficient for “Likes Watching Dr. Who” statistically significant at the 95% confidence level?

· Yes

· No

· It can’t be determined from the provided information
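One common way to check significance from a table like the one above is to divide each coefficient by its standard error and compare the ratio with a critical value. The sketch below assumes the sample is large enough that a critical value of about 1.96 applies for a two-sided test at the 95% level; the sample size is not given in the question, so that cutoff is an assumption.

```python
# Critical value for a two-sided test at the 95% level,
# assuming a large sample (the question does not state n)
Z_CRITICAL = 1.96

# (coefficient, standard error) pairs copied from the regression table
results = {
    "Intercept": (2.1, 0.12),
    "Blue hair color (vs all others)": (1.3, 0.23),
    "Age (in years)": (0.05, 0.01),
    "Income (in $1000s)": (-0.5, 0.04),
    "Rides a Moped to School": (2.3, 0.07),
    "Likes Watching Dr. Who": (1.4, 0.89),
}

# Ratio of each coefficient to its standard error
t_ratios = {name: coef / se for name, (coef, se) in results.items()}

# A coefficient is flagged when its ratio is farther from zero than the cutoff
significant = {name: abs(t) > Z_CRITICAL for name, t in t_ratios.items()}

for name, t in t_ratios.items():
    coef, se = results[name]
    low, high = coef - Z_CRITICAL * se, coef + Z_CRITICAL * se
    print(f"{name}: t = {t:.2f}, 95% CI = ({low:.2f}, {high:.2f}), "
          f"significant = {significant[name]}")
```

Equivalently, a coefficient is flagged when its approximate 95% confidence interval (coefficient ± 1.96 standard errors) does not include zero.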

Question 37

Referring back to the regression coefficients from question 34....

Who of the following is most likely to dress up as a clown at the charity?

· rides a moped, has red hair, 20 years old

· rides a bicycle, has blue hair, 20 years old

· likes watching Dr. Who, has blue hair, 20 years old

Question 38

When it comes to the exponential smoothing model in forecasting, which of the following are true:

· The model has received widespread acceptance among American business firms that employ sales forecasts for managerial planning and control

· It uses special weighted moving averages and a seasonal factor that is multiplied by the weighted moving average to calculate the forecast.

· both of the above are true

· none of the above are true
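For orientation, the weighting idea behind exponential smoothing can be sketched in its simplest form, which tracks only a single smoothed level and has no trend or seasonal factor. This is not the seasonal model the question describes; the function name, the sales figures, and the smoothing weight `alpha` are illustrative assumptions.

```python
def simple_exponential_smoothing(series, alpha):
    """Return the smoothed level after each observation.

    Each new level is a weighted average of the latest observation
    (weight alpha) and the previous level (weight 1 - alpha), so older
    observations receive exponentially smaller weights.
    """
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = [series[0]]  # start the level at the first observation
    for value in series[1:]:
        smoothed.append(alpha * value + (1 - alpha) * smoothed[-1])
    return smoothed

# Made-up sales history (assumption)
sales = [100, 110, 105, 120]
levels = simple_exponential_smoothing(sales, alpha=0.5)

# In the simple model, the forecast for the next period is the last level
next_forecast = levels[-1]
print(levels, next_forecast)
```

Seasonal variants of the model keep additional smoothed statistics alongside this level and combine them to produce the forecast.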

Question 39

Generally, the exponential smoothing model uses _________ smoothed statistics that are weighted.

· Zero

· One

· Two

· Three

· Four

· Five

Question 40

One of the most widely used techniques for short-term forecasting is the autoregressive integrated moving average (ARIMA) model associated with G. E. P. Box and G. M. Jenkins

· True

· False

Question 41

Which of the following is true regarding the ARIMA forecasting technique?

· It is somewhat mathematically tedious and complex

· It relies on using past sales data exclusively

· Both of the above are true

· None of the above are true

Question 42

In Excel, the Forecast Sheet button produces a graph and a set of forecast estimates. According to the professor, this technique uses which forecasting approach?

· simple moving average

· exponential smoothing

· ARIMA modeling

· ARMA modeling

Question 43

In Excel, if you have five columns, each with a different year of data (think 2015, 2016, 2017, 2018, 2019), the spreadsheet has to be sorted so the oldest/smallest columns (ex: 2015) are always to the left and the newest/largest columns (ex: 2019) are always to the right, or the forecast can’t ever calculate the next predicted value correctly.

· True

· False