quant A1

Queenstar

Chat1IntroductiontoStatistics.ppt

Home >Mathematics homework help >Statistics homework help >quant A1

Module 1: Introduction to Statistics

Module 1 Overview

The Role of Statistics
The Research Process
Threats to Validity
Statistical Terminology
Scales of Measurement
Introduction to Descriptive and Inferential Statistics

The Role of Statistics

The goal of virtually all quantitative research studies is to identify and describe relationships among constructs, or variables.
Data, or observations, are collected in a very systematic manner, and conclusions are drawn based on the data.
At a basic level, statistical techniques allow us to aggregate and summarize data in order for researchers to draw conclusions from their study.

The Typical Research Process

The typical quantitative study involves a series of steps, one of which is statistical analysis.
NOTE: These are steps in the research process and not sections of the dissertation.

27.bin

Step 1: Research Questions

Research questions reflect the problem that the researcher wants to investigate.
Research questions can be formulated based on theories, past research, previous experience, or the practical need to make data-driven decisions in a work environment.
Research questions are vitally important because they, in large part, dictate what type of statistical analysis is needed as well as what type of research design may be employed.

Examples of Research Questions

How is financial need related to retention after the freshman year of college?
What types of advertising campaigns produce the highest rates of inquiries among prospective applicants at NSU?
How do males and females differ with respect to statistics self-efficacy?
How does a body image curriculum improve body image in college females?

Hypotheses

While research questions are fairly general, hypotheses are specific predictions about the results, made prior to data collection.
As financial need increases, the likelihood of retention decreases.
Personalized letters result in more inquiries than brochures.
Males have higher levels of self-efficacy than females.
Body image will improve as a result of the new curriculum.

Step 2: Operationalize & Choose Measures

Many variables of interest in education and psychology are abstract concepts that cannot be directly measured.
This doesn’t preclude us from studying these things, but requires that we clearly define the specific behaviors that are related to the concept of interest.

Measuring Abstract Concepts

How does one measure retention, inquiry rate, statistics self-efficacy, and body image?
The process of defining variables and choosing a reliable and accurate measurement tool is called operationalizing your variables.
Good measurement is vital to the trustworthiness of your results!

Step 3: Choose a Research Design

In Step 3, we develop a plan for collecting the data we need (i.e., a “blueprint” for the study)
This is called research design, and includes things such as:
Who will participate in the study?
Who will receive the intervention?
Will there be a “control group”?
Will data be collected longitudinally?
What instrument will be used to collect data?
What type of data will be collected?

Threats to Validity

Internal Validity
Problems associated with the experimental procedures or experiences of participants
External Validity
Problems that affect the generalizability of the results

The choice of design impacts the validity of your final results

Step 4: Analyze The Data

Once the data have been collected, the results must be organized and summarized so that we can answer the research questions.
This is the purpose of statistics
The choice of analysis at this stage depends entirely on two prior steps:
The research questions
How the variable is measured

Step 5: Draw Conclusions

After analyzing the data, we can make judgments about our initial research questions and hypotheses.
Are these results consistent with previous studies?
The conclusions drawn from a study may provide a starting point for new research.

The Role of Statistics

Despite the anxiety usually associated with statistics, data analysis is a relatively small piece of the larger research process.
There is a misconception that the trustworthiness of statistics is independent of the research process itself
This is absolutely incorrect!
A statistical analysis can in no way compensate for a poorly designed study!!!!

Statistical Terminology

Population

A population is the entire set of individuals that we are interested in studying.
This is the group that we want to generalize, or apply our results to.
Although populations can vary in size, they are usually quite large.
Thus, it is usually not feasible to collect data from the entire population.

Sample

A sample is simply a subset of individuals selected from the population.
In the best case, the sample will be representative of the population.
That is, the characteristics of the individuals in the sample will mirror those in the population.

Parameters vs. Statistics

In most studies, we wish to quantify some characteristic of the population.
Example:
The retention rate, inquiry rate, average level of self-efficacy, average level of body image
This is the population parameter
Parameters are generally unknown, and must be estimated from a sample
The sample estimate is called a statistic

Variables

A characteristic that takes on different values for different individuals in a sample.
Examples:
Retention (yes/no)
Inquiry about NSU (yes/no)
Self-efficacy (score on self-efficacy questionnaire)
Body image (score on body image questionnaire)
In statistical formulas, variables are usually referred to generically as X or Y

Independent Variables (IV)

The “explanatory” variable
The variable that attempts to explain or is purported to cause differences in a second variable.
Some texts refer to the IV as the one manipulated by the investigator.
Example:
Does a new curriculum improve body image?
The curriculum is the IV

Dependent Variables (DV)

The “outcome” variable
The variable that is thought to be influenced by the independent variable
Example:
Does a new curriculum improve body image?
Body image is the DV

Confounding Variables

Researchers are usually only interested in the relationship between the IV and DV.
Confounding variables represent unwanted sources of influence on the DV, and are sometimes referred to as “nuisance” variables.
Example:
Does a new curriculum improve body image?
Such things as heredity, family background, previous counseling experiences, etc. can also impact the DV.

Controlling Confounding Variables

Typically, researchers are interested in excluding, or controlling for, the effects of confounding variables.
This is generally not a statistical issue, but is accomplished by the research design.
Certain types of designs (e.g., experiments) better control the effects of confounding variables (because they use a control group).
If an experiment or an equivalent control group is not possible ANCOVA

Examples

Do students prefer learning statistics online or face to face?
What is the IV? DV?

Are there differences in the anxiety levels of students who have had statistics before versus students who have never had statistics?
What is the IV? DV?

Scales of Measurement

Variable Measurement Scales

For any given variable that we are interested in, there may be a variety of measurement scales that can be used:
What is your annual income? _________
What is your annual income?

a. 10,000-20,000 b. 20,000-30,000 c. 30,000-40,000 d. 40,000-50,000 e. 50,000 or above

Variable measurement is the second factor that influences the choice of statistical procedures (research question is the first).

Scales of Measurement

Nominal
Ordinal
Interval
Ratio

Nominal Scale

Observations fall into different categories or groups.
Differences among categories are qualitative, not quantitative.
Examples:
Gender
Ethnicity
Counseling method (cognitive vs. humanistic)
Retention (retained vs. not retained)

Ordinal Scale

Categories can be rank ordered in terms of amount or magnitude.
Categories possess an inherent order, but the amount of difference between categories is unknown.
Examples:
Class standing
Letter grades (A,B,C,D,F)
Likert-scale survey responses (SD, D, N, A, SA)

Interval Scale

Categories are ordered, but now the intervals for each category are exactly the same size.
That is, the distance between measurement points represent equal magnitudes (e.g., the distance between point A and B is the same as the distance between B and C).
Examples:
Fahrenheit scale of measuring temperature
Chronological scale of dates (1997 A.D.)
Standard scores (z-scores)

Ratio Scale

Same properties as the interval scale, but with an additional feature
Ratio scale has an absolute 0 point.
Absolute 0 point permits the use of ratios (e.g., A is “twice as large” as B).
Examples:
Number of children
Weight
Annual income

Categorical vs. Continuous Variables

In practice, it is not usually necessary to make such fine distinctions between measurement scales.
Two distinctions, categorical and continuous are usually sufficient.
Categorical variables consist of separate, indivisible categories (i.e., men/women).
Continuous variables yield values that fall on a numeric continuum, and can (theoretically) take on an infinite number of values.

Level of Measurement Summary

In practice, the four levels of measurement can usually be classified as follows:

Continuous variables are generally preferable because a wider range of statistical procedures can be applied

28.bin

Examples

What is the level of measurement of
Temperature OC?
Color?
Income of professional baseball players?
Degree of agree (1 = Strongly Disagree,
5 = Strongly Agree)?

Descriptive Statistics

Procedures used to summarize, organize, and simplify data (data being a collection of measurements or observations) taken from a sample (i.e., mean, median, mode).
Examples:
The average score on the Rosenberg Self-Esteem Scale was 7.5
63% of the sample described themselves as Caucasian

Inferential Statistics

Techniques that allow us to make inferences about a population based on data that we gather from a sample.
Study results will vary from sample to sample strictly due to random chance (i.e., sampling error).
Inferential statistics allow us to determine how likely it is to obtain a set of results from a single sample.
This is also known as testing for “statistical significance.”

Module 1 Summary

The Role of Statistics
Statistical Terminology
Scales of Measurement
Introduction to Descriptive and Inferential Statistics

Step 1:

Research

Questions &

Hypotheses

Step 2:

Operationalize

& Choose

Measures

Step 3:

Choose a

Research

Design

Step 4:

Analyze

Data

Step 5:

Draw

Conclusions

The Research Process

Nominal

Ordinal

Categorical Variables

Interval

Ratio

Continuous Variables