quant A1
Module 1: Introduction to Statistics
.
*
Module 1 Overview
- The Role of Statistics
- The Research Process
- Threats to Validity
- Statistical Terminology
- Scales of Measurement
- Introduction to Descriptive and Inferential Statistics
The Role of Statistics
- The goal of virtually all quantitative research studies is to identify and describe relationships among constructs, or variables.
- Data, or observations, are collected in a very systematic manner, and conclusions are drawn based on the data.
- At a basic level, statistical techniques allow us to aggregate and summarize data in order for researchers to draw conclusions from their study.
*
The Typical Research Process
- The typical quantitative study involves a series of steps, one of which is statistical analysis.
- NOTE: These are steps in the research process and not sections of the dissertation.
*
27.bin
Step 1: Research Questions
- Research questions reflect the problem that the researcher wants to investigate.
- Research questions can be formulated based on theories, past research, previous experience, or the practical need to make data-driven decisions in a work environment.
- Research questions are vitally important because they, in large part, dictate what type of statistical analysis is needed as well as what type of research design may be employed.
*
Examples of Research Questions
- How is financial need related to retention after the freshman year of college?
- What types of advertising campaigns produce the highest rates of inquiries among prospective applicants at NSU?
- How do males and females differ with respect to statistics self-efficacy?
- How does a body image curriculum improve body image in college females?
*
Hypotheses
- While research questions are fairly general, hypotheses are specific predictions about the results, made prior to data collection.
- As financial need increases, the likelihood of retention decreases.
- Personalized letters result in more inquiries than brochures.
- Males have higher levels of self-efficacy than females.
- Body image will improve as a result of the new curriculum.
*
Step 2: Operationalize & Choose Measures
- Many variables of interest in education and psychology are abstract concepts that cannot be directly measured.
- This doesn’t preclude us from studying these things, but requires that we clearly define the specific behaviors that are related to the concept of interest.
*
Measuring Abstract Concepts
- How does one measure retention, inquiry rate, statistics self-efficacy, and body image?
- The process of defining variables and choosing a reliable and accurate measurement tool is called operationalizing your variables.
- Good measurement is vital to the trustworthiness of your results!
*
Step 3: Choose a Research Design
- In Step 3, we develop a plan for collecting the data we need (i.e., a “blueprint” for the study)
- This is called research design, and includes things such as:
- Who will participate in the study?
- Who will receive the intervention?
- Will there be a “control group”?
- Will data be collected longitudinally?
- What instrument will be used to collect data?
- What type of data will be collected?
*
Threats to Validity
- Internal Validity
- Problems associated with the experimental procedures or experiences of participants
- External Validity
- Problems that affect the generalizability of the results
- The choice of design impacts the validity of your final results
Step 4: Analyze The Data
- Once the data have been collected, the results must be organized and summarized so that we can answer the research questions.
- This is the purpose of statistics
- The choice of analysis at this stage depends entirely on two prior steps:
- The research questions
- How the variable is measured
*
Step 5: Draw Conclusions
- After analyzing the data, we can make judgments about our initial research questions and hypotheses.
- Are these results consistent with previous studies?
- The conclusions drawn from a study may provide a starting point for new research.
*
The Role of Statistics
- Despite the anxiety usually associated with statistics, data analysis is a relatively small piece of the larger research process.
- There is a misconception that the trustworthiness of statistics is independent of the research process itself
- This is absolutely incorrect!
- A statistical analysis can in no way compensate for a poorly designed study!!!!
*
Statistical Terminology
Population
- A population is the entire set of individuals that we are interested in studying.
- This is the group that we want to generalize, or apply our results to.
- Although populations can vary in size, they are usually quite large.
- Thus, it is usually not feasible to collect data from the entire population.
*
Sample
- A sample is simply a subset of individuals selected from the population.
- In the best case, the sample will be representative of the population.
- That is, the characteristics of the individuals in the sample will mirror those in the population.
*
Parameters vs. Statistics
- In most studies, we wish to quantify some characteristic of the population.
- Example:
- The retention rate, inquiry rate, average level of self-efficacy, average level of body image
- This is the population parameter
- Parameters are generally unknown, and must be estimated from a sample
- The sample estimate is called a statistic
*
Variables
- A characteristic that takes on different values for different individuals in a sample.
- Examples:
- Retention (yes/no)
- Inquiry about NSU (yes/no)
- Self-efficacy (score on self-efficacy questionnaire)
- Body image (score on body image questionnaire)
- In statistical formulas, variables are usually referred to generically as X or Y
*
Independent Variables (IV)
- The “explanatory” variable
- The variable that attempts to explain or is purported to cause differences in a second variable.
- Some texts refer to the IV as the one manipulated by the investigator.
- Example:
- Does a new curriculum improve body image?
- The curriculum is the IV
*
Dependent Variables (DV)
- The “outcome” variable
- The variable that is thought to be influenced by the independent variable
- Example:
- Does a new curriculum improve body image?
- Body image is the DV
*
Confounding Variables
- Researchers are usually only interested in the relationship between the IV and DV.
- Confounding variables represent unwanted sources of influence on the DV, and are sometimes referred to as “nuisance” variables.
- Example:
- Does a new curriculum improve body image?
- Such things as heredity, family background, previous counseling experiences, etc. can also impact the DV.
*
Controlling Confounding Variables
- Typically, researchers are interested in excluding, or controlling for, the effects of confounding variables.
- This is generally not a statistical issue, but is accomplished by the research design.
- Certain types of designs (e.g., experiments) better control the effects of confounding variables (because they use a control group).
- If an experiment or an equivalent control group is not possible ANCOVA
*
Examples
- Do students prefer learning statistics online or face to face?
- What is the IV? DV?
- Are there differences in the anxiety levels of students who have had statistics before versus students who have never had statistics?
- What is the IV? DV?
Scales of Measurement
Variable Measurement Scales
- For any given variable that we are interested in, there may be a variety of measurement scales that can be used:
- What is your annual income? _________
- What is your annual income?
a. 10,000-20,000 b. 20,000-30,000 c. 30,000-40,000 d. 40,000-50,000 e. 50,000 or above
- Variable measurement is the second factor that influences the choice of statistical procedures (research question is the first).
*
Scales of Measurement
- Nominal
- Ordinal
- Interval
- Ratio
Nominal Scale
- Observations fall into different categories or groups.
- Differences among categories are qualitative, not quantitative.
- Examples:
- Gender
- Ethnicity
- Counseling method (cognitive vs. humanistic)
- Retention (retained vs. not retained)
*
Ordinal Scale
- Categories can be rank ordered in terms of amount or magnitude.
- Categories possess an inherent order, but the amount of difference between categories is unknown.
- Examples:
- Class standing
- Letter grades (A,B,C,D,F)
- Likert-scale survey responses (SD, D, N, A, SA)
*
Interval Scale
- Categories are ordered, but now the intervals for each category are exactly the same size.
- That is, the distance between measurement points represent equal magnitudes (e.g., the distance between point A and B is the same as the distance between B and C).
- Examples:
- Fahrenheit scale of measuring temperature
- Chronological scale of dates (1997 A.D.)
- Standard scores (z-scores)
*
Ratio Scale
- Same properties as the interval scale, but with an additional feature
- Ratio scale has an absolute 0 point.
- Absolute 0 point permits the use of ratios (e.g., A is “twice as large” as B).
- Examples:
- Number of children
- Weight
- Annual income
*
Categorical vs. Continuous Variables
- In practice, it is not usually necessary to make such fine distinctions between measurement scales.
- Two distinctions, categorical and continuous are usually sufficient.
- Categorical variables consist of separate, indivisible categories (i.e., men/women).
- Continuous variables yield values that fall on a numeric continuum, and can (theoretically) take on an infinite number of values.
*
Level of Measurement Summary
- In practice, the four levels of measurement can usually be classified as follows:
- Continuous variables are generally preferable because a wider range of statistical procedures can be applied
*
28.bin
Examples
- What is the level of measurement of
- Temperature OC?
- Color?
- Income of professional baseball players?
- Degree of agree (1 = Strongly Disagree,
5 = Strongly Agree)?
Descriptive Statistics
- Procedures used to summarize, organize, and simplify data (data being a collection of measurements or observations) taken from a sample (i.e., mean, median, mode).
- Examples:
- The average score on the Rosenberg Self-Esteem Scale was 7.5
- 63% of the sample described themselves as Caucasian
*
Inferential Statistics
- Techniques that allow us to make inferences about a population based on data that we gather from a sample.
- Study results will vary from sample to sample strictly due to random chance (i.e., sampling error).
- Inferential statistics allow us to determine how likely it is to obtain a set of results from a single sample.
- This is also known as testing for “statistical significance.”
*
Module 1 Summary
- The Role of Statistics
- Statistical Terminology
- Scales of Measurement
- Introduction to Descriptive and Inferential Statistics
Step 1:
Research
Questions &
Hypotheses
Step 2:
Operationalize
& Choose
Measures
Step 3:
Choose a
Research
Design
Step 4:
Analyze
Data
Step 5:
Draw
Conclusions
The Research Process
Nominal
Ordinal
Categorical Variables
Interval
Ratio
Continuous Variables