Sampling

Chapter 6

Introduction

Sampling is the process of selecting observations

Often not possible to collect information from all units you wish to study

Often not necessary to collect data from everyone out there

Allows researcher to make a small subset of observations and then generalize to the rest of the population

The Logic of Probability Sampling

Samples: a group of subjects selected from a population

Probability sampling: a method of selection in which each member of a population has a known chance of being selected

Enables us to generalize findings from observing cases to a larger unobserved population

Because we are not completely homogeneous, our sample must be representative of the variations that exist among us

Conscious and Unconscious Sampling Bias

Be conscious of bias: a sample is biased when it is not fully representative of the larger population from which it was selected

Sampling bias is not always obvious

Use techniques to help avoid bias

Representativeness and Probability of Selection

  • A sample is representative of the population from which it is selected if the aggregate characteristics of the sample closely approximate the same aggregate characteristics in the population
  • Samples in which all members of the population have an equal chance of being included are often labeled equal probability of selection method (EPSEM) samples

Sampling Terminology 1

Sample Element: who or what are we studying (student)

Population: whole group (college freshmen)

Population Parameter: summary description of a given variable in a population

Sample Statistic: summary description of a given variable in a sample; we use sample statistics to make estimates or inferences of population parameters

Sampling Terminology 2

  • Sampling distribution: the range of sample statistics we would obtain if we selected many samples from a population
  • Sampling frame: actual list of units to be selected (our school’s enrollment list)
  • Binomial variable: a variable with only two values
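To make the idea of a sampling distribution concrete, the sketch below draws many samples from a made-up population with a binomial variable and records the sample proportion from each one. The population size, the 50/50 split, the sample sizes, and the function name are illustrative assumptions, not figures from the chapter.

```python
import random
import statistics

def sampling_distribution(population, sample_size, n_samples, seed=0):
    """Return the proportion of 1s found in each of n_samples random samples."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    proportions = []
    for _ in range(n_samples):
        sample = rng.sample(population, sample_size)  # drawn without replacement
        proportions.append(sum(sample) / sample_size)
    return proportions

# Hypothetical population: 10,000 people, half coded 1 ("yes") and half coded 0 ("no").
population = [1] * 5000 + [0] * 5000
props = sampling_distribution(population, sample_size=100, n_samples=1000)

# The sample statistics cluster around the population parameter (0.5).
print("mean of sample proportions:", round(statistics.mean(props), 3))
print("spread of sample proportions:", round(statistics.stdev(props), 3))
```

Each value in `props` is one sample statistic; taken together they approximate the sampling distribution of the proportion.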

Sampling Terminology 3

Standard error: a measure of sampling error; it lets us estimate how far sample statistics can be expected to deviate from the population parameter
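For a binomial variable, the standard error of a sample proportion is commonly computed as the square root of p × q / n, where q = 1 − p. A minimal sketch, assuming a simple random sample; the values 0.5 and 100 are illustrative only.

```python
import math

def standard_error(p, n):
    """Standard error of a sample proportion p from a simple random sample of size n."""
    q = 1 - p  # proportion without the characteristic
    return math.sqrt(p * q / n)

# Hypothetical case: 50% of a sample of 100 report a given characteristic.
se = standard_error(0.5, 100)
print(round(se, 4))  # 0.05
```

Because n is in the denominator, a larger sample gives a smaller standard error.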

Confidence Levels and Confidence Intervals

Two key components of sampling error

We express the accuracy of our sample statistics in terms of a level of confidence that the statistics fall within a specified interval from the parameter
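One way to see how the confidence level and the confidence interval fit together is to compute an interval around a sample proportion. The sketch below assumes a simple random sample and the normal approximation, with z = 1.96 for a 95% confidence level; the sample values are made up.

```python
import math

def confidence_interval(p, n, z=1.96):
    """Return (low, high) bounds around sample proportion p; z=1.96 is about 95% confidence."""
    se = math.sqrt(p * (1 - p) / n)  # standard error of the proportion
    margin = z * se                  # half-width of the interval
    return p - margin, p + margin

# Hypothetical case: 50% of 400 respondents report a given characteristic.
low, high = confidence_interval(0.5, 400)
print(f"{low:.3f} to {high:.3f}")  # 0.451 to 0.549
```

Read this as: we are about 95% confident that the population parameter lies between 45.1% and 54.9%.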

Sampling Designs 1

Simple Random Sampling: each element in a sampling frame is assigned a number, choices are then made through random number generation as to which elements will be included in your sample
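A minimal sketch of simple random sampling, assuming the sampling frame is available as a Python list. The frame contents, the sample size, and the function name are hypothetical.

```python
import random

def simple_random_sample(sampling_frame, n, seed=None):
    """Number every element in the frame, then pick n of those numbers at random."""
    rng = random.Random(seed)
    # Elements are implicitly numbered 0..N-1 by their position in the list.
    chosen_numbers = rng.sample(range(len(sampling_frame)), n)
    return [sampling_frame[i] for i in sorted(chosen_numbers)]

# Hypothetical sampling frame: an enrollment list of 500 students.
frame = [f"student_{i:04d}" for i in range(1, 501)]
sample = simple_random_sample(frame, n=25, seed=42)
print(len(sample), "students selected")
```

`rng.sample` draws without replacement, so no element can be selected twice.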

Systematic Sampling: every kth element in the total list is chosen (systematically) for inclusion in the sample

Example: from a list of 10,000 elements, to get a sample of 1,000, select every tenth element

Choose first element randomly
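The 10,000-element example can be sketched as follows. The list contents and the function name are assumptions for illustration; the sketch also assumes the list length divides evenly by the sample size.

```python
import random

def systematic_sample(elements, sample_size, seed=None):
    """Pick every k-th element after a random start within the first k elements."""
    interval = len(elements) // sample_size  # sampling interval k (10 in the example)
    rng = random.Random(seed)
    start = rng.randrange(interval)          # random position among the first k elements
    return elements[start::interval][:sample_size]

# Hypothetical list: elements numbered 1 to 10,000.
elements = list(range(1, 10001))
sample = systematic_sample(elements, 1000, seed=1)
print(len(sample), "elements, starting at", sample[0])
```

Choosing the starting point at random is what keeps the selection from depending on the researcher's judgment.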

Sampling Designs 2

Stratification: modification to random and systematic sampling; ensures that appropriate numbers are drawn from homogeneous subsets of that population

Disproportionate stratified sampling: way of obtaining sufficient number of rare cases by selecting a disproportionate number
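A sketch of proportionate versus disproportionate stratified sampling. The strata ("victim" and "non-victim"), their sizes, and the sampling fractions are invented for illustration, and the function name is hypothetical.

```python
import random
from collections import defaultdict

def stratified_sample(units, stratum_of, fractions, seed=None):
    """Sample each stratum separately, using fractions[stratum] as its sampling fraction."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for unit in units:
        strata[stratum_of(unit)].append(unit)  # group units into homogeneous subsets
    sample = []
    for name, members in strata.items():
        k = round(len(members) * fractions[name])  # number to draw from this stratum
        sample.extend(rng.sample(members, k))
    return sample

def get_stratum(unit):
    return unit[0]

# Hypothetical population: 50 victims (a rare case) and 950 non-victims.
units = [("victim", i) for i in range(50)] + [("non-victim", i) for i in range(950)]

# Proportionate: the same fraction from every stratum.
proportionate = stratified_sample(units, get_stratum, {"victim": 0.1, "non-victim": 0.1}, seed=7)
# Disproportionate: oversample the rare stratum to get enough cases to analyze.
disproportionate = stratified_sample(units, get_stratum, {"victim": 0.8, "non-victim": 0.1}, seed=7)

print(sum(u[0] == "victim" for u in proportionate), "victims in proportionate sample")
print(sum(u[0] == "victim" for u in disproportionate), "victims in disproportionate sample")
```

When strata are sampled at different rates, estimates for the whole population generally need to be weighted to offset the oversampling.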

Cluster sampling: compile a stratified group (cluster), sample it, then subsample that set

This process can go on for many cluster levels
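A two-stage version of cluster sampling might look like the sketch below: first sample clusters, then subsample elements within each selected cluster. The city blocks, household labels, and counts are hypothetical.

```python
import random

def multistage_cluster_sample(clusters, n_clusters, n_per_cluster, seed=None):
    """Stage 1: sample clusters. Stage 2: subsample elements within each chosen cluster."""
    rng = random.Random(seed)
    chosen = rng.sample(sorted(clusters), n_clusters)  # stage 1
    sample = []
    for name in chosen:
        sample.extend(rng.sample(clusters[name], n_per_cluster))  # stage 2
    return sample

# Hypothetical frame: 20 city blocks with 30 households each.
clusters = {
    f"block_{b:02d}": [f"block_{b:02d}/house_{h:02d}" for h in range(30)]
    for b in range(20)
}
sample = multistage_cluster_sample(clusters, n_clusters=5, n_per_cluster=4, seed=3)
print(len(sample), "households from 5 blocks")
```

Further stages would repeat the same pattern, treating each selected unit as a new set of clusters to sample from.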

National Crime Victimization Survey

Seeks to represent the nationwide population of persons 12+ living in households (≈ 42K units, 74K occupants in 2004)

First defined are primary sampling units (PSUs)

Largest are automatically included, smaller ones are stratified by size, population density, reported crimes, and other variables into about 150 strata

Census enumeration districts are selected (CED)

Clusters of 4 housing units from each CED are selected

British Crime Survey

First stage – 289 Parliamentary constituencies, stratified by geographic area and population density

Two sample points were selected, which were divided into four segments with equal numbers of delivery addresses

One of these four segments was selected at random, then disproportionate sampling was conducted to obtain a greater number of inner-city respondents

Household residents aged 16+ were listed, and one was randomly selected by interviewers (n=37,213 in 2004)

Nonprobability Sampling

Nonprobability Sampling: the likelihood that any element will be included in the sample is unknown

Purposive sampling: selecting a sample on the basis of your judgment and the purpose of the study

Quota sampling: units are selected so that the total sample has the same distribution of characteristics as is assumed to exist in the population being studied

Snowball sampling: you interview some individuals, then ask them to identify others who will participate in the study, who in turn identify others, and so on
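The referral chain in snowball sampling can be sketched as a walk through a network of contacts. The referral map, the starting participant, and the size cap below are made up for illustration; in a real study the referrals come from the interviews themselves.

```python
from collections import deque

def snowball_sample(referrals, seeds, max_size):
    """Start from seed participants and follow referrals until max_size is reached."""
    sample = []
    seen = set(seeds)     # everyone already recruited or waiting to be interviewed
    queue = deque(seeds)  # people still to be interviewed, in order of referral
    while queue and len(sample) < max_size:
        person = queue.popleft()
        sample.append(person)  # "interview" this person
        for contact in referrals.get(person, []):
            if contact not in seen:  # do not recruit anyone twice
                seen.add(contact)
                queue.append(contact)
    return sample

# Hypothetical referral map: who each participant names as another possible participant.
referrals = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D", "E"],
    "D": [],
    "E": ["F"],
}
sample = snowball_sample(referrals, seeds=["A"], max_size=5)
print(sample)  # ['A', 'B', 'C', 'D', 'E']
```

Who ends up in the sample depends entirely on the starting participants and their contacts, which is why the chance of selection for any element is unknown.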

Example: Snowball Sampling

To study cannabis users, Hammersley and Leon (2006) gathered a snowball sample of 176 university students who had used marijuana at least once. Extensive interviews were then conducted with the students in the sample. Their results showed two types of users: those who used cannabis on a regular basis and those who used it on occasion. The results also showed that users experienced both positive and negative effects from using marijuana, and that the patterns of use were more similar to patterns of alcohol and tobacco use than to patterns of controlled substance use.

Hammersley, R. & Leon, V. (2006). Patterns of cannabis use and positive and negative experiences of use amongst university students. Addiction Research and Theory, 14(2), 189-205.