Research methods- Discussion Post- QUALITATIVE/QUANTITATIVE

profileRose2015
social-research-a-practical-introduction.pdf

: Experimental Research – in the Laboratory

and Beyond

In: Social Research: A Practical Introduction

By: Bruce Curtis & Cate Curtis

Pub. Date: 2017

Access Date: July 29, 2020

Publishing Company: SAGE Publications, Inc.

City: 55 City Road

Print ISBN: 9781847874757

Online ISBN: 9781526435415

DOI: https://dx.doi.org/10.4135/9781526435415

Print pages: 146-169

© 2011 SAGE Publications, Inc. All Rights Reserved.

This PDF has been generated from SAGE Research Methods. Please note that the pagination of the

online version will vary from the pagination of the print book.

: Experimental Research – in the Laboratory and Beyond

Key Words

experimental research

experiment

Lost Letter

Milgram

obedience

behavioural study

ethics

Summary

An outline of experimental research in psychology, and the social sciences generally, as an

approach with which causal relationships can be effectively measured, including: (i) a brief

history of experimental research;(ii) the key themes; and (iii) the use of a classic example

to provide an overview of the key concepts relevant to experimental research.

Experimental research has been a core method in psychology, including social psychology. In this chapter,

we argue that while other approaches have become increasingly popular among social psychologists,

experimental research has much to offer all social scientists. In particular, it is the only approach with which

cause-and-effect relationships can be confidently measured. We'll begin with a brief history of the approach

before working through our key themes, making use of a well-known classic experiment to provide examples

of key concepts.

Doing Data Collection and Analysis

In the social sciences, experimental research has been largely confined to the discipline of psychology. When

other social scientists do conduct experiments, the basis is often derived from psychology. For example, the

‘Lost Letter Technique’ (Milgram et al., 1965), which has been widely replicated, including by sociologists

(e.g., Ahmed, 2010; Burwell, 1987; Tykocinski and Bareket-Bojmel, 2009), is based on the work of the

extremely influential psychologist, Stanley Milgram.

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 2 of 25 : Experimental Research – in the Laboratory and Beyond

The Lost Letter Technique

This experiment involves leaving stamped and addressed envelopes in public spaces and

tallying how many are picked up and posted. The purpose is to examine prejudice and

altruism. In Milgram, Mann and Harter's original experiment (1965), the envelopes were

addressed to four bogus recipients: either Medical Research Associates, Friends of the

Communist Party, Friends of the Nazi Party, or an individual, Mr Walter Carnap. A total of

400 envelopes were dispersed across various locations (100 for each of the ‘recipients’).

As expected, frequency of return of the envelopes was related to social desirability of the

recipient (Medical Research Associates: 72%; Mr Carnap: 71%; 25% each for Friends of

the Communist Party and Friends of the Nazi Party). That is, assumptions made about the

supposed recipients of the letters ‘caused’ unwitting participants to decide either to post

the letter, or not.

Variations conducted by other researchers have included envelopes or misdirected email

messages to individuals who appear to belong to a particular ethnic group (for example,

Bushman and Bonacci, 2004; Tykocinski and Bareket-Bojmel, 2009) or religious group

(Kremer et al., 1986), and in another case, students and non-students (Burwell, 1987). In

the latter case, additional variables included male and female names, different types of

envelope, and type of location.

Note that the Lost Letter Technique is an example of an experiment being conducted in

the field, rather than in a laboratory.

So why are experiments more popular among psychologists?

• Psychology initially developed from the ‘hard sciences’ – chemistry, biology and so on – in which the

experiment is by far the preferred, if not the only, approach to empirical research.

• Psychology has traditionally taken the individual as the unit of analysis: experiments are relatively

easy to conduct on individuals. In contrast, many of the social science disciplines take groups or

societies as the unit of analysis, and it is far more difficult (though not impossible) to involve a large

group in a robust experiment.

• Other disciplines have other well-established research approaches, such as unobtrusive research

(see Chapter 8), secondary analysis (see Chapter 10), or (supposedly) more naturalistic approaches

such as interviews and focus groups (see Chapters 2 and 5, respectively). It is worth noting that

some of these social scientists – and including some social psychologists – have been highly critical

of experimental research, as will be discussed in more detail later in this chapter.

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 3 of 25 : Experimental Research – in the Laboratory and Beyond

Practice Point 1

Experimental research dominates psychology, but also has potential for use across the

social sciences. It avoids some of the pitfalls of other forms of primary research through

allowing the researcher a high degree of control, lessens reliance on the perceptions

and honest disclosure of participants and allows conclusions about cause-and-effect

relationships to be drawn.

Then why include experimental research in a book aimed at social science students in general, if most social

science disciplines have tended not to use experiments? Because the experiment:

• can be, and has been, used to good effect across the social sciences, despite some limitations

• is arguably the best way of determining cause-and-effect relationships

• may avoid some of the pitfalls of other research approaches, such as participants saying what they

think the researcher wants to hear, or the researcher misinterpreting social actions.

Practice Point 2

An experiment involves the manipulation and measurement of a variable (e.g., obeying

orders, completing a task, demonstrating empathy). It is a variable-centric approach.

Origins of the Approach

The first social psychological experiment is widely attributed to Norman Triplett (1898) – at least in texts

published in the English-speaking world. Triplett appears to have been interested in cycle races, and this was

the impetus for this early experiment. He had observed that cyclists go faster when being paced or riding with

others, than when riding alone, even when racing against the clock alone. He hypothesised that the presence

of others increases competitiveness and nervous energy. To test this hypothesis – the phenomenon which

social psychologists now refer to as ‘social facilitation’ – Triplett had children wind in fishing reels as quickly

as possible, either in pairs, or alone. Each child performed this task alone three times, and in a pair three

times. Although there was some variation in the results, overall the children wound in the fishing reel faster

when doing so as one of a pair, confirming Triplett's hypothesis (it's unclear why Triplett went from cycling to

winding-in fishing reels).

There are, however, other contenders for the crown of ‘first person to conduct a social psychological

experiment’. The claim is contested in part because of difficulty in defining exactly what is a social experiment.

For example, Binet and Henri (1894) published the results of an experiment on suggestibility four years earlier

than Triplett's publication (1898) and Charcot published on suggestibility even earlier (1881), but his work is

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 4 of 25 : Experimental Research – in the Laboratory and Beyond

often considered to be of a clinical nature, rather than social. Another issue is that years may elapse between

conducting an experiment and the results being published. Ringelmann conducted experiments on what is

now called ‘social loafing’ between 1882 and 1887 but his work was not published until some years afterward

(Ringelmann, 1913). (Many of you will have observed social loafing yourselves – the phenomenon in which

members of a group get less work done than if they had worked individually. For example, five students

working on a group assignment that is to result in a 5,000–word paper will typically put in less hours each

than if each of the five is to write a 1,000–word paper.)

Regardless of who conducted the first social psychology experiment, there are several classic examples that

are well known throughout the social sciences and beyond. Perhaps the most famous of all is Milgram's

shocking study of obedience (Milgram, 1963).

Milgram's ‘Behavioral Study of Obedience’ (1963)

Milgram's work stemmed from a desire to understand the mechanisms by which humans

obey the commands of others, in particular in regards to the obedience to commands

that led to the death of millions in Nazi death camps during the Second World War.

The participants in the original study were men aged 20–50, of various income levels.

The experiment consisted of ordering participants to administer increasingly severe

punishment to another person in the context of a learning experiment, the punishment

apparently consisting of an electric shock. Each time the learner answered a question

incorrectly he (all the participants and the ‘learner’ were men) was to be punished. In fact,

the ‘shock generator’ was non-functional and the ‘learner’ was an actor. Of course, the

participants were not aware of this and believed they were actually administering electric

shocks, as marked on the machine from ‘Slight’ to ‘Danger: Severe Shock’, supposedly

ranging up to 450 volts. The ‘learner’ was strapped into a chair, ostensibly to prevent

movement while being shocked, and electrode paste was applied to the learner's skin to

‘avoid burns’. The ‘learner’ and research participant were then put into adjacent rooms so

that they were unable to see each other.

All of the participants continued administering shocks up to the 300 volt ‘Intense Shock’

level, although in many cases they questioned the need to continue and were instructed

to do so by the experimenter, despite earlier complaints and requests from the ‘learner’

to stop. At the 300- and 315-volt levels the learner was heard to pound on the wall and

thereafter failed to respond to questions. Participants were told to treat a non-response as

an incorrect answer – in other words, to continue ‘shocking’. Thirty-five of the 40 original

participants continued beyond the 300–volt level, and 26 (65%) continued to the maximum

shock level, although many participants exhibited signs of extreme stress. This point is

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 5 of 25 : Experimental Research – in the Laboratory and Beyond

worth emphasising – participants who continued did not appear to lack empathy, but this

empathy was overridden by the need to be obedient.

We will return to this classic experiment throughout this chapter, to illustrate key terms,

discuss controversies and, finally, in our discussion of ethics.

Hypothesis-testing

Deductive reasoning works from the general to the specific. It is top-down, starting with a theory for which

confirmation is sought (see Chapter 1) via development of a hypothesis to be tested. It may be associated

with objectivity and seeking patterns of causation.

Milgram's theory could be conceived as ‘… the individual who is commanded by a legitimate authority

ordinarily obeys. Obedience comes easily and often’ (Milgram, 1963: 372).

Conceptual Concern 1

The development of an experiment begins with a theory to be tested. It is a deductive

approach (inductive approaches typically end with a theory).

When using experimental research, a theory is followed by a hypothesis, which is an explicit, testable

prediction about the conditions under which an event or action will occur. A hypothesis takes the form ‘If a,

then b’, thus in situation a, b will occur. For example, a hypothesis about Milgram's research may have been ‘If

40 American men are placed in the experimental situation described above, less than 5% will obey commands

to continue to the end of the shock series’. Hypothesis-testing is discussed in more detail in Chapter 1.

Once we have our hypothesis, the next step is to move from an abstract concept to develop operational

definitions and variables. An operational definition states specifically how the variable will be measured,

manipulated, or otherwise studied. One of the operational definitions that Milgram used (1963: 374) was that

a participant who refuses to continue with the experiment before administering the thirtieth shock is deemed

a defiant subject, while any participant who complies fully with the experimenter's commands is termed an

obedient subject.

Variables in Experiments

Experimenters make use of several types of variables:

Independent variables: The factors experimenters manipulate to see if they affect the dependent variable.

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 6 of 25 : Experimental Research – in the Laboratory and Beyond

Dependent variables: The factors experimenters measure to see if they are affected by the independent

variable.

Subject variables: Variables that characterise pre-existing differences among study participants.

The independent variables include the experimental setting itself, for example, the formality of a laboratory

setting in a university with a man in a laboratory coat overseeing the experiment and urging the participant to

continue, versus a less formal setting in an office block, with the experimenter dressed casually and leaving

the room as soon as the experiment begins. There are many other potential independent variables, some of

which have been used in further studies by Milgram and others, such as having the ‘learner’ in the same room

as the participant and shock machine during the experiment (when the ‘learner’ was in the same room, the

compliance rate dropped from 65% to 40% (Milgram, 1974)).

Conceptual Concern 2

Experimental research is a variable-centric approach. Following the development of the

hypothesis, the key concern is the development of appropriate variables and ways of

measuring these.

Milgram stated that the primary dependent variable was the maximum shock a participant administers.

This was measured by noting the shock levels, from 0 (refusal to administer any shock at all) to 30

(willing to administer the maximum shock), and measuring the duration of shocks with timers. In addition,

experiments were filmed and/or audio-taped, photographs were taken through one-way mirrors and notes of

any unusual behaviours were kept. Finally, after the experiment, participants were interviewed using open-

ended questions, projective measures and attitude scales (see Chapter 5 on focus groups and Chapter 2 on

in-depth interviews for further discussion of these).

Subject variables include the gender of participants (in later experiments females were just as likely to

administer the maximum shock as males), or education levels or social class.

Conceptual Concern 3

Experimental research is initially characterised by fixed framing. Once the key components

of the experiment are decided upon – the hypothesis to be tested and the means for doing

so – it is very difficult, if not impossible, to change them. In order to conduct statistical

analysis, the conditions of the experiment must be the same in each case. However, it is

common for variables to be altered in subsequent experiments, incorporating some fluidity.

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 7 of 25 : Experimental Research – in the Laboratory and Beyond

Consider Milgram's later variations conducted in an office building, with the ‘learner’ in

the same room as the participant, with the experimenter in a different location, etc. These

variations provide important additional insights.

Given the fixed framing of this approach, the validity of the variables is imperative. Validity refers to the extent

to which the variables measure what they are intended to measure. With some other approaches this is quite

straightforward. For example, when designing a survey the researcher makes sure the questions (variables)

are easily understood and that the available answers (values) capture all possibilities accurately. This may

be tested by discussing the questions in a focus group (see Chapter 5) or a pilot survey (see Chapter 6).

However, in an experiment the question of validity may be more complex.

Validity and Reliability in Data Collection

Validity and reliability refer to the robustness of the research – how likely it is that experiment ‘works’

and produces results that are accurate and meaningful beyond the experiment itself. They are often

conceptualised as:

• internal validity, and

• reliability (or external validity).

The key concern of internal validity is ‘How certain can we be that the independent variable caused the effects

obtained on the dependent variable?’ Thus, is the experiment itself valid? That is, do changes in the setting

in which electric shocks are administered (e.g., from university laboratory to anonymous office) really cause

changes in obedience, or is there some other reason changes occur?

Construct validity is a type of internal validity and is used to evaluate the manipulation (change) and

measurement of variables. It refers to the extent to which the manipulations in an experiment really

manipulate the conceptual variables (constructs) they were designed to manipulate, and the measures used

in a study really measure the conceptual variables (such as obedience) that they were designed to measure.

For example, is the strength of the electric shocks (the dependent variable) that someone is prepared to

administer really an indication of how obedient that person is? Control groups may be useful for ruling out

alternative explanations for results. In a variation on his original experiment, Milgram included a control group

who were not given continuing commands. Once participants were given the initial instructions, they were left

to decide at what point to cease giving shocks, in contrast to the experimental (non-control) groups who were

continually encouraged to keep shocking. Less than 5% of control group members gave the maximum shock

level, indicating that the ongoing commands (a manipulation of the independent variable) were an important

aspect of continued obedience.

Reliability (or external validity) refers to the degree to which the findings can be generalised to other people

and to other (external) situations, as opposed to behaviour that occurs in an experimental setting only. It also

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 8 of 25 : Experimental Research – in the Laboratory and Beyond

refers to the extent to which the results can be reliably produced at another time, or by another researcher

(assuming all the conditions/variables are the same). Some key considerations are:

• Is the sample representative of the population as a whole?

• Is the setting in which the research is conducted likely to affect participants' behaviour?

Reliability may be measured by repeating the experiment with another sample of participants, to see if

the same findings are achieved. If the samples are representative, you would expect to get the same

results, providing the setting and other experimental conditions are the same. (Briefly, we say that a sample

is representative when it largely reflects the composition of the population from which it is drawn – the

same proportions of men and women, of various age groups, ethnicities, etc.) This type of repetition of an

experiment is known as replication. We know that Milgram's original experiment was not representative of

the population in so far as all the participants were men, therefore it was not possible to assume that women

would behave in the same way.

Although the setting of Milgram's experiment was a laboratory, as opposed to a ‘real world’ setting, we can

assume that the experiment had a relatively high level of realism in some respects; that is, participants really

believed they were administering electric shocks. However, it is worth noting again that the results varied

according to the setting. We cannot conclude that the 65% of participants who administered the maximum

level of shock in the laboratory would also do so in other settings – the original experiment only tells us

what people would do in a laboratory. Although participants were less likely to administer the maximum

level of shock in later experiments conducted in more neutral settings, this is likely to have been due to

the deliberate manipulation of the independent variable: the more casual atmosphere, and the experimenter

therefore seeming to have less authority. It seems plausible that the experiment appeared real regardless of

whether it was conducted in the laboratory or an office building. Experimental realism is achieved most readily

in field experiments, as summarised in Table 7.1:

Table 7.1 Laboratory and Field Experiments Compared

Laboratory experiments Field experiments

Conducted in settings designed for the purpose.

• Advantages: The environment can be

controlled; the participants can be carefully

studied.

• Disadvantage: Participants may behave

differently from how they normally would.

• Example: Milgram's ‘Behavioural study of

obedience’.

Conducted in real-world settings.

• Advantages: Realism; participants may feel more

comfortable; participants are more likely to act naturally.

• Disadvantage: Experimenter has less control.

• Example: Milgram's ‘Lost Letter Technique’.

Experimental realism is also discussed below, in relation to the use of technology – video games – in a study

on prejudice.

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 9 of 25 : Experimental Research – in the Laboratory and Beyond

Cases in Experiments

Our discussion of data collection has focused on issues around variables. This is natural, as the experiment

is a variable-centred approach. However, we have also discussed cases. In experimental research, the cases

are the participants.

Conceptual Concern 4

As noted, experimental research is a variable-centric approach. Here the cases are

participants in the experiment. The number of cases is relatively few compared to other

variable-centric approaches (such as the survey). Like the survey, experimental research

is primarily quantitative, with data analysed using statistics. However, there is often

untapped potential for incorporating qualitative material, for example in debriefing the

participants.

Figure 7.1 shows us the data from Milgram's original experiment, as it might look entered into a computer

spreadsheet. There are only two variables showing: ‘Participant Number’ and ‘Shock’. The rows represent the

cases – the 40 people who took part in the experiment. The information within each of the cells in the column

headed ‘Shock’ are the values, i.e., the possible ‘answers’ to the variable (question) ‘What is the maximum

level of shock a person is prepared to give in this experiment?’ Because the cases have been entered in

order according to maximum shock level, we can see that 26 people (65% of the sample) gave a maximum

shock of 450 volts, one person refused to continue past 375 volts, and so on.

Compared to deciding on the variables, and appropriate ways of measuring them, determining the cases

is relatively easy. The key question is whether it is desirable for the cases/participants to be representative

of the population in general. Often the answer will be yes – you will want to be able to generalise your

findings to the population at large. In this situation, your next concern is recruiting a representative sample.

For example, how will you make your research known to people? Are there any incentives you can offer

them for participating? Sometimes, though, you may wish to conduct your experiment with a particular group

of people. For example, a repeat of Milgram's experiment with members of the armed forces may yield

interesting results, especially given that this experiment came about as a result of Milgram's interest in the

obedience of Nazi soldiers. Even in a situation such as this, however, you will probably want to be confident

that your sample of soldiers represents the population of soldiers from which it is drawn, for example in terms

of rank, gender, age and time served.

Figure 7.1 Word Data Matrix Showing the Variable ‘Shock’ by 40 Cases

Participant Number Shock (volts

1 300

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 10 of 25 : Experimental Research – in the Laboratory and Beyond

2 300

3 300

4 300

5 300

6 315

7 315

8 315

9 315

10 330

11 330

12 345

13 360

14 375

15 450

16 450

17 450

18 450

19 450

20 450

21 450

22 450

23 450

24 450

25 450

26 450

27 450

28 450

29 450

30 450

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 11 of 25 : Experimental Research – in the Laboratory and Beyond

31 450

32 450

33 450

34 450

35 450

36 450

37 450

38 450

39 450

40 450

Practice Point 3

Generally speaking, it is better to have a large sample if you intend to undertake statistical

analysis, as is usually the norm with experiments. You will want to be able to make some

claims that your sample represents the population from which it is drawn, and that your

results didn't occur by chance. If you only have five people in your sample, it is possible

that they could all be unusual in some way, and you wouldn't have achieved the same

results with another sample. If you have a sample of 50, it is likely that any individual

idiosyncrasies will be averaged out, and is even more likely with a larger sample.

A final point to consider before we move on to discuss data analysis is that psychologists have often

recruited participants from first-year psychology classes. This is convenient because psychology students are

reasonably likely to be interested in psychology experiments, and they are often offered an incentive such as

a credit of marks. What are some situations in which it may be desirable to recruit participants solely from this

group? What are some situations in which it would be problematic?

Analysing Experiments

Experimental data analysis is usually analysed using software such as SPSS. A thorough discussion of

statistical analysis is beyond the scope of this text. However, we will introduce you to some core concepts

regarding the analysis of experimental data. These basics include statistical significance, normal distribution

and standard deviation.

Statistical Significance

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 12 of 25 : Experimental Research – in the Laboratory and Beyond

In order to be able to understand research findings that you are likely to encounter in the course of your

studies, there are some key terms you should be familiar with. Chief among these is ‘statistical significance’.

This term refers to the likelihood that the results could merely have occurred by chance or coincidence. If

a result has occurred five or fewer times in 100 possible outcomes, then it is considered to be ‘statistically

significant’, i.e., it is unlikely to be a coincidence. This is often expressed in academic journal articles as a ‘p

value’: p <0.05 (the ‘p’ stands for probability, and 0.05 is another way of saying five in a hundred or 5%). The

more zeros after the decimal point, the more significant the result: 0.0005 is much more significant than 0.05.

However, a p value does not necessarily include a 5. In fact, although 0.05 (or 5%) is the general cut-off point

for significance, you are more likely to see p values of <0.01 (or 1%, which is more significant than 5%) and

<0.001 (which is the same as one in a thousand or 0.1%).

Normal Distribution (also Called the Bell Curve) and Standard Deviation

The normal distribution tells us how we would expect scores to be distributed, according to the mean

(average) score for the sample. A standard deviation is used to demonstrate how much a score deviates

(is different) from the mean. For example, imagine that the marks for your final examination have just been

posted, showing that the mean was 56.3%, and that you scored 82%. It seems likely that you did well in

comparison to your classmates, but you can't be sure unless you know what is within the normal range –

in other words, how much your score has deviated from the norm. It could be that lots of people got scores

above 80% (although this would also mean that a lot of people got very low scores since the mean is 56.3%).

Figure 7.2 illustrates how these issues may be represented graphically. To the right of the graph is some

basic information: the mean, the standard deviation, and the number of participants. You can see that your

score was more than one standard deviation from the mean (56.3 + 16.9 = 73), so was unusually high. In

fact, you are one of only three people out of the 42 who sat the exam to score above 80% (the vertical y axis

is labelled ‘Frequency’ and indicates how frequent each score was – i.e., how many students scored within

each range of marks – while the horizontal x axis shows the marks, with each bar representing a range of 10

marks, from 0–10, and so on). The line forming an upside-down U is the normal curve. We can see that the

marks attained reflect the normal curve fairly closely. A few more people scored in the 50–60% and 0–10%

ranges than the mean and normal curve would suggest, while the 30–40% and 40–50% are perhaps a little

low, but only slightly.

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 13 of 25 : Experimental Research – in the Laboratory and Beyond

Figure 7.2 Histogram Illustrating Standard Deviation and Distribution – Exam Results

Table 7.2 Frequency Table – Milgram's Experiment

Volts Frequency Percentage Valid percentage Cumulative percentage

300 volts 5 12.5 12.5 12.5

315 volts 4 10.0 10.0 22.5

330 volts 2 5.0 5.0 27.5

345 volts 1 2.5 2.5 30.0

360 volts 1 2.5 2.5 32.5

375 volts 1 2.5 2.5 35.0

450 volts 26 65.0 65.0 100.0

Total 40 100.0 100.0

Let's look again at our core example for this chapter. A simple calculation tells us that the mean shock level

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 14 of 25 : Experimental Research – in the Laboratory and Beyond

administered by the 40 participants in Milgram's original experiment was 405 (in the ‘Danger: Severe Shock’

range). Table 7.2 gives us some basic information on the frequency of administration of maximum shock

levels, while the graph in Figure 7.3 gives us some additional information, including the normal curve and the

standard deviation. Both have been exported from SPSS.

The x axis (the one that goes across) represents the level of volts. The y axis (the one that goes up)

represents the frequency at which participants stopped giving shocks – or, to put it another way, the number

of people/cases who stopped at each volt level. So by reading across to the first bar on the left we can see

that nobody stopped administering shocks before they reached 300 volts, but five refused to go on past that

point. (We can also see this in the first line of Table 7.2.) Then four people stopped at 315 volts, etc. If we look

across to the last bar on the right, we can see that 26 people stopped at the 450–volt level (the maximum).

The gently curving line across the graph is the normal curve; it tells us what we could expect to happen if

people conformed to the norm that is predicted by the mean – where we would have expected the tops of

the bars to reach. It looks quite different from the normal curve in the graph of exam results (Figure 7.2), in

which the normal curve is quite close to the tops of the bars. Given that the mean in Milgram's study is 405

volts, we would expect the highest bar to be located at the 400 point on the x axis, the next highest bars

to be immediately to the left and right at 380, and so on, with the lowest bars being at either end. Instead,

something interesting has happened. The actual height of the bars – the numbers of participants and the

maximum shocks they were prepared to give – does not conform to the normal curve at all. The highest bar is

at the extreme right, not the middle of the curve. This suggests that something very important has happened

in this experiment.

It is also interesting to note that no one stopped between 375 volts, which had the additional notation on the

machine of ‘Danger: Severe Shock’, and the maximum – people continued through 390, 405, 420 and 435

volts. Unfortunately, Milgram gives no explanation for this, but we will return to it later in the chapter.

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 15 of 25 : Experimental Research – in the Laboratory and Beyond

Figure 7.3 Histogram Illustrating Standard Deviation and Distribution – Milgram's Experiment

Some Issues in Research

The use of experimental research in the social sciences has been highly criticised. In this section we discuss

the key concerns: ethics and epistemology. We also consider the impact of the availability of resources.

Ethical Issues

Researchers have a moral and legal responsibility to abide by ethical principles, based on moral values.

Chief among these is the minimisation of harm, such that if there is a risk of distress, procedures must be

in place to prevent this occurring wherever possible, and if it should occur, it should be dealt with as quickly

and effectively as possible. To further minimise the risk of harm, ‘informed consent’ is considered paramount.

Every participant should be clear about what the research will involve before agreeing to take part, and

should be able to refuse to participate, or to stop participating at any time, without negative consequences.

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 16 of 25 : Experimental Research – in the Laboratory and Beyond

These strictures may seem quite sensible and straightforward at first glance, but consider how they apply

to Milgram's experiment, in which deception was an essential ingredient and participants were very strongly

encouraged to continue whenever they showed reluctance, with repeated exhortations to complete the

‘learning task’: ‘It is absolutely essential that you continue’ and ‘You have no other choice, you must continue’.

Indeed, Milgram used the phrase ‘bring the subject [participant] into line’ when discussing unwillingness

to continue (1963: 374). As you can imagine, the use of deception such as this has caused particular

concern in social psychology. Virtually every study now has to be evaluated for its ethical appropriateness by

experienced researchers before the study can be conducted. Debriefing is generally considered a necessity

whenever a deception has occurred as part of an experiment.

Debriefing refers to the process of discussing the research with the participants after completion of an

experiment (or interview, etc.). It allows the participant to ask questions about the procedure, and for the

researcher to explain, which is particularly important where deception has taken place. Milgram describes

‘dehoaxing’, or procedures to ensure that the participant would ‘leave the laboratory in a state of well-being.

A friendly reconciliation was arranged between the subject and the victim, and an effort was made to reduce

any tensions that arose as a result of the experiment’ (1963: 374). No doubt participants were greatly relieved

that they had not, in fact, inflicted pain and suffering on anyone. However, it seems reasonable to assume

that the experiment had ongoing consequences for at least some of the participants. How would you feel if

you had behaved in a way that seemed to be causing someone great pain, even endangering their life, and

you had continued to do so, despite their protests (even if you later found it had been a hoax)?

As mentioned above, moral values inform ethical practice. But do moral values affect science in areas other

than ethical issues? Should values affect scientific inquiry? Traditionally, (social) psychologists strived for an

objective, value-free approach to research. It was felt that one's personal values could impinge on research

in a negative manner, resulting in unconsciously leading participants to behave in ways that justify the

researcher's values or beliefs, or the misinterpretation of findings. However, can science be totally unbiased

and objective? The experiment is based on a positivist approach, and the positivist would likely say that

science can, and should, be value-free and objective. However, we can see that researchers' values are often

inextricably intertwined with their research. Milgram's research was based on his desire to contribute to our

understanding of some of the most reprehensible behaviour to ever occur. This is clearly based on a moral

value. Further, social research is often undertaken with a goal of ‘making the world a better place’. A major

figure in the social sciences of the 1930s to 1950s, Kurt Lewin, argued that research should be applied to

important, practical issues (Lewin, 1951). This in itself is a value-laden statement.

Despite a desire for scientific neutrality, our personal ethics and values often do contribute to the research

process, through the research topics we find interesting to the application of findings in the wider community,

whether through informing social policy or social service and health programmes. Many current researchers

would argue that it is better to recognise personal positions and consider how they may contribute to our

research.

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 17 of 25 : Experimental Research – in the Laboratory and Beyond

Conceptual Concern 5

The experiment is firmly rooted in the positivist tradition. Developing from the hard

sciences, experimenters attempt to apply scientific approaches to the understanding of

society. Theories are formulated and tested with reference to measurable, quantifiable

results.

While it appears to be unusual for experimenters also to use other research approaches, there is certainly the

potential to do so. Again, Milgram provides an example. As mentioned above, he gathered various types of

additional information, including notes of participants' behaviour, tape-recordings and photographs. Although

the use made of these is quite limited, some illuminating observations are reported. For example:

I observed a mature and initially poised businessman enter the laboratory smiling and confident.

Within 20 minutes he was reduced to a twitching, stuttering wreck, who was rapidly approaching a

point of collapse. … At one point he pushed his fist into his forehead and muttered: ‘Oh God, let's

stop it.’ And yet he continued to respond to every word of the experimenter, and obeyed to the end.

(Milgram, 1963: 377)

Milgram also quotes some subjects who did not continue to the maximum shock level (remembering that

everyone began at 15 volts and continued to 300 volts, with a minimum of 15 shocks administered):

‘I think he's trying to communicate, he's knocking. … Well it's not fair to shock the guy … these are

terrific volts. I don't think this is very humane. … Oh, I can't go on with this; no, this isn't right. It's a

hell of an experiment. The guy is suffering in there. No I don't want to go on. This is crazy.’ (Milgram,

1963: 376)

Yet think how interesting it would be if we had information on how participants felt after the experiment, what

their thoughts and feelings were, how they explained their obedience to themselves, both before and after

they were told they had been deceived. Further interviewing may also have helped us to understand why all

those who quit the experiment before the end did so between the 300–volt level (the top of the ‘Intense Shock’

range, at which point five people quit) and the 375–volt level. No one quit the experiment between 375 volts

and 450 volts (there were four shock levels in between, at which participants could have refused to go on, yet

no one did refuse). What was the reasoning of the individuals behind this distinct pattern? Similarly, we have

no information on the longer-term impact of taking part. Issues such as these led to experimental research

coming under increasing criticism from the 1950s to the 1970s.

The Crisis in Social Psychology: A Crisis in Positivist Epistemology?

The 1960s and early 1970s can be thought of as a time of questioning and consciousness-raising in western

society in general, engendering both optimism and expansion, and crisis and debate. This extended to social

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 18 of 25 : Experimental Research – in the Laboratory and Beyond

psychology too, in particular the appropriateness of the dominant research method, the experiment.

Hepburn (2003) conceptualises the crisis in social psychology as consisting of three areas:

• Critique of method.

• Critique of individualism.

• Theoretical critique.

Together, they can be understood as a critique of the positivist standpoint on which experimental research

was based.

The experimental/laboratory approach had been dominant in social psychology for decades, but by the 1950s

it already had critics. These criticisms increased in the 1960s, particularly around the inadequacy of laboratory

methods to understand cultural and historical significance and motivations. In addition, the experiment was

criticised as mechanistic: the study of cause and effect treats people as machines without capacity for plans

and options, while the artificiality of the laboratory meant (it was argued) that findings cannot be extrapolated

to real life (this is why experimental realism is important). In some cases, there are only a certain number

of possible responses. It is assumed that the researcher knows all the possible options and therefore the

possibilities for alternatives are negated. Kenneth Ring stated that social psychological research treated

important social issues as fun-and-games experiments (Ring, 1967).

In regards to individualism, concern was raised that social psychology was focused on social aspects of the

individual, ignoring how the social environment impacts the individual. This is a one-sided view of human

behaviour. For example, Lewin (1951) developed a set of fundamental principles for social psychology:

(social) behaviour depends to a large extent on how we perceive and interpret the world around us; it is

a function of the interaction between the person and the environment; social psychological theories should

be applied to important, practical issues. His concluding statement may be seen in part as an indictment of

the constraints of the prevailing research method: ‘No research without action, no action without research’

(Lewin, 1951: 193), while his emphasis on the interaction between the person and his or her social world

points to the need to consider the social context as much as the individual. In Milgram's obedience experiment

we can learn about a specific aspect of social behaviour in a specific setting – obedience in a laboratory.

However, the broader social and historical context is ignored and the ability to use the findings for social

action is limited. For example, we learn nothing about the impact of upbringing, or ethnicity, on participants'

behaviour, or how the participants felt about their behaviour, either during or after the experiment. Nor do we

learn anything about the 35% of people who did NOT continue administering ‘electric shocks’ to the maximum

level. When the unit of focus in research is the individual, it is likely that solutions to social issues will be

focused on changing individuals, such as behaviour, personality, or moral reasoning, rather than considering

social structures and their impacts, such as inequality, income, (un)employment and social influences.

Traditional psychology places little value on theory, beyond a rather narrow focus on the specific topic. This

lack of focus on wider applications of research was seen increasingly in the 1960s and 1970s as the result of

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 19 of 25 : Experimental Research – in the Laboratory and Beyond

an inappropriate emphasis on positivist epistemology. (We discuss epistemology in more depth in a number

of chapters, particularly in Chapter 1.) The crisis in social psychology was evidence of a broader challenge

to positivism and the positivist emphasis on observation and measurement, and the potential for scientists to

form objective/unbiased understandings of the social world. In contrast, a number of approaches drawn from

social realism and social constructivist epistemologies were advanced by a disparate group of critics. Thus

phenomenology, ethnomethodology, structuralism, Marxism, feminism, and other critical theories offered new

conceptions which claimed to be more appropriate for understanding social psychological behaviour and

society and for improving them.

A major European social psychological work published at this time was Serge Moscovici's The context of

social psychology (1972). Moscovici criticises the individualist and capitalist assumptions of traditional social

psychology, arguing that they reinforce the political status quo by treating it as universal and inevitable. In

contrast, Moscovici argued from a social realist stance that:

• Society both individualises and socialises: it teaches us how to behave.

• Behaviours only acquire their meaning according to the responses received and our interpretations

of these responses, and this is regulated by our common history of norms and rules.

• Therefore, the study of social psychology must include the broader social context.

The ‘crisis’ strengthened psychology as a discipline, and furthered the rise of social psychology, particularly

through the adoption of pluralism: that is, the acceptance of many approaches to research in addition to

the laboratory experiment and the development of international and multicultural perspectives (Kassin et al.,

2008). However, positivism has proven to be more robust in psychology than its sister disciplines in the social

sciences. A cursory glance at almost any introductory psychology or even social psychology textbook will

reveal that experimental research still dominates.

Practice Point 4

The resources required to conduct an experiment can vary enormously, from almost

nothing but the experimenter's time, to extremely expensive equipment and the time of a

technician and a multi-person research team.

Resourcing

Many experiments require little in the way of resources, beyond any single researcher's time and some office

space. Consider the recent variations of the Lost Letter Technique, using email. In these experiments all

that would be required is time and a computer with an email connection and statistical analysis software.

In an experiment one of the authors participated in while an undergraduate student, all that was required

was an office and a large bag of condoms (don't be alarmed – the experiment was on embarrassment and

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 20 of 25 : Experimental Research – in the Laboratory and Beyond

nothing untoward happened). In contrast, Milgram's original obedience experiment required rather more: a

confederate (the ‘learner’), three rooms beside each other (the room in which the ‘learner’ sits, the room with

the shock machine and, on the other side of one-way glass, an observation room), the experimenter himself,

at least one observer, the ‘shock machine’ and associated equipment, and recording equipment. As new

technology becomes available, the potential for more complex and expensive experiments increases.

Putting the Approach in Context

While the availability of new technologies has certainly allowed experimenters to undertake innovative and

complex work, it has also standardised the approach in some respects, particularly in regard to data analysis.

In this section we consider the impact of new technology as both an aid to the development of experiments

and as the impetus behind new methodological orthodoxies.

Practice Point 5

Experimenters have a history of adapting experiments and taking advantage of new

technology as it becomes available. Milgram's ‘shock machine’ is an example of the use

of the available technology in the 1950s being utilised, while his Lost Letter Technique has

been adapted by others using electronic mail. Another example follows.

In an experiment on prejudice, participants played a video game in which a person popped up unpredictably,

holding either a gun or another object (Correll et al., 2002). Sometimes the individual was white, sometimes

African American. Participants were to ‘shoot’ the individual if he was holding a gun, or to press a different

button if he was holding another object – and in either situation to respond as quickly as possible. Participants

made mistakes equally often when the target was white – shooting at an unarmed target and not shooting the

armed target. However, they were significantly more likely to shoot the unarmed African American target, and

significantly less likely to fail to shoot the armed African American target.

In similar experiments that didn't involve video games (such as showing photographs: Judd, Blair and

Chapleau (2004); Payne (2001) a tendency to associate weapon use with African Americans was clearly

displayed. Note that while these experiments do appear to indicate a tendency towards stereotyping African

Americans as more likely to be dangerous, there is an important aspect of a ‘good’ experiment missing –

realism. This lack of realism raises issues of generalisability, as discussed above. While participants were

willing to ‘shoot people’ while playing, they were fully aware that they weren't really harming anyone and it is

highly unlikely that they would so willingly shoot people in real life. While these experiments appear to tell us

something about the frequency of prejudicial thinking, we cannot conclude anything about racist actions (in

addition, they tell us nothing about the participants' underlying beliefs or attitudes, so are not a useful source

of information for counteracting prejudice). On the other hand, experimental realism was an integral part of

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 21 of 25 : Experimental Research – in the Laboratory and Beyond

Milgram's electric shock experiment, but carried with it some major ethical concerns. Here are some points to

ponder regarding Correll, Park, Judd and Wittenbrink's (2002) work:

• How might these experiments be adapted to heighten experimental realism – that is, to encourage

the sense of a ‘real life’ situation rather than a game?

• What would be the ethical concerns with doing so?

• How might these ethical concerns be addressed?

We can see that the availability of computers and other electronic equipment has allowed experimenters to

both update old experiments and develop new ones. The use of video games in experiments probably also

adds an additional element of fun, and may well assist in the recruitment of participants through making the

experience seem attractive.

An undeniable advantage of new technology is the availability of software that allows us to quickly and

accurately undertake statistical analysis. Once data is entered, tables and graphs showing results can be

produced in a matter of seconds – work which once would have taken hours or days. Note that the results

shown in this chapter, including frequencies, norms and standard deviations, are merely the ‘tip of the iceberg’

in terms of the analysis it is possible to do. For example, if an experiment uses a control group and an

experimental group, it would be usual to conduct an ‘analysis of variance’ to determine if any differences

between the two groups are significant (are unlikely to have occurred by chance) or one may perform cross-

tabulation analysis using chi-square to see if there were differences according to subject variables, such as

gender. Such analyses are now considered standard, even necessary.

However, it is interesting to note that until fairly recently many of these analyses were not undertaken. The

paper reporting the results of Milgram's Lost Letter Technique includes the percentages of letters returned

only, broken down by the four addresses on the envelopes. We are left to draw our own conclusions about

the fact that 25% of the letters addressed to ‘Friends of the Communist Party’, and to ‘Friends of the Nazi

Party’ were posted, while 72% of the letters posted to ‘Medical Research Associates’ were posted. We are not

told if these numbers are in any way significant. Similarly, Milgram tells us only the numbers of participants

who stopped administering shocks at each level – he does not even give us percentages. He does tell us

how many participants are in each of the three age groups, and in each of the three occupational groupings,

but he does not tell us if there are any differences between the groups in terms of shocks administered. Of

course, given that the sample is relatively small, if resulting differences are small it could simply be due to

coincidence, BUT differences between the groups could be large (and therefore probably significant) – we

simply do not know. We suspect that if a researcher were to submit a manuscript for publication that contained

so little analysis now, it would be rejected. We suggest that the possibility of undertaking complex analysis

quickly and easily has rendered it a necessity.

Conclusion

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 22 of 25 : Experimental Research – in the Laboratory and Beyond

Experimental research has a somewhat checkered history in the social sciences – arguably it is over-utilised

in social psychology and under-utilised in the other disciplines. Certainly, it has much to offer in terms of

discovering cause-and-effect relationships and uncovering behaviour that may be difficult to investigate using

other means. After all, if most people were asked ‘If I put you in a room with an electric shock machine …

(etc.) … would you give the “learner” a 450 volt shock that could seriously harm him or her?’, we suspect they

would say ‘No!’, yet we know that in some circumstances that is precisely what the majority of people will do.

The experiment is a variable-centric approach and fixed (once the experimental protocol has been decided

upon and experiments begun, it is problematic to change it), so careful planning is essential. Experiments,

like all research, can vary in their efficacy. In order to conduct a successful experiment, we must have

clearly defined variables – it is a variable-centric, fixed approach, based on the positivist perspective, so it

is imperative that variables do measure the thing we want them to measure – and a high level of realism,

especially if deception is involved.

A well-designed experiment can tell us much about individual behaviour in specific circumstances. In addition,

the high level of control the experimenter has and the straightforward analysis process means that it is a

straightforward approach for emergent researchers.

However, experiments alone tell us little about participants' underlying thoughts and motivations, and nothing

at all about the social context that lead to the development of those thoughts and motivations. Indeed, some

of these experiments remind us of Ring's comment about trivialising important issues to ‘fun and games’

experiments. We may learn that people continue to hold racial prejudices, but experiments such as these,

unless supplemented with other approaches, may be of limited practical use. Of course, one may argue

that knowing that people hold prejudices (for example) is important for its own sake. Nevertheless, some

researchers will want to apply their findings in practical ways – to respond to Lewin's call for ‘action’. To do

so, the application of other approaches, perhaps incorporated in a debriefing session, are required.

Further Readings

There are many texts which provide detailed discussion of the use of the experiment for social research. Your

university library will probably have several of good quality. However, such texts tend to gloss the potential

disadvantages. Ring's (1967), Gergen's (1973) and Helmreich's (1975) criticisms of social psychology,

particularly the use of the experiment, are also instructive. Although dating from the time of the crisis in social

psychology, many of the concerns of the day remain relevant.

Milgram's original account of his obedience experiment is worth reading in its entirety, both as an example of

the approach as used 50 years ago and for the interesting quotes from participants.

References

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 23 of 25 : Experimental Research – in the Laboratory and Beyond

Ahmed A. M. 2010. Muslim discrimination: Evidence from two lost-letter experiments. Journal of Applied

Social Psychology, 40, 888–898.

Binet A., & Henri V. 1894. De la suggestibilité naturelle chez les enfants. Revue Philosophique, 38, 337–347.

Burwell R. J. 1987. The lost letter experiment: A class exercise for research methods. Teaching Sociology,

15, 195–196.

Bushman B. J., & Bonacci A. M. 2004. You've got mail: Using e-mail to examine the effect of prejudiced

attitudes on discrimination against Arabs. Journal of Experimental Social Psychology, 40, 753–759.

Charcot J. M. 1881. Oeuvres complètes. Paris: Bureaux du Progrès Médical.

Correll J., Park B., Judd C., & Wittenbrink B. 2002. The police officer's dilemma: Using ethnicity to

disambiguate potentially threatening individuals. Journal of Personality and Social Psychology, 83,

1314–1329.

Gergen K. J. 1973. Social psychology as history. Journal of Personality and Social Psychology, 26, 309–320.

Helmreich R. 1975. Applied social psychology: The unfulfilled promise. Personality and Social Psychology

Bulletin, 1, 548–560.

Hepburn A. 2003. An introduction to critical social psychology. London: Sage.

Judd C. M., Blair I. V., & Chapleau K. M. 2004. Automatic stereotypes vs. automatic prejudice: Sorting

out the possibilities in the Payne (2001) weapon paradigm. Journal of Experimental Social Psychology, 40,

75–81.

Kassin S., Fein S., & Markus H. R. 2008. Social psychology. Boston, MA: Houghton Mifflin.

Kremer J., Barry R., & McNally A. 1986. The misdirected letter and the quasi-questionnaire: Unobtrusive

measures of prejudice in Northern Ireland. Journal of Applied Social Psychology, 16, 303–309.

Lewin K. 1951. Problems of research in social psychology. In: Cartwright D. (ed.), Field theory in social

science. New York: Harper & Row.

Milgram S. 1963. Behavioral study of obedience. Journal of Abnormal and Social Psychology, 67, 371–378.

Milgram S. 1974. Obedience to authority: An experimental view. New York: Harper & Row.

Milgram S., Mann L., & Harter S. 1965. The lost letter technique. Public Opinion Quarterly, 29, 437–438.

Moscovici S. 1972. Society and theory in social psychology. In: Israel J., & Tajfel H. (ed.), The context of

social psychology: a critical assessment. London: Academic Press.

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 24 of 25 : Experimental Research – in the Laboratory and Beyond

Payne B. K. 2001. Prejudice and perception: The role of automatic and controlled processes in misperceiving

a weapon. Journal of Personality and Social Psychology, 81, 181–192.

Ring K. 1967. Experimental social psychology: Some sober questions about some frivolous values. Journal

of Experimental Social Psychology, 3, 113–123.

Ringelmann M. 1913. Recherches sur les moteurs animés: Travail de l'homme. Annales de l'Institut National

Agronomique, 2, 1–40.

Triplett N. D. 1898. The dynamogenic factor in pacemaking and competition. American Journal of

Psychology, 9, 507–533.

Tykocinski O. E., & Bareket-Bojmel L. 2009. The lost e-mail technique: Use of an implicit measure to assess

discriminatory attitudes toward two minority groups in Israel. Journal of Applied Social Psychology, 39, 62–81.

http://dx.doi.org/10.4135/9781526435415.n7

SAGE

2011 SAGE Publications, Ltd. All Rights Reserved.

SAGE Research Methods

Page 25 of 25 : Experimental Research – in the Laboratory and Beyond

  • : Experimental Research – in the Laboratory and Beyond
    • In: Social Research: A Practical Introduction