Measuring N170 Potential using Gaming EEG System

Michelle_Michy
Luck_chapter2.pdf

An Introduction to the Event-Related Potential Technique, second edition Steven J. Luck

© 2014 Massachusetts Institute of Technology

All rights reserved. No part of this book may be reproduced in any form by any electronic or mechanical means (including

photocopying, recording, or information storage and retrieval) without permission in writing from the publisher.

MIT Press books may be purchased at special quantity discounts for business or sales promotional use. For information,

please email special_sales@mitpress.mit.edu.

This book was set in Times LT Std by Toppan Best-set Premedia Limited, Hong Kong. Printed and bound in the United

States of America.

Library of Congress Cataloging-in-Publication Data

Luck, Steven J. (Steven John), 1963 – author.

An introduction to the event-related potential technique / Steven J. Luck — Second edition.

p. ; cm.

Includes bibliographical references and index.

ISBN 978-0-262-52585-5 (pbk. : alk. paper)

I. Title.

[DNLM: 1. Evoked Potentials. WL 102]

QP376.5

616.8'047547 — dc23

2013041747

10 9 8 7 6 5 4 3 2 1

2 A Closer Look at ERPs and ERP Components

Overview

This chapter provides a deeper analysis of the nature of ERPs, with the goal of helping you

understand how ERPs are generated in the brain and how the intracranial signals combine

together to create the waveforms we record on the surface of the scalp. The peaks that you see

in scalp ERP waveforms typically reflect the sum of multiple internal, underlying components,

and the conclusions of an experiment often require determining how these underlying compo-

nents differed across groups or conditions. However, as I mentioned in chapter 1 (and will

continue to emphasize throughout this book), it is very challenging to isolate and measure the

internal underlying components on the basis of the data that you actually record from the scalp.

This is called the superposition problem , and this chapter will explain how it arises and why it is so difficult to solve. This can be a little depressing, but don ’ t despair! Chapter 4 describes

some strategies you can use to solve this problem (or work around it), along with examples of

previous experiments that have successfully used these strategies to provide definitive answers

to important scientific questions. Chapter 3 provides a description of the specific ERP compo-

nents that are most commonly observed in typical experiments.

It is easy to confuse the peaks that we observe in our scalp waveforms with the internal

underlying brain components that sum together to create these peaks. In this chapter, I frequently

use the phrase underlying component to make it clear that I am talking about internal brain activity, not the observed peaks on the scalp.

This chapter will begin with a discussion of basic electrical concepts, followed by a detailed

discussion of the neural origins of ERPs and how they mix together in scalp recordings. We will

then discuss why it is so difficult to localize the internal generators of scalp ERPs. This will be

followed by a description of how the ERPs we record from scalp electrodes can radically mis-

represent the underlying components. We will then discuss how difference waves can be used

to solve this problem and isolate the underlying components. We will end by considering dif-

ferent ways of defining the term ERP component . You might think that it would be more logical to start with the definitions, but they will be much easier to grasp once you ’ ve spent some time

thinking concretely about the nature of the signals we record from scalp electrodes. The chapter

36 Chapter 2

ends with a general discussion of methods that can be used to identify the underlying components

that vary across groups or conditions in a given study.

This chapter is designed to provide you with a deep conceptual understanding of ERPs, and

it is therefore somewhat abstract. You need this kind of understanding to evaluate previous ERP

experiments and to design new ERP experiments. In fact, this chapter includes six specific

“ rules ” for interpreting ERP data. However, if you are in the middle of collecting or analyzing

data from an ERP experiment and you need to know the practical details of how to collect clean

data and perform appropriate analyses, you might want to skip ahead to chapters 5 – 10. You can

then come back to chapters 2 – 4 when you are ready to think more deeply about your results and

to design new experiments. But don ’ t forget to come back later, because chapter 2 is ultimately

the most important chapter in the entire book.

Basic Electrical Concepts

If you are going to measure electrical signals in your research, you need to know a little bit

about electricity. In this section, I will provide an overview of the most basic terms and concepts.

If you took physics in high school or college, you have already learned these terms and concepts

(although many people have told me that electricity was the part of physics they never really

understood). If you didn ’ t take physics, then don ’ t worry, because I didn ’ t either. When I was

in high school, I took shop classes on electronics rather than physics, because I was more inter-

ested in being able to fix my guitar amplifier than I was in learning to predict the velocity of a

ball rolling down a hill. Fortunately, the relevant concepts are pretty simple.

Current Current is the actual flow of electricity (charged particles) through a conductor. It is a measure of

the number of charge units (electrons or protons) that flow past a given point in a specific amount

of time. Current is measured in amperes, where 1 ampere is equal to 1 coulomb (6.24 × 10 18 ) of charge units moving past a single point in 1 second.

By convention, physicists and engineers refer to current flowing in the direction of positively

charged particles. For example, 1 coulomb of positively charged particles flowing from the left

hemisphere to the right hemisphere is viewed as being electrically equivalent to 1 coulomb of

negatively charged particles flowing from the right hemisphere to the left hemisphere. Thus, we

would talk about the current flowing from the left hemisphere to the right hemisphere in both

cases, regardless of whether it is positive charges moving from left to right or negative charges

moving from right to left.

When discussing the flow of electricity, it is useful to use the flow of water as an analogy.

Electrical current is analogous to the flow of water through a pipe or hose. Just as we can measure

the amount of water that passes a given point in a pipe in a given period of time (e.g., 3.6 liters

per minute), we can measure the amount of electricity that passes a given point in a conductor

in a given period of time (e.g., 3.6 coulombs per second).

A Closer Look at ERPs and ERP Components 37

Voltage People often get voltage and current mixed up, thinking that voltage is the actual flow of elec-

tricity. Current is the flow of electricity through a conductor, and voltage is the pressure that

pushes the electrical current through the conductor. A more precise term for voltage is electrical potential , because it is the potential for electrical current to flow from one place to another (and this is why ERPs are event-related potentials ). As an analogy, consider a tank of water at the top of a hill. There is a lot of potential for the water to flow from the tank to the bottom of the

hill, but little potential for the water to flow from the bottom to the top. Importantly, the potential

for water to flow downhill is present even if no water is flowing at a given moment (e.g., because

a valve is closed). Similarly, there is potential for electrical current to flow from one terminal

of a car battery to the other even if no current is flowing. Electrical potential is closely analogous

to water pressure: there can be a lot of water pressure in a hose even if the end of the hose is

closed and no water is actually flowing. However, once the end of the hose is opened, the pres-

sure will cause water to flow. Also, no water will flow out of the end of the hose unless there

is sufficient water pressure, just as no electrical current will flow without an electrical potential

to push it through the conductor.

Electrical potential is measured in units of volts (V). ERPs are so small that they are usually

quantified in microvolts ( μ V), where 1 microvolt is 1 millionth of a volt. You can imagine that the 120 V running through the video monitor in front of a subject could lead to significant

interference when you are looking for a 1 μ V experimental effect. This kind of interference will be discussed in more detail in chapter 5.

Resistance Resistance is the ability of a substance to keep charged particles from passing. It is the inverse

of conductance , which is the ability of a substance to allow charged particles to pass through it. Three main factors contribute to resistance: (1) the composition of the substance, (2) its length,

and (3) its diameter. Because of their molecular properties, some substances conduct electricity

better than others (e.g., copper is a better conductor than zinc). However, the ability of any

substance to conduct electricity will be reduced if it is very thin or very long. Resistance is

measured in ohms ( Ω ). To use yet another hydraulic example, consider a water filtration system in which the water

supply for a house passes through a large tank filled with carbon. If the carbon is tightly packed,

water will not easily flow through the tank, but if the carbon is loosely packed, water will flow

easily. This is analogous to the dependence of electrical resistance on the properties of the

substance. Now imagine water passing through a hose. If the hose is 100 m long and only 1

cm wide, it will resist the flow of water, and a great deal of pressure will be necessary to fill

a bucket in a short amount of time. However, if the hose is shorter (e.g., 1 m) or wider (e.g.,

10 cm), the bucket will fill quickly with only a moderate amount of water pressure. This

is analogous to the dependence of electrical resistance on the length and diameter of the

conductor.

38 Chapter 2

This last analogy also illustrates the relationship between voltage, current, and resistance. If

a thin hose is used, the volume of water that passes out of the end of the hose will be small rela-

tive to what would be obtained with a wider hose and the same water pressure. Similarly, if

voltage stays the same and the resistance increases, the current will decrease. However, if a thin

hose is used, a large volume of water can be obtained in a given time period by increasing the

water pressure. Similarly, it is possible to maintain a constant current when the resistance is

increased by increasing the voltage.

We will talk more about how resistance and its close cousin impedance influence EEG record- ings in chapter 5.

Electricity and Magnetism Electricity and magnetism are fundamentally related to each other, and it is important to under-

stand this relationship to understand how electrical noise is picked up in ERP recordings and

how MEG recordings are related to EEG recordings (which will be described in more detail

later). The flow of current through a conductor is always accompanied by a magnetic field that

flows around the conductor. Moreover, if a magnetic field passes through a conductor, it induces

an electrical current. These two principles are illustrated in figure 2.1 , which shows what happens

when a current is passed through one of two nearby conductors. The flow of current through

one of the conductors generates a magnetic field, which in turn induces current flow in the other

conductor. This is how electrical noise in the environment (e.g., the 120 V in the video display)

Current flows through conductor #1

Magnetic field induces current flow in conductor #2

Magnetic field generated around conductor #1

Conductor #1

Conductor #2

Figure 2.1 Relationship between electricity and magnetism. A current is passed through conductor #1, and this generates a magnetic

field that circles around conductor #1. As this magnetic field passes through conductor #2, it induces a small current in

conductor #2. This is how electrical devices located near the subject can induce artifactual electrical “ noise ” in the

electrodes or electrode wires. It is also the basic principle underlying magnetoencephalogram (MEG) recordings.

A Closer Look at ERPs and ERP Components 39

can induce electrical activity in an ERP subject, in the electrodes, or in the wires leading from

the electrodes to the amplifier (see chapter 5 for details).

The Neural Origins of ERPs

The neural origins of ERPs were described briefly in chapter 1, and here we take a much closer

look. We will also discuss why it is so difficult to localize ERPs on the basis of scalp recordings

(an even more detailed discussion of ERP localization is provided in online chapter 14). For a

more detailed description of the neural and biophysical events underlying the generation of EEG

and ERP signals, see the review by Buzs á ki, Anastassiou, and Koch (2012).

Electrical Activity in Individual Neurons Neurons produce two main types of electrical activity, action potentials and postsynaptic poten- tials . Action potentials are discrete voltage spikes that travel from the beginning of the axon at the cell body to the axon terminals, where neurotransmitters are released. Postsynaptic potentials

are the voltages that arise when the neurotransmitters bind to receptors on the membrane of the

postsynaptic cell, causing ion channels to open or close and leading to a graded change in the

voltage across the cell membrane. It is fairly easy to isolate the action potentials arising from a

single neuron by inserting a microelectrode into the brain, but it is virtually impossible to com-

pletely isolate a single neuron ’ s postsynaptic potentials in an in vivo extracellular recording (because PSPs from different neurons become mixed together in the extracellular space). Con-

sequently, in vivo recordings of individual neurons ( “ single-unit ” recordings) measure action potentials rather than postsynaptic potentials. When many neurons are recorded simultaneously,

however, it is possible to measure either their summed postsynaptic potentials or their action

potentials. Recordings of action potentials from large groups of neurons are called multi-unit recordings, and recordings of postsynaptic potentials from large groups of neurons are called

local field potential recordings. 1 In the vast majority of cases, action potentials cannot be detected by surface (scalp) electrodes

because of the timing of the action potentials and the physical arrangement of axons. When an

action potential is generated, current flows rapidly into and then out of the axon at one point

along the axon, and then this same inflow and outflow occur at the next point along the axon,

and so on until the action potential reaches a terminal. If two neurons send their action potentials

down axons that run parallel to each other, and the action potentials occur at exactly the same

time, the voltages from the two neurons will summate. However, if one neuron fires slightly

after the other, current will be flowing into one axon at the same time that it is flowing out of

the adjacent location on the other axon, and these signals will cancel. Because large populations

of neurons do not usually fire at precisely the same time (i.e., within microseconds of each

other), action potentials cannot usually be recorded from the scalp. As a result, ERPs almost

always reflect postsynaptic potentials rather than action potentials. The main exceptions to

this rule are peaks I and II of the brainstem auditory evoked response, which reflect precisely

40 Chapter 2

synchronized action potentials generated in the cochlea and passing through the auditory nerve

within a few milliseconds after a sudden sound onset (Pratt, 2012). You can safely assume that

any ERPs recorded from the scalp more than 20 ms after stimulus onset reflect postsynaptic

potentials rather than action potentials.

Notably, the neural events leading to the fMRI BOLD signal are more heterogeneous. Any-

thing that increases glucose or oxygen utilization can potentially lead to a change in regional

cerebral blood flow and impact the BOLD signal, whereas ERPs solely reflect postsynaptic

potentials. I like to think about this fact when I ’ m talking to a neuroimaging researcher who is

smug about having millimeter-level spatial resolution, but I try not to say it aloud.

Summation of Postsynaptic Potentials Whereas the duration of an action potential is only about a millisecond, postsynaptic potentials

typically last tens or even hundreds of milliseconds. In addition, postsynaptic potentials are

largely confined to the dendrites and cell body. Under certain conditions, these factors allow

postsynaptic potentials from multiple neurons to summate, making it possible to record them at

a great distance (i.e., at the scalp).

Figure 2.2 shows how postsynaptic potentials are thought to lead to electrical potentials that

can be recorded on the scalp. 2 Scalp ERPs are thought to arise mainly from pyramidal cells , the main input – output cells of the cerebral cortex. These cells have a set of basal dendrites and a

single apical dendrite (all of which branch out after leaving the cell body; see figure 2.2A ). The

overall spatial configuration of the cell body and dendrites looks like a pyramid (if you use a

lot of imagination). Cortical pyramidal cells are all oriented perpendicular to the cortical surface,

with the apical dendrite heading in the direction of the cortical surface and the cell body and

basal dendrites located closer to the white matter (see figure 2.2B ).

If an excitatory neurotransmitter is released at the apical dendrite of a cortical pyramidal cell,

as shown in figure 2.2A , an electrical current (in the form of positively charged ions) will flow

from the extracellular space into the cell, yielding a net negativity on the outside of the cell in

the region of the apical dendrites. To complete the circuit, current will also flow out of the cell

body and basal dendrites, yielding a net positivity in this area. This flow of current creates a

tiny dipole (a pair of positive and negative electrical charges separated by a small distance). If the postsynaptic potential is inhibitory rather than excitatory, this reverses the flow of current

and changes the polarity of the signal recorded on the scalp. If the postsynaptic potential occurs

at the cell body or basal dendrites rather than at the apical dendrite, this also changes the polarity

(see box 2.1 for a discussion of the meaning of an ERP component ’ s polarity).

The dipole from a single neuron is so small that it cannot be recorded from a scalp electrode,

but the dipoles from many neurons will sum together, making it possible to measure the resulting

voltage at the scalp. For the voltages from the individual neurons to summate and be recordable

at the scalp, they must be present at the same time across thousands or millions of neurons, and

the dipoles from the individual neurons must be spatially aligned. If the neurons are at random

orientations with respect to each other, then the positivity from one neuron may be adjacent to

A Closer Look at ERPs and ERP Components 41

Axons from presynaptic cells

Apical dendrite

Postsynaptic cell axon

Basal dendrites

Cell body

A Stimulated

region of cortex

B

Equivalent current dipoleSkull

CortexC

D E

Current dipole

Magnetic field

Skull

Magnetic Field leaving head

Magnetic field entering head

Cortical surface

White matter

Figure 2.2 Principles of ERP generation. (A) Schematic pyramidal cell during neurotransmission. An excitatory neurotransmitter

is released from the presynaptic terminals in the apical dendrite, causing positive ions to flow into this region of the

postsynaptic neuron. This creates a net negative extracellular voltage (represented by the “ – ” symbols) just outside the

apical dendrite. To complete the circuit, voltage will flow through the neuron and then exit in the region of the cell body

and basal dendrites (represented by the “ + ” symbols). This flow of current forms a small dipole. The polarity of this

dipole would be inverted if an inhibitory neurotransmitter were released rather than an excitatory neurotransmitter. It

would also be inverted if the neurotransmission occurred at the cell body or basal dendrites rather than at the apical

dendrite. (B) Folded sheet of cortex containing many pyramidal cells. When a region of this sheet is stimulated, the

dipoles from the individual neurons summate. (C) The summated dipoles from the individual neurons can be approxi-

mated by a single equivalent current dipole, shown here as an arrow. By convention, the arrowhead indicates the positive

end of the dipole. The position and orientation of this dipole determine the distribution of positive and negative voltages

recorded at the surface of the head. (D) Example of a current dipole with a magnetic field traveling around it. (E)

Example of the magnetic field generated by a dipole that lies just inside the surface of the skull. If the dipole is roughly

parallel to the surface, the magnetic field can be recorded as it leaves and enters the head; no field can be recorded if

the dipole is oriented radially (perpendicular to the surface). Reprinted with permission from Luck and Girelli (1998).

Copyright 1998 MIT Press.

42 Chapter 2

the negativity from the next neuron, leading to cancellation. Similarly, if one neuron receives an

excitatory neurotransmitter and another receives an inhibitory neurotransmitter, the dipoles of

the neurons will be in opposite directions and will cancel. However, if the neurons all have a

similar orientation and all receive the same type of input, their dipoles will summate and may

be measurable at the scalp. This is much more likely to happen in cortical pyramidal cells than

in other cell types or in other brain structures, so ERPs mainly arise from the pyramidal cells.

The summation of the individual dipoles is complicated by the fact that the cortex is not flat,

but instead has many folds. Fortunately, however, physicists have demonstrated that the summa-

tion of many nearby dipoles is essentially equivalent to a single dipole formed by averaging the

orientations of the individual dipoles. 3 This averaged dipole is called an equivalent current dipole . It is important to note, however, that whenever the individual dipoles are more than 90 ° from each other, they will cancel each other to some extent, with complete cancellation at 180 ° .

For example, the orientation of the neurons in the basal ganglia is largely random, making it

difficult or impossible to record basal ganglia activity from the scalp.

An important consequence of these facts about ERP generation is that only a fraction of brain

processes will produce a scalp ERP “ signature. ” To produce a measurable signal on the scalp,

the following conditions must be met:

• Large numbers of neurons must be activated at the same time.

• The individual neurons must have approximately the same orientation.

• The postsynaptic potentials for the majority of the neurons must arise from the same part of

the neurons (either the apical dendrite or the cell body and basal dendrites).

• The majority of the neurons must have the same direction of current flow to avoid cancellation.

Box 2.1 What Does the Polarity of an ERP Component Mean?

I am often asked whether it “ means something ” if a component is positive or negative. My response

is that polarity depends on a combination of four factors:

• Whether the postsynaptic potential is excitatory or inhibitory

• Whether the postsynaptic potential is occurring in the apical dendrite or the basal dendrites and

cell body

• The location and orientation of the generator dipole with respect to the active recording electrode

• The location of the reference electrode (which will be discussed in chapter 5)

If you know three of these factors, then the polarity of the ERP can be used to infer the fourth

factor. But we don ’ t usually know three of these factors, so the polarity doesn ’ t usually tell us

anything. Polarity at a given site is meaningful only insofar as most components will have a constant

polarity over a given region of the head, and the polarity of a component can help us determine

which component we are seeing. But it cannot ordinarily be used to determine whether the compo-

nent reflects excitation or inhibition.

A Closer Look at ERPs and ERP Components 43

Volume Conduction When a dipole is present in a conductive medium such as the brain, current is conducted through

that medium until it reaches the surface. This is called volume conduction and is illustrated in figure 2.2C . I should note, however, that I don ’ t like the term volume conduction very much because it might be taken to imply that we are recording charged particles that pass from the

neurons all the way to the scalp electrodes. That isn ’ t the way it works. By analogy, when

electricity is generated in a power plant and flows through power lines, the electrons don ’ t go

all the way from the power plant to your house. Instead, one electron pushes the next one,

which pushes the next one, and so forth. Similarly, when a dipole is active in the brain, you

don ’ t have to wait for charged particles to move all the way from the dipole to the surface.

Instead, a postsynaptic potential in a set of neurons creates an essentially instantaneous voltage

field throughout the entirety of the head, with no meaningful delay. And don ’ t forget that you

are measuring voltage, which is the potential for current to flow and not the actual flow of

current.

Electricity does not just run directly between the two poles of a dipole in a conductive medium,

but instead spreads out across the conductor. Consequently, ERPs do not appear only at an

electrode located directly above the dipole but are instead picked up by electrodes located all

over the head. The high resistance of the skull causes the voltage to be even more widely dis-

tributed. Consequently, the scalp distribution of an ERP component is usually very broad.

Figure 2.2C illustrates a very important fact about ERP scalp distributions. For any dipole

location, the voltage will be positive over one portion of the scalp and negative over the remain-

der of the scalp, with an infinitesimally narrow band of zero that separates the positive and nega-

tive portions. In many cases, one of these portions will fall over a part of the head where you

don ’ t have any electrodes (e.g., the face or the bottom of the brain), so you might not see both

sides of the dipole. In other cases, you will be able to see both the positive and negative sides

of the dipoles. 4 The zero band that separates the positive and negative sides of the head will be

in a different place for each different dipole, and no single zero band will be present when

multiple dipoles are active. Thus, there is no place on the head that consistently has zero voltage

(see chapter 5 for additional discussion).

Relationship between Dipoles and Components An equivalent current dipole represents the summed activity of a large number of nearby neurons.

How is this related to the concept of an ERP component ? We will define the term ERP component more carefully later in this chapter, but we can make a simple link to the concept of a dipole at

this time. Specifically, when an equivalent current dipole represents the activity of a single

functional brain region, this dipole can be considered to be the same thing as an ERP component.

The moment-by-moment changes in the magnitude of the dipole (sometimes called the dipole ’ s

source waveform ) constitute the time course of the ERP component. As will be described soon, all of the different dipoles in the brain sum together to give us the complex pattern of positive

and negative peaks that we record from our scalp electrodes.

44 Chapter 2

Magnetic Fields The blurring of voltage caused by the high resistance of the skull can be largely circumvented

by recording magnetic fields instead of electrical potentials. As illustrated in figure 2.2D , an

electrical dipole is always surrounded by a magnetic field of proportionate strength, and these

fields summate in the same manner as voltages. Thus, whenever an ERP is generated, a magnetic

field is also generated, running around the ERP dipole. Moreover, the skull is transparent to

magnetism, so the magnetic fields are not blurred by the skull, 5 leading to much greater spatial

resolution than is possible with electrical potentials. The magnetic equivalent of the EEG is

called the magnetoencephalogram (MEG), and the magnetic equivalent of an ERP is an event- related magnetic field (EMRF).

As illustrated in figure 2.2E , a dipole that is parallel ( tangential ) to the surface of the scalp will be accompanied by a magnetic field that leaves the head on one side of the dipole and enters

back again on the other side. If a highly sensitive probe called a SQUID (superconducting

quantum interference device) is placed next to the head, it is possible to measure the magnetic

field as it leaves and reenters the head. However, if the dipole is perpendicular to the surface of

the head (a radial dipole), the magnetic field running around the dipole will not leave the head, and it will be “ invisible ” to the SQUID. For orientations that are between tangential and radial,

the strength of the magnetic field that is recorded from outside the head gets weaker and weaker

the more radial the dipole is. Similarly, the strength of the extracranial magnetic field becomes

very weak for deep dipoles. Thus, MEG recordings are primarily sensitive to superficial, tan-

gential dipoles.

Because magnetic fields are not as widely dispersed as electrical potentials, they can provide

more precise localization. However, as will be discussed in online chapter 14, the combination

of ERP and ERMF recordings provides even better localization than ERMF recordings alone.

Unfortunately, three factors make magnetic recordings very expensive: the SQUID is expensive;

the coolant must be continually replenished; and an expensive magnetically shielded recording

chamber is necessary to attenuate the earth ’ s magnetic field, which is orders of magnitude larger

than the MEG signal. Thus, MEG/ERMF recordings are much less common than EEG/ERP

recordings.

The Forward Problem and the Superposition of Components on the Scalp

If I tell you the locations and orientations of a set of dipoles in a brain, along with the shape

and conductances of the brain, skull, and scalp, then it would be possible for you to use a

relatively simple set of equations to compute the distribution of voltage that would be observed

for those dipoles. This is called the forward problem, and it is relatively easy to solve. In this section, I will spend some time explaining how the forward problem is solved in a little bit of

detail, because it will help you to understand exactly how the ERP components generated in the

brain become mixed together in the scalp electrodes. This creates the superposition problem,

which I described briefly in chapter 1 and which is often the biggest impediment in ERP studies.

A Closer Look at ERPs and ERP Components 45

Describing the forward problem in detail requires me to introduce a little bit of math, but only

a very little bit (see box 2.2 ).

The forward problem is illustrated in figure 2.3 , which shows the ERP waveforms for three

hypothetical generator locations. These are called source waveforms , and they show the time course of voltage for the components. For any given electrode location on the scalp, a fixed percent-

age of the source waveform from a given internal generator will propagate to the electrode. That

is, X% of the voltage from a given generator will be propagated to a given electrode site, where X

has a different value for each combination of internal generator and external scalp electrode. The

percentage of the voltage that is propagated will depend on the position and orientation of the

generator with respect to the position of the electrode, along with the conductivity of the various

tissues within the head. For example, a high percentage of the voltage from a superficial radial

dipole will propagate to an electrode on the scalp right over the top of the generator, and a smaller

(and inverted) percentage will propagate to an electrode on the opposite side of the head.

It ’ s actually possible to estimate the percentage of propagation from each possible generator

location to each electrode site in a given subject by creating a model of the subject ’ s head from

a structural MRI scan, combined with normative conductivity values for the different tissues that

constitute the head (e.g., brain, skull, scalp). There are software packages that can provide the

propagation factors if you provide a structural MRI scan.

Box 2.2 Four Kinds of Math

ERP waveforms are sequences of numbers, so you can ’ t escape a little bit of math if you want to

understand ERPs. If you are a “ math person, ” then you are probably looking forward to seeing some

mathematical formalisms as you read the book. But if you are not a math person, the last thing

you ’ ll want to see is a lot of equations. There will be some equations in the following chapters, but

they will be very simple. And I promise not to write something like “ As will be obvious from equa-

tion 6.4 … ” because I don ’ t usually find that an equation makes something obvious.

I am not a math person. I managed to get through a year of calculus in my freshman year of

college, but that ’ s as far as I got. And I remember almost nothing I learned that year. So this book

contains no calculus.

It turns out that you can understand all the key mathematical concepts involved in ERP research

without any calculus. In fact, everything I will describe can be boiled down to addition, subtraction,

multiplication, and division. If you are not a math person, you can use these four simple mathemati-

cal operations to understand things like Fourier analysis, time – frequency analysis, and how filters

work. This will require that you spend some time looking at simple equations that combine these

four simple operations in interesting ways, but ultimately it ’ s all just arithmetic that anyone can

understand.

If you are a math person, I think you will find that boiling things down in this way allows you

to find a new appreciation for the logic that underlies the mathematics. And you will probably

discover that there are some important things about Fourier analysis, time – frequency analysis, and

filtering that you did not fully appreciate before.

46 Chapter 2

Figure 2.3 Relation between the underlying component waveforms and the observed scalp waveforms. In this example, three

components are present (C1, C2, C3), each of which has a source waveform (time course of voltage, shown at the bottom

left) and a generator location (represented by the arrows in the head). The contribution of each component waveform

to the observed waveform at a given electrode site is determined by a weighting factor that reflects the location and

orientation of the generator relative to that electrode, along with the conductivity of the tissues that form the head. The

table shows the weighting factors between the three components, and the three electrode sites are given in the table (but

note that these are made-up values, not the actual weighting factors from a real head). The observed waveform at a given

electrode site (shown at the bottom right) is equal to the sum of each of the component waveforms, multiplied by the

weighting factor between each component and that electrode site. The weights are indicated by the w ’ s on the arrows between the component waveforms and the observed waveforms (e.g., w 2,3 represents the weighting factor between component 2 and electrode 3). Adapted with permission from Kappenman and Luck (2012). Copyright 2012 Oxford

University Press.

A Closer Look at ERPs and ERP Components 47

The proportion of the voltage that is propagated from a given generator location to a given

electrode site is called the weight between the generator and the electrode. There is a separate weight for each combination of generator location and electrode site. In figure 2.3 , the weights

are denoted as w x,y , where x is the generator location and y is the electrode site. For example, the weight from generator 2 to electrode 1 is denoted w 2,1 . I just made up the weights shown in figure 2.3 — they are not the actual weights. For example, real weights would be much less than

1.0, because only a small proportion of the generator voltage makes it all the way out to the

scalp electrodes. The real weights would be provided by a software package on the basis of a

subject ’ s structural MRI data. Note that the weights are proportions (between – 1 and +1), not

percentages.

The “ fake ” weights shown in figure 2.3 make it easy to see how the waveforms at a given

electrode site reflect the weighted sum of the internal underlying components. For example, the

waveform at electrode E1 is simply the C1 waveform plus 25% of the C2 waveform plus 50%

of the C3 waveform (because the weights from components C1 – C3 to electrode E1 are 1.0, 0.25,

and 0.50, respectively). If you think this seems very simple, you ’ re right! The waveform at a

given electrode site is always just the weighted sum of all the source waveforms. This simplicity

arises from the fact that voltages simply add together in a simple conductor like the human head.

Note that the voltage at a given electrode site is a weighted sum of all the underlying com- ponents. The weights will be negative on one side of the head and positive on the other, with

a narrow band where the weights are zero at the transition between the positive and negative

sides of the dipoles. And the weights may be quite small near this band. However, a given

electrode site picks up at least some voltage from almost every component in the brain. This

means that almost all of the components are mixed together at every electrode site. How many

different components are likely to be mixed together in a given experiment? It is difficult to

know for sure, but dozens may be present in a typical experiment. For example, evidence for

at least 10 different sources was found in the brief period from 50 to 200 ms after the onset of

an auditory stimulus in a simple target detection task (Picton et al, 1999), and many more are

presumably active when a longer time range is examined and when the subject is engaged in

a complex task.

In most cases, we are interested in measuring individual components, not the mixture that we

record at a given scalp electrode. Unfortunately, there is no foolproof way to recover the under-

lying components from the recordings obtained at the scalp. There are many different methods

that attempt to recover the underlying components (see box 2.3 ), but all of them are based on

assumptions that are either known to be false or are not known to be true. The mere fact that

each of these techniques will arrive at a different solution is an indication that most (or all) of

them are incorrect.

As this chapter progresses, we will see why this superposition problem makes life so difficult

for ERP researchers. You may find this depressing, but keep in mind that we will eventually

discuss a set of strategies for overcoming the superposition problem. And the online supplement

to chapter 4 describes some excellent examples of previous studies that have overcome the

48 Chapter 2

Box 2.3 Unmixing the Components

The mixing of underlying components in scalp recordings is such an important problem that many

different mathematical procedures have been developed to unmix them. The best known and most

widely used procedures are dipole localization methods, principal component analysis, independent

component analysis, Fourier analysis, and time-frequency analysis. All of these will be covered in

some detail later in the book.

Although these techniques seem very different from each other, they all share a fundamental

underlying structure, and it is worth understanding this structure. Specifically, they all assume that

the waveforms recorded at the scalp consist of the weighted sum of a set of underlying basis func- tions . However, they make different assumptions about the nature of these basis functions. For example, Fourier analysis assumes that the basis functions are sine waves, but it makes no assump-

tions about the pattern across the scalp; in contrast, dipole localization methods make no assumptions

about the time course of the basis functions, but they assume that the scalp distribution reflects the

conductivity of the brain, skull, and scalp. These techniques also differ widely in the mathematical

approach that is used to unmix the observed waveforms. However, it is useful to keep in mind that

they all assume that the observed waveforms consist of the sum of a relatively small set of basis

functions.

superposition problem and made important contributions to our understanding of the mind and

brain.

The Challenges of ERP Localization

This section provides a brief discussion of the challenges involved in localizing ERPs. A more

complete discussion is provided in online chapter 14.

As noted in the description of the forward problem, it is relatively easy to compute the dis-

tribution of voltage on the scalp if you know the locations and orientations of the dipoles (and

have a structural MRI scan). However, if I provide you with an observed voltage distribution

from the scalp and ask you to tell me the locations and orientations of the dipoles, you will not

be able to provide an answer with 100% confidence. This is the inverse problem , and it is what mathematicians call an “ ill-posed ” or “ underdetermined ” problem. This simply means that there

are multiple different sets of dipoles that can perfectly explain a given voltage distribution, and

there is no way to tell which is the correct one. In fact, it has been known for more than 150

years that an infinite number of different dipole configurations can produce any given voltage

distribution (Helmholtz, 1853; Nunez & Srinivasan, 2006; see also Plonsey, 1963). It is impos-

sible to know with certainty which one of these configurations is the one that is actually respon-

sible for producing the observed voltage distribution (unless you have other sources of

information, such as lesion data).

Figure 2.4 illustrates some of the challenges involved in localizing ERPs. When a single dipole

is present just under the skull and points directly outward (a superficial, radial dipole), a very

A Closer Look at ERPs and ERP Components 49

A

B

C

D

Figure 2.4 Scalp distributions (left) produced by different dipole configurations (right). See the text for descriptions of panels A – D

of the figure. Images courtesy of J. Bengson. Reprinted with permission from Luck (2012a). Copyright 2012 American

Psychological Association.

50 Chapter 2

focal voltage distribution is observed on the scalp, with the maximum voltage directly over the

location of the dipole ( figure 2.4A ). If this dipole is moved laterally even a small amount (e.g.,

1 cm), the voltage distribution will be noticeably different. Consequently, a dipole such as this

can be localized quite accurately simply by comparing the observed voltage distribution with

the distribution that would be produced by a single hypothetical dipole with various different

positions and orientations and selecting the dipole that best fits the observed data. Noise in the

data will distort the observed scalp distribution somewhat, but it is still possible to localize a

single superficial dipole like the one shown in figure 2.4A with reasonable accuracy as long as

the noise is not too large (and assuming you know for certain that it is a single superficial dipole). Multiple-dipole configurations also exist that can produce exactly the same distribution of

voltage as a single superficial dipole. Thus, we can localize a single superficial dipole with

considerable precision (assuming very low levels of noise in the average ERP waveforms), but

we cannot know from the scalp distribution that only a single superficial dipole is present.

Now consider the somewhat deeper dipole shown in figure 2.4B . The distribution of voltage

over the scalp is broader than that observed with the superficial dipole, and moving the dipole

laterally by a given amount won ’ t have as large of an effect on the scalp distribution for the

deeper dipole as it would for the superficial dipole. All else being equal, the deeper the dipole,

the smaller and more broadly distributed the voltage will be on the scalp, and the less accurately

it can be localized. An even more important problem arises in this situation, because it becomes

difficult to tell the difference between a deep but focal generator and a superficial generator that

extends across a large region of cortex. For example, the error-related negativity has a fairly

broad distribution that is consistent with a single deep generator in the anterior cingulate cortex

(Dehaene, Posner, & Tucker, 1994), but it is also consistent with the activation of a large patch

of superficial cortex.

Figure 2.4C shows a situation in which two dipoles are simultaneously active. Voltages arising

from different generators simply sum together, so the scalp distribution of the two simultaneous

dipoles is simply equal to the sum of the scalp distributions of the individual dipoles. In this

particular example, the two dipoles are both superficial, and they are quite far apart. It would be

possible for you to localize the dipoles quite accurately in this situation, assuming that the data

were not distorted by much noise and that you already knew that exactly two dipoles were active.

Figure 2.4D shows a different situation in which two dipoles are simultaneously active (the

dipoles from figure 2.4A and B ). This is a “ nightmare scenario ” for people who want to localize

ERPs, because the scalp distribution of the sum of the two dipoles is nearly indistinguishable

from the scalp distribution of the superficial dipole alone. Given even a small amount of noise,

it would be impossible to know whether the data were the result of a single superficial dipole

or two dipoles that are aligned in this fashion. The problem would still be bad even if the two

dipoles were not perfectly aligned. Moreover, imagine an experiment in which these two dipoles

were present, and an experimental manipulation led to a change in the amplitude of the deep

dipole with no effect in the superficial dipole. It is very likely that an experimental effect of this

nature would be attributed to the superficial dipole rather than the deep dipole.

A Closer Look at ERPs and ERP Components 51

As more and more simultaneous dipoles are added, the likelihood of dipoles aligning in this

manner increases, and it becomes more and more difficult to determine how many dipoles are

present and to localize them accurately. This problem becomes even worse when the data are

noisy. Under these conditions, a set of estimated dipole locations that matches the observed scalp

distribution can be quite far from the actual locations.

The examples shown in figure 2.4 are cause for both optimism and pessimism. You should be

optimistic because ERPs can be localized accurately if you can be certain that you have a single

dipole or that you have two to three dipoles that are not aligned in a way that makes them dif-

ficult to localize. But you should also be pessimistic because it will be difficult to localize ERPs

unless you know that you have a single dipole or that you have two to three dipoles that are not aligned in a way that makes them difficult to localize. Fortunately, there are some things you

can do to increase the likelihood that only a single dipole is active (e.g., performing localization

on difference waves that isolate a single component).

In many experiments, the number of dipoles could be very large ( > 10), and localizing ERPs solely on the basis of the observed scalp distribution would be impossible to do with confidence.

The only way to localize ERPs in this case is to add external constraints, and this is how existing

procedures for localizing ERPs solve the non-uniqueness problem. As will be discussed in detail

in online chapter 14, some common procedures allow the user to specify a fixed number of

dipoles (Scherg, 1990), whereas other procedures use structural MRI scans and constrain the

dipoles to be in the gray matter (Dale & Sereno, 1993; H ä m ä l ä inen, Hari, Ilmonieni, Knuutila,

& Lounasmaa, 1993) or choose the solution that best minimizes sudden changes from one patch

of cortex to the next (Pascual-Marqui, Esslen, Kochi, & Lehmann, 2002). Although each of these

methods produces a unique solution, they do not necessarily produce the correct solution. And

they may produce quite different solutions, further reducing our confidence that any of them has

found the correct solution.

The most significant shortcoming of mathematical procedures for localizing ERPs is that they

do not typically provide a well-justified margin of error. That is, they do not indicate the probabil-

ity that the estimated location falls within some number of millimeters from the actual location.

For example, I would like to be able to say that the N2pc component in a particular experiment

was generated within 9 mm of the center of the lateral occipital complex and that the probability

that this localization is incorrect is less than 0.05. I am aware of no mathematical localization

technique that is regularly used to provide this kind of information. 6 Without a margin of

error, it is difficult to judge the credibility of a given localization estimate. In most cases, the

strongest claim that can be made is that the observed data are consistent with a given generator

location.

Although it is usually impossible to definitively localize ERPs solely on the basis of the

observed scalp distributions, this does not mean that ERPs can never be localized. Specifically,

ERPs can be localized using the general hypothetico-deductive approach that is used throughout

science. That is, a hypothesis about the generator location for a given ERP effect leads to a set

of predictions, which are then tested by means of experiments. One prediction, of course, is that

52 Chapter 2

the observed scalp distribution will be consistent with the hypothesized generator location.

However, confirming this prediction is not usually sufficient to have strong confidence that the

hypothesis about the generator location is correct. Thus, it is important to test additional predic-

tions. For example, one could test the prediction that damage to the hypothesized generator

location eliminates the ERP component. Indeed, researchers initially hypothesized that the P3

component was generated in the hippocampus, and this hypothesis was rejected when experi-

ments demonstrated that the P3 is largely intact in individuals with medial temporal lobe lesions

(Polich, 2012). Similarly, one could predict that an fMRI experiment should show activation in

the hypothesized generator location under the conditions that produce the ERP component (see,

e.g., Hopf et al., 2006). It is also possible to record ERPs from the surface of the cortex in

neurosurgery patients, and this can been used to test predictions about ERP generators (see, e.g.,

Allison, McCarthy, Nobre, Puce, & Belger, 1994). This hypothesis-testing approach has been

quite successful in localizing some ERP components.

As was discussed in chapter 1, the main advantages of the ERP technique are its high temporal

resolution, its relatively low cost, its noninvasiveness, and its ability to provide a covert and

continuous measure of processing. Spatial resolution is simply not one of the strengths of the

ERP technique, and it is therefore sensible that this technique should mainly be used in studies

designed to take advantage of its strengths and that are not limited by its weaknesses.

Waveform Peaks versus Underlying ERP Components

Now that you understand how the underlying components become mixed together in scalp

electrodes — and why it is so difficult to recover the underlying components from the observed

scalp waveforms — it is time to consider why this is such a big problem in most ERP studies. In

this section, I will show a set of simple artificial waveforms that demonstrate how the superposi-

tion problem can easily lead to misinterpretations of ERP data, along with a set of “ rules ” for

avoiding these misinterpretations. In general, the goal of this section is to get you to look beyond

the ERP waveforms that you have recorded from the scalp and think about the underlying com-

ponents that might give rise to these surface waveforms. Even though you cannot usually localize

the underlying components, you can learn to make educated guesses about the underlying com-

ponents from the observed scalp waveforms.

The term ERP component will be described more formally later in this chapter. For now, you can just think of a component as electrical activity within a given region of the brain

that reflects some neural process occurring in that region, which then propagates to our scalp

electrodes.

To understand the underlying “ deep structure ” of a set of ERP waveforms, the first thing you

need to do is understand how the peaks in the observed ERP waveforms can provide a mislead-

ing representation of the underlying components. To make this clear, I created some simple

simulated components using Excel and then summed them together to create simulated scalp

ERP waveforms. The results are shown in figure 2.5 . This is a busy and complicated figure, but

A Closer Look at ERPs and ERP Components 53

C1

Baseline

C2

Peak1

Peak1

200

+

– 400 600 800 1000 ms

Peak1

Peak2

Peak2

Peak2

Peak3

Peak3

Peak3

C3

C1'

C2'

C3'

A D

B E

C F

G

H

I

Peak1

Peak2

Peak3

Decrease in amplitude of C2'

in Condition X Observed ERP

Waveform

One possible set of underlying components

Another possible set of underlying components

Increase in amplitude of C3 in Condition Z

Increase in amplitude of C1 in Condition Y

Condition X minus

Baseline

Condition Y minus

Baseline

Condition Z minus

Baseline

Condition X Condition Y Condition Z

Figure 2.5 Examples of the complications that arise when underlying components sum together to form an observed ERP waveform.

Panels B and C show two different sets of underlying components that sum together to produce the observed waveform

shown in panel A. Panels D, E, and F show three simulated experiments in which one component differs between a

baseline condition and three comparison conditions. In panel D, component C2 ′ is decreased in condition X compared to the baseline condition. In panel E, component C1 is increased in condition Y compared to the baseline condition,

which creates an apparent shift in the latencies of both peak 1 and peak 2. In panel F, component C3 is increased in

condition Z relative to the baseline condition, influencing both the amplitude and the latency of peak 2. Panels G, H,

and I illustrate difference waves for conditions X, Y, and Z, respectively.

54 Chapter 2

the next several sections will walk you through the different parts. This is probably the most

important figure in the whole book, so it ’ s worth taking your time and understanding each panel

of the figure.

Voltage Peaks Are Not Special Our eyes are naturally drawn to the peaks in a waveform, but the peaks often encourage incorrect

conclusions about the underlying components. This is illustrated in panels A and B of figure

2.5 , which show how three very reasonable components sum together to create the ERP wave-

form that we observe at the scalp. If you look carefully, you can see how the peaks in the observed

waveform do a poor job of representing the underlying components. First, notice that the first

positive deflection in the observed waveform (panel A) peaks at 50 ms, whereas the first under-

lying component (C1 in panel B) peaks at 100 ms. Similarly, the third underlying component

(C3 in panel B) begins at 100 ms, but there is no obvious sign of this component in the observed

waveform until approximately 200 ms.

It is natural for the human visual system to focus on the peaks in the waveform. However,

the points in an ERP waveform where the voltage reaches a maximally positive or maximally

negative value do not usually have any particular physiological or psychological meaning, and

they are often completely unrelated to the time course of any individual underlying component.

The peaks in the observed waveform are a consequence of the mixture of components, and they

typically occur at a different time than the peaks of the underlying components. For example,

the latency of a given peak in the observed waveform may differ across electrode sites (because

of the different weights from the underlying components at the different electrode sites),

whereas an underlying component has exactly the same time course at every electrode site

(because the voltage propagates instantaneously to all electrodes). Sometimes the peak in the

observed waveform occurs at the same time as the peak in an underlying component (e.g., when

the underlying component is much larger than the other components). However, even in these

cases it is not clear that the peak of activity tells us anything special about the underlying

process. That is, theories of cognition or brain processes do not usually say much about when

a process peaks. Instead, these theories usually focus on the onset of a process, the duration of

the process, or the integrated activity over time (see, e.g., Pashler, 1994; Usher & McClelland,

2001).

This leads to our first and most important “ rule ” about interpreting ERP data:

Rule 1: Peaks and components are not the same thing. There is nothing special about the point

at which the voltage in the observed waveform reaches a local maximum.

In light of this fundamental rule, I am always amazed at how often peak amplitude and

peak latency are used to measure the magnitude and timing of ERP components. These

measures often provide a highly distorted view of the amplitude and timing of the underlying

components. Chapter 9 will describe better techniques for quantifying ERP data that do not rely

on peaks.

A Closer Look at ERPs and ERP Components 55

Peak Shapes Are Not the Same as Component Shapes Panel C of figure 2.5 shows another set of underlying components that also sum together to

equal the ERP waveform shown in panel A. These three components are labeled C1 ′ , C2 ′ , and C3 ′ . From the single observed waveform in panel A, there is no way to tell whether it was actu- ally created from components C1, C2, and C3 in panel B or from C1 ′ , C2 ′ , and C3 ′ (or from some other set of underlying components). There are infinitely many sets of underlying compo-

nents that could sum together to produce the ERP waveform in panel A (or any other ERP

waveform).

If the observed waveform in panel A actually consisted of the sum of C1 ′ , C2 ′ , and C3 ′ , the peaks in the observed panel would be a very poor indicator of the shapes of the underlying

components. For example, the relatively short duration of peak 2 in panel A bears little resem-

blance to the long duration of component C2 ′ in panel C. If you saw a waveform like panel A in an actual experiment, you probably wouldn ’ t guess that a very long-duration negative com-

ponent like C2 ′ was a major contributor to the waveform. This leads to our second rule:

Rule 2: It is impossible to estimate the time course or peak latency of an underlying ERP com-

ponent by looking at a single ERP waveform — there may be no obvious relationship between

the shape of a local part of the waveform and the underlying components.

In a little bit, we will talk about how difference waves can sometimes be used to determine

the shape of an underlying ERP component. You can also gain a lot of information by looking

at the waveforms from multiple scalp sites because the time course of a given component ’ s

contribution to one electrode site will be the same as the time course of its contribution to the

other electrode sites (because ERPs propagate to all electrodes instantaneously).

The Time Course and Scalp Distribution of an Effect Panels D – F of figure 2.5 show three simulated experiments in which a “ baseline ” condition is

compared with three other conditions, named X, Y, and Z. The waveform for the baseline condi-

tion is the same as that shown in panel A, which could be the sum of either components C1, C2,

and C3 from panel B or components C1 ′ , C2 ′ , and C3 ′ from panel C. I am calling this a “ base- line ” condition only because I am using it for comparison with conditions X, Y, and Z.

In the simulated experiment shown in panel D, we are assuming that the observed waveform

for the baseline condition is the sum of components C1 ′ , C2 ′ , and C3 ′ and that component C2 ′ is decreased by 50% in condition X relative to the baseline condition. However, if you just saw

the waveforms in this panel, and you didn ’ t know the shapes of the underlying components, you

might be tempted to conclude that at least two different components differ between the baseline

condition and condition X. That is, it looks like condition X has a decrease in the amplitude

of a negative component peaking at 300 ms and an increase in the amplitude of a positive com-

ponent peaking at 500 ms. You might even conclude that condition X had a larger positive

component peaking at 100 ms. This would be a very misleading set of conclusions about the

56 Chapter 2

underlying components because the effect actually consists entirely of a decrease in the ampli-

tude of a single, broad, negative component (C2 ′ ). These incorrect conclusions arise from our natural inclination to assume that the peaks in the observed waveform have a simple and direct

relationship to the unknown underlying components. This leads us to conclude that the increased

amplitude in peak 3 in panel D reflects an increase in the amplitude of a long-latency positive

component, when in fact it is a result in of a decrease in the amplitude of an intermediate-latency

negative component.

Panel E shows a simulated experiment comparing the baseline condition with condition Y.

This time, we ’ re assuming that the underlying components are C1, C2, and C3 (rather than C1 ′ , C2 ′ , and C3 ′ ). Condition Y is the same as the baseline condition except for an increase in the amplitude of component C1. However, if you saw these observed waveforms, you might be

tempted to conclude that condition Y has an increase in the amplitude of an early positive com-

ponent peaking at 100 ms followed by a decrease in a negative component that peaks at 300 ms.

Similarly, panel F shows a simulation in which the amplitude of component C3 is increased

in condition Z compared to the baseline condition. Although only this late positive component

differs across conditions, the observed waveform for condition Z appears to exhibit a reduction

of a negative component at 300 ms as well. However, this is just a consequence of the fact that

component C3 overlaps in time with component C2.

In panels D – F, a change in the amplitude of a single component caused a change in the

amplitude of multiple peaks, making it likely that the researcher will falsely conclude that mul-

tiple underlying components differ across conditions. I have seen this general pattern of wave-

forms countless times, and people often misinterpret the effects in exactly this way. This leads

to our third rule:

Rule 3: An effect during the time period of a particular peak may not reflect a modulation of

the underlying component that is usually associated with that peak.

Difference Waves as a Solution Difference waves can often be used to provide a better estimate of the time course and scalp dis-

tribution of the underlying components. Difference waves are a very simple concept: You simply

take the voltage at each time point in one ERP waveform and subtract the voltage from a different

ERP waveform at the corresponding time points. The result gives you the difference in voltage

between the two waveforms at each time point. In an oddball experiment, for example, you can

subtract the frequent-trial waveform from the rare-trial waveform (rare-minus-frequent), and the

resulting difference wave will show the difference in voltage between these trial types at each

point in time (see, e.g., figure 1.4 in chapter 1). The difference wave from a given electrode site

is analogous to a difference image in an fMRI experiment, except that differences are computed

at each time point in an ERP difference wave and at each voxel in an fMRI difference image.

Panels G – I of figure 2.5 show difference waves from the three simulated experiments shown

in panels D – F. The baseline waveform has been subtracted from the waveforms for conditions

A Closer Look at ERPs and ERP Components 57

X, Y, and Z. These difference waves are very helpful in showing us the nature of the experimental

effects. They make the time course of the effects very clear, showing that they are different from

the time course of the peaks in the observed waveforms. Indeed, under these conditions — in

which the experimental effects consist entirely of a change in the amplitude of a single underly-

ing component — the difference waves reveal the time course and scalp distribution of the under-

lying component that differs across conditions. That is, if you compare the difference wave in

panel G to the underlying components in panel C, you will see that the difference wave has

exactly the same time course as component C2 ′ . And because the difference waves in these examples subtract away everything except the one component that differs across conditions, the

difference waves will have the same topography as the underlying component.

You cannot be guaranteed that a difference wave will contain only a single component. For

example, the rare-minus-frequent difference waves from the oddball paradigm that were dis-

cussed in chapter 1 (see figure 1.4) contained both an N2 wave and a P3 wave. However, there

is a good chance that you will have a single component in your difference waves if you compare

conditions that are very subtly different.

How can you know if a broad effect in a difference wave consists of a change in a single

underlying component or a sequence of two or more components? First, you can look at the

scalp distribution of the difference wave, collapsed across the entire duration of the effect, to

see if the scalp distribution is too complex to be explained by a single dipole. Second, you can

compare the scalp distributions of the early and later parts of the difference wave. If they are

the same, then you probably have a single component; if they are different, then you have at

least two components. Difference waves are discussed in more detail later in this chapter.

Interactions between Amplitude and Latency Although the amplitude and latency of an underlying component are conceptually independent,

they can become confounded when the underlying components become mixed together at the

scalp. For example, panel F of figure 2.5 illustrates the effect of increasing the amplitude of

component C3. Because component C3 overlaps with peak 2, this change in the amplitude of

component C3 causes a shift of approximately 20 ms in the peak latency of peak 2 in condition

Z relative to the baseline condition. This leads to our next rule:

Rule 4: Differences in peak amplitude do not necessarily correspond with differences in com-

ponent size, and differences in peak latency do not necessarily correspond with changes in

component timing.

This is a somewhat depressing rule, because the most obvious virtue of the ERP technique is

its temporal resolution, and it is unpleasant to realize that estimates of component timing can

be distorted by changes in the amplitude of temporally overlapping components. Keep in mind,

however, that this sort of temporal distortion mainly happens when a large change in a large

component overlaps with a smaller component, leading to a small latency change in the smaller

component. If you think carefully about a given effect and spend some time looking at the

58 Chapter 2

difference waves, you should be able to determine whether the latency differences could plau-

sibly be a result of amplitude changes (or vice versa). In addition, chapters 3 and 4 will discuss

how this problem can be completely avoided by looking at the onset time of a difference wave.

Distortions Caused by Averaging In the vast majority of ERP experiments, the ERP waveforms are isolated from the background

EEG by means of signal averaging. It ’ s tempting to think of averaging as a process that simply

attenuates the nonspecific EEG, allowing us to see what the single-trial ERP waveforms look like.

However, to the extent that the single-trial waveform varies from trial to trial, the averaged ERP

may provide a distorted view of the single-trial waveforms, particularly when component latencies

vary from trial to trial. This is illustrated in figure 2.6 . Panel A illustrates three single-trial ERP

waveforms (without any EEG noise), with significant latency differences across trials, and panel

B shows the average of those three single-trial waveforms. The averaged waveform differs from

the single-trial waveforms in two significant ways. First, it is smaller in peak amplitude. Second,

it is more spread out in time. As a result, even though the waveform in panel B is the average of

the waveforms in panel A, the onset time of the averaged waveform in panel B reflects the onset

time of the earliest single-trial waveform and not the average onset time. In other words, the onset

time in the average waveform is not equal to the average of the single-trial onset times (see chapter

8 and online chapter 11 for additional discussion). This leads to our next rule:

Rule 5: Never assume that an averaged ERP waveform directly represents the individual wave-

forms that were averaged together. In particular, the onset and offset times in the averaged

waveform represent the earliest onsets and latest offsets from the individual trials or individual

subjects that contribute to the average.

Fortunately, it is often possible to measure ERPs in a way that avoids the distortions created

by the averaging process. For example, the area under the curve in the averaged waveform shown

in figure 2.6A is equal to the average of the area under the single-trial curves in figure 2.6B .

Similarly, it is possible to find the time point that divides the area into two equal halves, and

this can be a better measurement of latency than peak measures. These methods are described

in detail in chapter 9.

Comparisons across Experiments It can be very difficult to determine whether an ERP effect in one experiment reflects the same

underlying component as an ERP effect in another experiment. For example, imagine that you ’ ve

conducted an experiment in which you recorded ERPs from bilingual speakers of two languages,

Xtrinqua and Kirbish, focusing on the N400 component elicited by words in these two languages.

You found that the N400 was larger (more negative) for proper nouns in Xtrinqua than for proper

nouns in Kirbish. You therefore conclude that processing proper nouns requires more work in

Xtrinqua than in Kirbish. However, this assumes that the effect you have observed actually

reflects a change in the same N400 component that was observed in prior studies of semantic

A Closer Look at ERPs and ERP Components 59

integration. Perhaps you are instead seeing a smaller P3 rather than a larger N400. Because the

P3 is related to the amount of cognitive resources devoted to processing a stimulus, this would

suggest that it is actually easier to process Xtrinqua proper nouns than Kirbish proper nouns.

This is nearly the opposite of the conclusion that you would draw if the effect was an increase

in N400 amplitude rather than a decrease in P3 amplitude. This type of problem has come up

many times in the language ERP literature (see review by Swaab, Ledoux, Camblin, & Boudewyn,

2012), and analogous problems arise in other ERP domains. Scalp distribution can sometimes

be used to rule out a given component, but this only works when the scalp distributions of two

components are easily distinguished (which is not the case for P3 and N400).

This leads to our final rule of ERP interpretation:

Rule 6: An ERP effect observed in one experiment may not reflect the same underlying brain

activity as an effect of the same polarity and timing in previous experiments.

Solutions to this problem will be described in the final section of this chapter.

Some General Comments about the Component Structure of ERP Waveforms The six rules of ERP interpretation presented in this chapter have been violated in a very large

number of published ERP experiments (including some of my own papers!). Violations of these

rules significantly undermine the strength of the conclusions that can be drawn from these

experiments, so you should look for violations when you read ERP papers. If you are relatively

new to the ERP technique, it would be worth looking back through old ERP papers that you ’ ve

previously read and identifying any violations of these rules. This will be good practice, and it

may cause you to reevaluate your conclusions about some prior findings.

This section of the book can be depressing, because it focuses on the most difficult challenges

that face ERP researchers. But don ’ t despair! Chapter 4 provides some strategies for overcoming

these challenges.

A BP3 wave on 3 different trials

Average of the 3 trials

Figure 2.6 Illustration of the distortions in an averaged ERP waveform that arise from trial-to-trial latency variation. (A) A single-

trial component at three different latencies, representing trial-to-trial variations in timing. (B) The average of these three

single-trial waveforms. The average waveform is broader than the single-trial waveforms and has a smaller peak ampli-

tude. In addition, the onset time and offset time of the average waveform reflect the earliest onset and latest offset times,

not the average onset and offset times.

60 Chapter 2

Also, many of these problems are not nearly as bad when you consider the waveforms

recorded at multiple scalp sites. Because the ERP waveforms at different scalp sites reflect

exactly the same components, but scaled by different weighting factors (as shown in figure 2.3 ),

you can rule out many alternative explanations by looking at the data from multiple electrodes.

Consider, for example, the simulated experiment shown in figure 2.7 (which is based on the C1 ′ , C2 ′ , and C3 ′ components in figure 2.5 ). In this imaginary experiment, ERPs were recorded from the Fz, Cz, and Pz electrode sites in condition A and condition B. If you just saw the data from

the Cz electrode site, you might conclude that the P3 wave was larger and earlier for condition

B than for condition A. However, you can rule out this conclusion by looking at the scalp dis-

tribution of the waveforms (both the parent waveforms and the difference waves). From the

whole set of data, you can see that the P3 wave is largest at Pz, a little smaller at Cz, and quite

a bit smaller at Fz (which is the usual pattern). The experimental effect, however, is largest at

Fz, smaller at Cz, and near zero at Pz, and it extends from approximately 150 to 450 ms. Both

the timing and the scalp distribution of the experimental effect are inconsistent with a modula-

tion of the P3 wave and are more consistent with a frontal N2 effect. You should also note that

the scalp distribution of the effect is the same throughout the time course of the effect, suggest-

ing that it really does consist of a single component that extends from approximately 150 to 450

ms, rather than consisting of separate early and late effects. Thus, a careful examination of the

data from multiple electrode sites, including the difference waves, will allow you to avoid making

na ï ve mistakes in interpreting your data. Moreover, as you gain more experience with ERPs,

you will be able to do a better job of figuring out the underlying components (see box 2.4 ).

Although I am not a big fan of mathematical techniques for determining the underlying com-

ponents in ERP experiments, they can certainly be useful for generating plausible hypotheses

and for showing that some hypotheses are inconsistent with the data. For example, if you were

to perform source localization on the difference waves shown in figure 2.7 (with many more

electrode sites, of course), this would make it clear that the effect is not consistent with the likely

generators of the P3 wave. Source localization would also make it clear that a single generator

source could account for both the early and late portions of the effect. Thus, even though you

should be skeptical about the three-dimensional coordinates of the effect that would be provided

by source localization techniques, these techniques are still valuable for testing more general

hypotheses about the component structure of your data. This is especially true if you combine

source localization with the use of difference waves (see box 2.5 for an example of my own

rather lame attempt at using math to understand the underlying component structure of an

experiment).

Difference Waves as a Tool for Isolating Components

In this section, I want to spend some additional time discussing the value of difference waves

in addressing the problems caused by the mixing of components in our observed ERP wave-

forms. As shown in figures 2.5 and 2.7 , difference waves can sometimes reveal the time course

A Closer Look at ERPs and ERP Components 61

300150 450 750600

300150 450 750600

300150 450 750600

Fz

Cz

Pz

300150 450 750600

300150 450 750600

300150 450 750600

Condition A

Parent Waveforms Difference Waves

Condition B Minus Condition A Condition B

Figure 2.7 Simulated experiment in which three underlying components (similar to components C1 ′ , C2 ′ , and C3 ′ in figure 2.5 ) propagate to electrode sites Fz, Cz, and Pz with different weighting factors. The amplitude of a negative-going component

(C2 ′ ) is reduced in condition B compared to condition A. Because the weighting from this component is greatest at Fz, intermediate at Cz, and smallest at Pz, the difference between conditions A and B is greatest at Fz, intermediate at Cz,

and smallest at Pz. This can be seen in the parent waveforms from the individual conditions (left column) and in the

difference waves (right column). Note that because the difference waves were formed by subtracting condition A from

condition B rather than vice versa, the difference waves show a positive deflection rather than a negative deflection.

62 Chapter 2

Box 2.4 Experience Matters

It appears that people can learn to make good inferences about the underlying components when

examining ERP data. Many years ago, Art Kramer conducted a study in which he created artificial

ERPs from a set of realistic underlying components (Kramer, 1985). He then gave these waveforms

(from multiple conditions at multiple electrode sites) to eight graduate students and two research

associates who had between 1 and 10 years of experience with ERPs. There was a strong effect of

experience: the experienced ERP researchers were able to figure out the nature of the underlying

components, whereas the people who were new to ERPs could not reliably determine the underlying

component structure of the data. Of course, there are limits to this conclusion. It was based on a

small sample of researchers from one particular lab who were shown one type of data, and there is

no guarantee that a given experienced researcher will always (or even usually) reach a correct

interpretation. When writing a journal article, you couldn ’ t cite your years of experience with ERPs

as evidence that your interpretation of the underlying component structure is correct. However, once

you gain enough experience with ERPs (and learn to apply everything you have read in this chapter),

you should be able to figure out the likely pattern of underlying ERP components from a given

experiment with some confidence.

and scalp distribution of an underlying ERP component. A well-constructed difference wave

(i.e., one based on a well-controlled and relatively subtle experimental manipulation) will always

contain fewer components than the parent waveforms from which the difference wave was constructed. And fewer components means less opportunity for confusion due to the mixing of

components.

Given that I ’ m a big fan of difference waves, it ’ s a bit ironic that the first published use of

difference waves to assess the component structure of an experiment were in a paper by Risto

N ä ä t ä nen and his colleagues (N ä ä t ä nen, Gaillard, & Mantysalo, 1978) that was designed to

question the interpretation of a previous study by my graduate mentor, Steve Hillyard (Hillyard,

Hink, Schwent, & Picton, 1973). However, Jon Hansen and Steve Hillyard responded with a

paper in which they embraced the difference wave approach and fought back with their own

difference waves (Hansen & Hillyard, 1980).

The logic behind difference waves is very simple. As shown in figure 2.3 , the ERP waveform

at a given electrode site is the weighted sum of a set of underlying source waveforms (recall

that source waveform is just another term for the time course of an ERP component at the gen- erator site). If two conditions differ in some of the source waveforms and not in others, then the

difference wave created by subtracting one condition from the other will eliminate the source

waveforms that are identical in the two conditions, making it possible to isolate the components

that differ. This is completely reasonable because ERPs from different sources simply sum

together.

As described earlier in this chapter, figures 2.5 and 2.7 show how difference waves can

help reveal the time course of an effect. To see how difference waves can help reveal the scalp

A Closer Look at ERPs and ERP Components 63

When I was making the simulated waveforms shown in figure 2.7 , I realized that these waveforms

looked a lot like the data from one of the first experiments I conducted in graduate school (Luck,

Heinze, Mangun, & Hillyard, 1990). In this experiment, subjects attended either to the left or the

right side of a bilateral visual display, and we wanted to see if the P1 and N1 components would

be larger over the hemisphere contralateral to the attended side (relative to the ipsilateral hemi-

sphere). The waveforms are shown in the illustration that follows.

As you can see, we found that the waveform was more positive over the contralateral hemisphere

from approximately 80 to 200 ms, including the time ranges of the P1, N1, and P2 components.

From visual inspection of the data, it was not clear whether this broad attention effect was a single

component or a modulation of separate P1 and P2 components. Because I thought of myself as a

statistics hotshot at that time, I decided that I would use principal component analysis (PCA) to

answer this question. PCA produced nice results indicating that the attention effect consisted of

modulations of separate P1 and P2 components. However, the more I thought about the results, the

more I realized that they could be an artifact of PCA assumptions. Also, Steve Hillyard was not as

impressed by fancy mathematical techniques as I was (for reasons that I now appreciate). Our com-

promise was to put the PCA results into an appendix in the published paper. Fortunately, the conclu-

sions of the paper did not hinge on the PCA results.

Box 2.5 My Own Experience with Mathematical Approaches

100 200 300 400

Contralateral to attended location

Ipsilateral to attended location

P1

N1

P2

+0.5 µV

–0.5 µV

distribution of an effect, consider the N170 component shown in figure 2.8 . As discussed in

chapter 1, the N170 is larger in response to pictures of faces than pictures of non-face objects such

as cars (for a review, see Rossion & Jacques, 2012). If you simply measure the scalp distribution

at 170 ms for trials on which faces were presented, the distribution will reflect face-specific pro-

cessing plus all of the nonspecific activity that is present at 170 ms. However, if you first construct

face-minus-nonface difference waves and then measure the scalp distribution, you will see the

topography of face-specific brain activity, uncontaminated by the nonspecific activity.

Bruno Rossion ’ s lab conducted a study in which they examined the development of the N170

(Kuefner, de Heering, Jacques, Palmero-Soler, & Rossion, 2010), and this study showed the

importance of measuring scalp distributions from difference waves. In this study, ERPs were

recorded from children between 4 and 17 years of age while they viewed images of faces,

64 Chapter 2

scrambled faces, cars, and scrambled cars. When the scalp distribution was measured at the time

of the N170 component from the ERPs elicited by faces, the scalp distribution changed markedly

over the course of development. Previous studies had found similar effects and concluded that

the neuroanatomy of face processing changed during this period of development. However, these

changes in scalp distribution could instead be a result of distortion from other ERP components

that are also present at 170 ms and that change in relative amplitude over the course of develop-

ment. To test this, Kuefner et al. constructed difference waves in which they subtracted the ERPs

elicited by scrambled faces from the ERPs elicited by intact faces, removing any activity that

was not face specific. The scalp distribution of the resulting difference wave was virtually identi-

cal from 4 to 17 years, indicating that the face-specific processing had the same neuroanatomical

locus across this developmental period.

Difference waves can also help you avoid a visual illusion that sometimes causes people to

misperceive ERP effects. Figure 2.9 shows a simulated experiment in which this illusion arises.

In this experiment, the difference between condition A and condition B is exactly the same at

the Fz and Cz electrode sites. However, because this effect is superimposed on the steeply rising

edge of the P3 wave at the Cz site, the effect looks smaller at Cz than at Fz. The difference wave

shows that the effect is exactly the same at these sites. The illusion arises because our visual

system tends to focus on the distance between two curves along an axis that is approximately

perpendicular to the curves (e.g., the horizontal difference between the condition A and condition

B waveforms at the Cz site). However, the difference in amplitude at a given time point is the

vertical distance between the two waveforms at that time point. The steeper the slope of two

overlaid waveforms, the more our visual system focuses on the horizontal difference between

the waveforms rather than the vertical distance, and this leads us to underestimate the true

P1 P2

N170 Face Car

Figure 2.8 ERPs elicited over visual cortex by car and face stimuli, with the N170 effect highlighted by the shaded region. Adapted

with permission from Rossion and Jacques (2012). Copyright 2012 Oxford University Press.

A Closer Look at ERPs and ERP Components 65

difference in amplitude between the two conditions. Once you look at the difference waves in

figure 2.9 , you can see that the difference between conditions A and B is exactly the same at Fz

and Cz in this experiment. This is yet one more advantage of difference waves.

Although difference waves are an incredibly useful way to isolate the underlying components

from the mixture of components in the parent waveforms, they do have some limitations. First,

you are making implicit assumptions about the nature of the difference between conditions A

and B when you choose whether to compute A minus B or to compute B minus A. Consider,

for example, the simulated experiment shown in figure 2.7 . The difference waves in this experi-

ment make it look as if a frontal positive component is larger in condition B than in condition

A, but the effect was actually created (in my simulations) by decreasing the amplitude of a frontal

negative component. Thus, you need to be careful about the implicit assumptions you are making

about the direction of the effect when you subtract one waveform from another.

Another important issue is that the difference wave will always be noisier than the parent

waveforms. To understand why this is true, consider what happens when you add two waveforms

rather than subtract two waveforms. When you add two waveforms, A and B, some of the noise

in A cancels out noise in B, but mostly the noise increases rather than canceling, and the overall

noise in the summed waveform increases (although it doesn ’ t double). It turns out that exactly

the same thing happens when you subtract one waveform from the other instead of adding the

two waveforms. The reason for this is that the noise at a given point in the waveform can be

either positive-going or negative-going, so it doesn ’ t matter whether you add or subtract pure

noise (on average). The end result will be a noisier waveform. However, this is not usually a

problem in practice. If you do a statistical analysis on the difference waves, the p value may be exactly the same as if you did the analysis on the parent waveforms (this is true for linear mea-

sures, such as mean amplitude, as will be discussed in chapter 9). For example, if you have

Condition A

Condition B Minus Condition A Condition B

300150 450 750600

Fz Cz

300150 450 750600

Figure 2.9 Visual illusion in which the effect of an experimental manipulation appears to be smaller when it is superimposed on a

steeply sloped ERP wave. In this example, the difference between conditions A and B is exactly the same size at the Fz

and Cz scalp sites (as shown by the difference waves). However, because this difference is superimposed on a steeply

sloped P3 wave at the Cz site, but not at the Fz site, the effect looks smaller at Cz than at Fz when judged by looking

at the parent waveforms rather than by looking at the difference waves.

66 Chapter 2

waveforms for conditions A and B in two groups of subjects, you could do a two-way analysis

of variance (ANOVA) on the mean amplitude from 400 to 600 ms with factors of condition and

group. Alternatively, you could measure the mean amplitude from 400 to 600 ms in difference

waves formed by subtracting the condition B waveforms from the condition A waveforms and

then do a t test comparing these values for the two groups. The p value from this t test will be identical to the p value from the group × condition interaction in the ANOVA.

What Is an ERP Component?

A Conceptual Definition We are now ready to provide a formal definition of ERP component . In the early days of ERP research, a component was defined primarily on the basis of its polarity, latency, and general

scalp distribution. For example, the P3a and P3b components were differentiated on the basis

of the earlier peak latency and more frontal distribution of the P3a component relative to the

P3b component (Squires, Squires, & Hillyard, 1975). However, polarity, latency, and scalp dis-

tribution are superficial features that don ’ t really capture the essence of a component. For

example, the peak latency of the P3b component may vary by hundreds of milliseconds depend-

ing on the difficulty of the target – nontarget discrimination (Johnson, 1986), and the scalp dis-

tribution of the auditory N1 wave depends on the pitch of the eliciting stimulus in a manner that

corresponds with the tonotopic map of auditory cortex (Bertrand, Perrin, & Pernier, 1991). Even

polarity may vary: the C1 wave, which is generated in primary visual cortex, is negative for

upper-field stimuli and positive for lower-field stimuli due to the folding pattern of this cortical

area (Clark, Fan, & Hillyard, 1995). The difficulty of using latency, scalp distribution, and polar-

ity to identify components becomes even greater when multiple components are mixed together

at a given electrode site. For example, the N2 elicited by an oddball is superimposed on top of

P2 and P3 waves, so the overall voltage during the N2 time range is often positive.

I do not mean to imply that you should not use latency, scalp distribution, and polarity to help

identify ERP components. In fact, these factors are extremely useful in determining whether a

given experimental effect reflects, for example, a P3a or P3b component. However, these factors

do not define an ERP component; they are merely superficial consequences of the nature of the component. Instead, a component should be defined by the factors that are intrinsic to the com-

ponent. For example, an influential discussion of ERP components by Manny Donchin, Walter

Ritter, and Cheyne McCallum noted that, “ an ERP component is a subsegment of the ERP whose

activity represents a functionally distinct neuronal aggregate ” (Donchin, Ritter, & McCallum,

1978, p. 353). My own, very similar definition is this:

Conceptually, an ERP component is a scalp-recorded neural signal that is generated in a specific

neuroanatomical module when a specific computational operation is performed.

By this definition, a component may occur at different times under different conditions, as

long as it arises from the same module and represents the same computational operation (e.g.,

A Closer Look at ERPs and ERP Components 67

the encoding of an item into working memory in a given brain area may occur at different delays

after the onset of a stimulus because of differences in the amount of time required to identify

the stimulus and decide that it is worth storing in working memory). The scalp distribution and

polarity of a component may also vary according to this definition, because the same cognitive

function may occur in different parts of a cortical module under different conditions (e.g., when

a visual stimulus occurs at different locations and therefore stimulates different portions of a

topographically mapped area of visual cortex). It is logically possible for two different cortical

areas to accomplish exactly the same cognitive process, but this probably occurs only rarely and

would lead to a very different pattern of voltages, and so this would not usually be considered

a single ERP component (except in the case of mirror-image regions in the left and right

hemispheres).

Although this definition captures the essence of an ERP component, it is completely useless

as an operational definition. That is, because we are recording the mixed signals from multiple

generators at every electrode site, we have no way of directly determining the contribution of

each neuroanatomical module to the observed waveform. And we don ’ t ordinarily know the

computational operation reflected by a scalp voltage. So we cannot use this definition to deter-

mine which component is being influenced by a given experimental manipulation or to determine

whether the same component is active in multiple experiments. However, this definition is useful

insofar as it helps to formalize what we believe is going on under the skull.

You should also keep in mind that many well-known ERP components (e.g., N400, mismatch

negativity, P3b) are probably not individual components according to this definition. That is,

multiple brain areas that are carrying out similar but somewhat different computations likely

contribute to these scalp-recorded voltage deflections (see, e.g., the chapters on these compo-

nents in Luck & Kappenman, 2012b). As research progresses, we gain the ability to subdivide

these multicomponent conglomerations into their parts. In some cases, this leads to new names

for the individual components. For example, the “ N2 component ” that was identified in early

ERP studies turned out to consist of a set of many different individual components that can be

separated by means of various experimental manipulations, and these individual components

(sometimes called subcomponents ) were given new names (e.g., anterior N2 , N2c , N2pc ). In some cases, these subcomponents have been divided even further (see, e.g., Luck, 2012b). In

other cases, we still use a single name to refer to something that likely consists of multiple

similar components, especially when we don ’ t yet have good methods for isolating the individual

subcomponents (e.g., N400 is still used even though there are probably at least two neurally distinct subcomponents that contribute to it).

An Operational Definition Manny Donchin provided an alternative definition that is more like an operational definition: “ A

component is a set of potential changes that can be shown to be functionally related to an

experimental variable or to a combination of experimental variables ” (Donchin et al., 1978, p.

353). They summarized this idea more compactly by saying that, “ an ERP component is a source

68 Chapter 2

of controlled , observable variability ” (Donchin et al., 1978, p. 354). This is very similar to the idea, illustrated in figures 2.5 and 2.7 , that difference waves can be used to isolate ERP compo-

nents. Donchin et al. noted that electrode site is one key source of variability, which corresponds

to the idea that changes in the scalp distribution of a difference wave between the early and late

parts of the effect mean that different components are active in the early and late parts of the

effect. Thus, by this definition, any consistent difference between the waveforms in different

experimental conditions is an ERP component. This is a clear and easily applied operational

definition of ERP component . However, there are two shortcomings of this operational definition. First, it ignores the key

idea that an ERP component is generated by a specific neuroanatomical module (or “ neural

aggregate ” to use the terminology of Donchin et al., 1978). That is, even if the scalp distribution

of an experimental effect did not vary over time or over the conditions of a particular experiment,

the scalp distribution might clearly indicate that more than one dipole was active (e.g., if

the scalp distribution did not follow a simple dipolar pattern). In such a case, we would want

to conclude that multiple components were active, but tightly interlinked. And we could reason-

ably assume that these components could be shown to be functionally distinct in future

experiments.

Second, Donchin ’ s operational definition focuses exclusively on experimental (controlled) manipulations, completely ignoring variations that are spontaneous or correlational in nature.

To take an obvious example of why this is an unnecessary limitation, consider a case in which

a patient group and a control group differ in terms of the voltage between 150 and 250 ms over

lateral frontal electrode sites. In this case, there must be at least one ERP component (according

to my conceptual definition) that varies between these groups. This is just as convincing as an

experimentally controlled within-subject difference in the voltage between 150 and 250 ms over

lateral frontal electrode sites. Subtler examples are also important to consider. For example, if

you look at the distribution of voltage over the scalp from moment to moment as the EEG

spontaneously varies, it is possible to used advanced statistical techniques (e.g., principal com-

ponent analysis, independent component analysis) to isolate systematic patterns of covariation

across electrode sites that reflect spontaneous fluctuations in the amplitude of underlying ERP

components.

When taking these factors into consideration, we can update Donchin ’ s operational definition

as follows:

An ERP component can be operationally defined as a set of voltage changes that are consistent

with a single neural generator site and that systematically vary in amplitude across conditions,

time, individuals, and so forth. That is, an ERP component is a source of systematic and reliable

variability in an ERP data set.

This should not be taken to imply that we cannot isolate a component until we have determined

its generator. If the scalp distribution of a putative component has a complex structure that is

inconsistent with a single dipole, then this should not be considered a single component. But if

A Closer Look at ERPs and ERP Components 69

it has a simple structure that is consistent with a single dipole (or with a pair of mirror-symmetric

dipoles in the left and right hemispheres), then we can provisionally accept it as a single

component.

How Can We Identify Specific ERP Components?

Now that we have both a conceptual definition and an operational definition for ERP compo-

nents, we can consider in more detail a question that has come up many times in this chapter:

when we see differences in the ERPs between different conditions or different groups in a

given study, how can we determine which underlying components are responsible for these

differences?

This is still not an easy question to answer, because our operational definition applies to a

single data set and cannot be used to determine if the components (sources of consistent vari-

ability) that we see in one experiment are the same as the components that were observed in

another experiment. Scalp distributions can be helpful, especially when derived from difference

waves to eliminate the contribution of other overlapping components. If the scalp distribution

of the effect in the new experiment is the same as the scalp distribution observed in previous

experiments, then this is one piece of evidence in favor of it being the same component; if the

scalp distribution is different, then this is evidence against the hypothesis. In practice, this can

provide strong evidence against the hypothesis that two effects reflect the same component, but it cannot provide strong evidence in favor of the hypothesis. First, it is often impossible to provide direct quantitative comparisons of scalp distributions across experiments (e.g., when you

are comparing your data with published data from another lab), and this will make it difficult

to rule out the possibility that the scalp distribution in the new experiment is subtly but signifi-

cantly different from the scalp distribution in a previous experiment. Second, concluding that

two scalp distributions are the same requires accepting the null hypothesis, and conclusions of

this sort are almost always weak. Source localization techniques can also provide evidence about

whether generator sources are the same for ERP effects observed in different experiments, but

this is essentially equivalent to asking whether the scalp distributions are the same and therefore

has the same limitations.

We can also ask whether the component that we isolate in one experiment shows the same

pattern of interaction with other factors as the component that was isolated in prior experiments.

For example, if we believe we are seeing a P3b effect, we can ask whether it interacts with the

probability of the stimulus category (see Luck et al., 2009; and see Experiment 4 in Vogel, Luck,

& Shapiro, 1998). This can be a useful adjunct to comparisons of scalp distribution, but it

requires a very solid theory of the interactions that should occur. Some of the clearest cases

involve combinations of experimental factors and scalp distribution; as will be described in

chapter 3, the N2pc, contralateral delay activity (CDA), and LRP components are isolated by

computing a difference wave in which the waveforms recorded over the ipsilateral hemisphere

(relative to an attended location or a response hand) are subtracted from the waveforms recorded

70 Chapter 2

over the contralateral hemisphere. These components can be very easily isolated from other ERP

components, and they have been particularly useful for testing precise hypotheses about atten-

tion, working memory, and response selection.

In general, the difficult problem of identifying ERP components is solved in the same way

that every other difficult problem in cognitive neuroscience is solved: with converging evidence.

If you make a slight change in an experimental paradigm and you find the same effects as before

(in terms of latency and scalp distribution), it is very likely that you have isolated the same

underlying ERP components. If you then make a very large change in the paradigm, and now

you find a very different latency or a somewhat different scalp distribution, you have to do some

further research to determine whether you are still looking at the same component. If the latency

is quite different, you can look for other evidence that the timing of the relevant process has

changed. If the scalp distribution is a little different, you can look at whether the scalp distribu-

tion changes over the time course of the effect, suggesting that you now have two components

contributing to the effect. If the scalp distribution is approximately the same as in your prior

research, you can run additional experiments to see if the effect interacts with other manipula-

tions in a way that is predicted by your hypothesis about the nature of the underlying component.

You could also ask whether subject-to-subject differences in the scalp distribution of your new

experimental effect are correlated with subject-to-subject differences in your previous effect.

The bottom line is that it is more difficult to isolate and identify specific ERP components

than people typically realize, but this challenge can be overcome if you think carefully about

the problem and come up with clever ways to test your hypotheses. Chapter 4 will describe a

variety of strategies for overcoming this challenge, including circumventing the problem entirely

by designing experiments that do not depend on identifying a specific ERP component.

Suggestions for Further Reading

Donchin, E. (1981). Surprise! … Surprise? Psychophysiology , 18 , 493 – 513.

Donchin, E., Ritter, W., & McCallum, W. C. (1978). Cognitive psychophysiology: The endogenous components of the

ERP. In E. Callaway, P. Tueting, & S. H. Koslow (Eds.), Event-Related Brain Potentials in Man (pp. 349 – 441). New York: Academic Press.

Kappenman, E. S., & Luck, S. J. (2012). ERP components: The ups and downs of brainwave recordings. In S. J. Luck

& E. S. Kappenman (Eds.), The Oxford Handbook of ERP Components (pp. 3 – 30). New York: Oxford University Press.

Makeig, S., & Onton, J. (2012). ERP features and EEG dynamics: An ICA perspective. In S. J. Luck & E. S. Kappen-

man (Eds.), The Oxford Handbook of ERP Components (pp. 51 – 86). New York: Oxford University Press.

N ä ä t ä nen, R., & Picton, T. (1987). The N1 wave of the human electric and magnetic response to sound: A review and

an analysis of the component structure. Psychophysiology , 24 , 375 – 425.

Picton, T. W., & Stuss, D. T. (1980). The component structure of the human event-related potentials. In H. H. Kornhuber

& L. Deecke (Eds.), Motivation, Motor and Sensory Processes of the Brain, Progress in Brain Research (pp. 17 – 49). North-Holland: Elsevier.