CogNotes2.pptx

Pattern Recognition

Langston, PSY 4040

Cognitive Psychology

Notes 2

1

What is this?

2

What is this?

3

What is this?

4

What is this?

What's Out There?

We're going to explore how you know what it is that you're seeing, hearing, etc. Three steps:

Input: Get it into the system.

Identification: What is it?

Recognition: What does it mean?

6

Architecture

Recall our three boxes:

Sensory

Store

LTM

STM

Filter

Pattern

Recognition

Selection

Input

(Environment)

Response

7

Where We Start

The sensory system delivers information transduced from the environment, then we start.

Kind of a gray line…

Where We Start

Monocular cues to depth:

Linear perspective

Interposition

Relative size

Relative height

Texture gradient

Lighting/shading

Familiar size

Where We Start

Monocular cues to depth:

Linear perspective

Interposition

Relative size

Relative height

Texture gradient

Lighting/shading

Familiar size

Where We Start

Monocular cues to depth:

Linear perspective

Interposition

Relative size

Relative height

Texture gradient

Lighting/shading

Familiar size

Where We Start

Monocular cues to depth:

Linear perspective

Interposition

Relative size

Relative height

Texture gradient

Lighting/shading

Familiar size

Where We Start

Monocular cues to depth:

Linear perspective

Interposition

Relative size

Relative height

Texture gradient

Lighting/shading

Familiar size

Where We Start

Monocular cues to depth:

Linear perspective

Interposition

Relative size

Relative height

Texture gradient

Lighting/shading

Familiar size

Where We Start

Monocular cues to depth:

Linear perspective

Interposition

Relative size

Relative height

Texture gradient

Lighting/shading

Familiar size

Architecture

Recall our three boxes:

Sensory

Store

LTM

STM

Filter

Pattern

Recognition

Selection

Input

(Environment)

Response

25

Input (Sensory Store)

Generic questions:

Capacity?

Duration?

Code?

Forgetting?

26

Sensory Store

Present some information, get you to report it back:

Sperling (1960, p. 3)

27

Sensory Store

How do you report?

Whole report: Tell me everything you saw.

Partial report: Tell me some of it.

Why make a distinction? If you have a limited duration (and it makes sense for this to be short), it will fade before you can report it.

Look at the whole report data…

28

Sperling (1960, p. 5)

Whole report: Presentation for 50 ms, 5 observers, more information doesn't lead to more being reported, the amount stays around 4.5 items.

29

Sensory Store

Sperling (1960):

Partial report (tone tells you which row, high middle, or low).

Sperling (1960, p. 3)

30

Sperling (1960, p. 5)

Partial report: Presentation for 50 ms, 5 observers, they report about 75% of the available information.

31

Sensory Store

Sperling (1960):

The duration of the store is less than one second.

After the letters go away, present a tone. Vary it from -0.10 seconds to + 1.0 second. Return to whole report level around 0.25 seconds.

Sperling (1960, p. 9)

32

Sensory Store

Sperling (1960):

The code is relatively unprocessed information.

Forgetting is some form of decay.

Compare Sperling's (1960) results to your CogLab data…

33

Sensory Store

So what? Method.

People’s experience is that they can see more than they can report.

Sperling figured out how to verify that empirically.

He also figured out how to assess the duration of the information.

This problem of clever method uncovering private experience is key.

34

Sensory Store

So what? Method.

Let’s figure out how to do the experiment for an auditory sensory store…

35

Sensory Store

So what? Method.

Darwin, Turvey, and Crowder (1972):

Three lists played over headphones, left ear, right ear, middle.

Visual signal tells you what to report.

Capacity was smaller than visual (about 5 letters), duration was longer (about 2 seconds).

36

Sensory Store

So what? Method.

Massaro (1970):

Play a tone, wait, play a masking tone that interferes.

People report the original tone (high/low).

After the tone goes off, people can keep processing it to identify it. If the masking tone is too early, it disrupts this echoic memory.

37

Sensory Store

So what? Method.

Massaro (1970):

Massaro (1970, p. 413)

38

Sensory Store

So what? Method.

Massaro (1970):

If they weren’t using some persistent echoic memory after the tone, then it wouldn’t matter when you masked it.

Note that this kind of turns the task inside out.

The duration is also much shorter (1/4 second).

39

Sensory Store

So what? Input from the environment.

Life moves fast; you need a record of it before it gets away.

This lets you sample for later processing.

This is the start of the process.

40

Sensory Store

So what? Reading.

Eyetracking studies (Rayner & Sereno, 1994):

Three regions:

Foveal: High resolution, about 8 letters.

Parafoveal: Less acuity, some information available, about 12 (more?) letters.

Peripheral: Very low acuity, only gross information (e.g., ends of lines).

41

Sensory Store

So what? Reading.

Three parts:

Fixations: Gather information. Average 200-250 ms, range from 100-500 ms.

Saccades: Move to new fixation, about 8 characters, range from 1 to 15.

Regressions: Saccade to earlier part of the text, 10-15% of saccades.

42

Sensory Store

So what? Reading.

You have to hold information long enough to extract some of it, and then you have to get rid of it.

Think of the speed of fluent reading.

43

Architecture

Recall our three boxes:

Sensory

Store

LTM

STM

Filter

Pattern

Recognition

Selection

Input

(Environment)

Response

Identification

44

Identification

Bottom-up vs. top-down:

Representation

Input

Knowledge, etc.

45

Müller-Lyer

Top down fail?

Müller-Lyer

Top down fail?

Müller-Lyer

Top down fail?

Müller-Lyer

Top down fail?

Top Down

Top Down

Top Down

Three Models

Template model:

Identify by comparing to a copy of everything you’ve seen.

Possible support:

Instance theories.

Perceptual priming.

53

Three Models

Rotated word…

54

Three Models

Template model:

Identify by comparing to a copy of everything you’ve seen.

Problems:

Too much variability.

Problems in matching.

What defines a match?

55

Three Models

Template model:

Identify by comparing to a copy of everything you’ve seen.

Problems:

Too much variability.

Problems in matching.

What defines a match?

Multiple interpretations of the same stimulus.

56

Three Models

E.g.,

57

Three Models

Feature model:

Identify by breaking into features and analyzing those.

Good feature sets have these properties:

Critical (help you to tell things apart).

Same with changes in the input environment.

Unique pattern for every input.

Reasonably small number of features.

58

Three Models

Feature model:

Good things about feature models:

Fit nicely with information theory: With N features, you can classify 2N things.

1 feature -> 2 things.

2 features -> 4 things.

8 features -> 256.

20 features -> 1,048,576.

Work well with computers.

59

Three Models

Feature model:

Support:

Confusion matrices.

Cluster analyses (projector).

Face recognition.

Brain organization.

60

Three Models

Feature model:

Problems:

Hard to get the right set.

Can't ignore how features combine.

61

Three Models

Structure model:

Identify by grouping

Gestalt idea: The whole is greater than the sum of its parts. Grouping principles:

Proximity: Close = part of same group.

Similarity: Similar = part of same group.

Continuity: Group into continuous forms.

62

Three Models

Structure model:

Grouping principles:

Closure: Prefer closed figures.

Connectedness: Connected part of same group.

63

Closure

Closure

Three Models

Lanthier, Risko, Stolz, & Besner (2009; doi:10.3758/PBR.16.1.167):

In addition to features, information about how features combine is also important.

Delete information from midsegments:

Or vertices:

Lanthier et al. (2009, p. 68)

66

Three Models

Lanthier et al. (2009): The kind of information deleted mattered:

Lanthier et al. (2009, p. 68)

67

Three Models

Let's try a mini-experiment based on Biederman (1987).

Break into groups A and B.

You will write down the name of the object that you see.

All of the images that follow are from Biederman (1987, p. 135)

68

Group A get ready:

69

1.

70

2.

71

3.

72

4.

73

5.

74

Group B get ready:

75

1.

76

2.

77

3.

78

4.

79

5.

80

Answers…

81

1.

82

2.

83

3.

84

4.

85

5.

86

Recoverable stimuli:

“The contours have been deleted in regions where they can be replaced through collinearity or smooth curvature” (p. 135).

87

Non-recoverable stimuli:

“The contours have been deleted at regions of concavity so that collinearity or smooth curvature of the segments bridges the concavity. In addition, vertices have been altered, for example, from Ys to Ls, and misleading symmetry and parallelism have been introduced” (p. 135).

88

Biederman (1987, p. 136)

89

Biederman (1987, p. 136)

90

Biederman (1987, p. 137)

91

Biederman (1987, p. 142)

92

Biederman (1987, p. 143)

93

Three Models

How can gestalt features help explain this?

94

Gestalt—Figure-Ground

Gestalt—Figure-Ground

Direct Perception

Gibson (e.g., 1950) proposed a theory of direct perception. A caricature:

Light is structured into an ambient optic array. Each point in this array carries potential information.

This information takes the form of affordances if a particular organism happens to be there to pick it up.

Information pick-up is direct.

97

Direct Perception

Perceiving affordances:

If I'm looking for a place to sit, sit-on-ableness will be perceived by me in objects that have that property. If I need something to throw, that will be afforded.

It's kind of like an automatic process in that there isn't conscious mediation. The affordance is just there.

98

Direct Perception

Evidence:

Warren (1984) had people look at various configurations of stairs to rate climbability.

People could accurately perceive climbability (based on a biomechanical model) from looking at the stairs.

102

Direct Perception

Evidence:

Warren (1984).

When short or tall people look at stairs, their climability judgements occur in different places.

Warren (1984, p. 689)

Direct Perception

Evidence:

Warren (1984).

When scaled to their bodies, the lines converge, and the values are almost identical and the same as predicted by the model.

Warren (1984, p. 689)

Direct Perception

Evidence:

Warren (1984) had people look at various configurations of stairs to rate climbability.

Also: There's an optimal configuration for minimum energy expenditure, people preferred stairs that fit this configuration; those preferences conformed to the model.

105

Direct Perception

Evidence:

People overestimate the slope of a hill when wearing a backpack full of rocks…

Proffitt (2006, p. 112)

106

Direct Perception

Proffitt (2006, p. 112)

107

Direct Perception

Note:

Verbal and visual subject to the illusion (effort is greater).

Haptic is not so much (that’s how you know what you’re actually seeing).

Solves a paradox: If doors look wider in some circumstances, how do you still walk through them correctly?

108

Recognition/Meaning

Meaning and identification appear to be separate.

What we'll do is look at language recognition (reading and listening) at several levels:

Letters

Orthography

Word superiority

Speech sounds

109

Recognition/Meaning

Letters: A number of features influence letter identification:

Serif vs. sans-serif (f vs. f)

Weight difference (e vs. e)

Bias

X-height

Spacing (proportional vs. non-proportional)

Proportions

110

Recognition/Meaning

Features can influence identification, as in this (admittedly not ideal) example:

111

Recognition/Meaning

Find the x:

N N Z N Z N Z N Z

Z N Z Z N Z Z N N

N N N Z N X N Z N

N N Z N Z N Z N Z

Z N Z Z N Z Z N N

112

Recognition/Meaning

Find the x:

O O P O P O P O P

P O P P O P P P O

O O P P O X P O P

O O P O P O P O P

P O P P O P P P O

113

Recognition/Meaning

Word recognition: Additional features:

Word envelope

Orthography: Rules for combining letters

Avoid doubling letters

To pronounce:

V C V

V C C V

V C

114

Recognition/Meaning

Word recognition:

Orthography

To correct a V C pattern, add a dummy e

Fin, fine, can, cane

To correct a V C V pattern, double

Runing, running

Orthography can help with:

Pronunciation: mab, mabing, mabe, mabbing

Letter expectations

115

Recognition/Meaning

An example of combining the additional features on word recognition: Word superiority effect.

Which should be identified better:

d

word

Look at CogLab data for word superiority…

116

Recognition/Meaning

Lanthier et al. (2009): When they put degraded letters into words:

Lanthier et al. (2009, p. 70)

117

Recognition/Meaning

A similar finding: Huey (1908) finds that words can be perceived at distances that are too far for the letters within the words to be perceived.

Note the paradox: How do you see the word without seeing the letters first?

118

Recognition/Meaning

Interactive activation: A possible explanation (McClelland & Rumelhart, 1981).

McClelland & Rumelhart (1981, p. 378)

119

Recognition/Meaning

McClelland & Rumelhart (1981, p. 380)

Click to edit Master text styles

Second level

Third level

Fourth level

Fifth level

120

Recognition/Meaning

McClelland & Rumelhart (1981, p. 383)

Click to edit Master text styles

Second level

Third level

Fourth level

Fifth level

121

Recognition/Meaning

McClelland & Rumelhart (1981, p. 384)

122

Recognition/Meaning

McClelland & Rumelhart (1981, p. 384)

123

Recognition/Meaning

Putting it together:

Aoccdrnig to a rscheearch at an Elingsh uinervtisy, it deosn't mttaer in waht oredr the ltteers in a wrod are, the olny iprmoetnt tihng is taht frist and lsat ltteer is at the rghit pclae. The rset can be a toatl mses and you can sitll raed it wouthit porbelm. Tihs is bcuseae we do not raed ervey lteter by it slef but the wrod as a wlohe. ceehiro. (c.f., http://www.snopes.com/language/apocryph/cambridge.asp)

124

Recognition/Meaning

OK. Let's try unfamiliar text:

Tihs was onriglaily caeertd a lnog tmie ago in rposnese to an off-tpioc cmnoemt in a Sosaldht aciltre taht I cn'at fnid any mroe. Snice tehn, i'ts been pssaed aunord a lot. I hvae no ieda how or why, but mnay tosdnauhs of polpee cmoe hree ecah week to sblarmce wodrs.

125

Recognition/Meaning

First and last letters:

Gciarcdno ot esreacrh ta na Eilsngh evrsyuinti, ti eodsn't rettma ni hwta roedr teh relstte ni a rodw rae, eth nyol rpatotinm hitng si atht rftis nad satl tetrel si ta teh rihgt ecalp. Eht rets anc eb a toalt smse nda yuo nca lltis arde ti houtitw opmlrbe.

126

Recognition/Meaning

And the unfamiliar text:

Isht aws inialrylog redtcea a lnog miet goa ni opersesn ot na ffo-tiocp nmmotce ni a Lshsatod tacrlei ttah I ctan' fidn yan meor. Ncise nthe, 'tsi eenb daspes dornau a lto. I haev on edia ohw ro wyh, ubt nyma dostansuh fo eeplop coem heer hcae ekwe ot cresmbla rwods.

127

Recognition/Meaning

Clearly, the “first letters” claim has some validity. I would suggest that it gives two sources of constraint that still makes reading relatively easy:

Some of the letter order information is preserved in every word (word envelope).

Words of three letters or less are intact. Given the role of function words, that's a big deal.

128

Recognition/Meaning

Relate this to the interactive activation model. How might it account for your being able to read scrambled words? Or, why might it have a problem accounting for this?

129

Recognition/Meaning

What about these?

Tihs txet uess olny wdors taht are shrot.

Hilpapy aoutrhs sitll issnit on fwinollog dullfready oslotebe snellpig.

Psleae ntoe taht any uaeicipnntatd hmras to paiitprtancs or asrvede eetvns must be reroeptd to the oficfe of cnpioalmce.

Rsaerech tseihs taht cltionlecg cmlteope reday yuor hvae you sbuimt mnaes fisihend you dtaa are and to.

130

Recognition/Meaning

What about these?

sbalermcd

saremlbcd

 

unirevisty

ustveniriy

ueiinrstvy

131

Recognition/Meaning

What about these?

This list of words: painters, pertains, pantries, in loco parentis.

a. anhtrsicit, piacvrote, lsngoievns, chlrucanbe

132

Recognition/Meaning

What about these?

This list of words: painters, pertains, pantries, in loco parentis.

a. anhtrsicit, piacvrote, lsngoievns, chlrucanbe

b. antichisrt, proaictve, lonivgness, ccrnuhable

133

Recognition/Meaning

What about this?

4|)V4|\|C3D l3e+$peA|< i$ whEn J00 +4lK L1K3 t|-|15. t0 u|\|d3r$+@|\|D jOo |\/|u5+ be lEET. 1f J00 4r3 NO+ lEe+ jOO C@|\|N0T 5p3A|< 0r ReAd +|-|I5.

134

Recognition/Meaning

What about speech?

Phone: Sound you can make (4,096)

896 used

about 100 account for almost all languages

Phoneme: Group of phones treated as one sound by a language

Morpheme: Smallest unit that conveys meaning

Listeme: An entry in your mental dictionary

135

Recognition/Meaning

What about speech?

Features matter. For example:

136

Recognition/Meaning

What about speech?

Two ways to describe speech:

Articulatory phonetics: Based on how you make a sound. A speech sound is:

Air + voicing + manner + place

Here's a link to a page with a clickable glossary: https://en.wikipedia.org/wiki/Place_of_articulation

137

Recognition/Meaning

English:

138

Recognition/Meaning

What about speech?

Two ways to describe speech:

Acoustic phonetics: Based on the actual, physical sound wave that is produced.

139

Recognition/Meaning

What about speech?

Acoustic phonetics uses the spectrogram.

Recognition/Meaning

What about speech? A couple of issues to provide an example:

Parallel transmission:

Recognition/Meaning

What about speech? A couple of issues to provide an example:

Context conditioned variation:

Recognition/Meaning

What about speech? An example of a top-down influence on speech perception:

McGurk effect: https://www.youtube.com/watch?v =aFPtc8BVdJk

Recognition/Meaning

Other context effects in recognition: Palmer (1975): Objects are recognized in an appropriate scene more than in an inappropriate context.

Palmer (1975, p. 519)

Recognition/Meaning

Palmer (1975, p. 521)

Meaning

A word means:

What it refers to (what it stands for)?

The image it calls up?

What else?

Meaning

A word means: A two-part model:

Propositions: Ideas

Models derived from them

E.g., the star is to the left of the circle

Grounding

Chinese-room problem: How are symbols grounded?

Embodied cognition?

Other ideas?

Dirty Disney

Dirty Disney

Dirty Disney

Dirty Disney

End of Pattern Recognition Show