project Ahmed

profileAboRedha
-1224432696.pdf

Scandinavian Journal of Psychology, 2009, 50, 395–403 DOI: 10.1111/j.1467-9450.2009.00742.x

Background and Basic Processes

Cognition and hearing aids

THOMAS LUNNER,1,2,3,4 MARY RUDNER3,4 and JERKER RÖNNBERG3,4

1Oticon A/S, Research Centre Eriksholm, Snekkersten, Denmark 2Department of Clinical and Experimental Medicine, Linköping University, Sweden 3Linnaeus Centre HEAD, Swedish Institute for Disability Research, Linköping University, Sweden 4Department of Behavioural Sciences and Learning, Linköping University, Sweden

Lunner, T., Rudner, M. & Rönnberg, J. (2009). Cognition and hearing aids. Scandinavian Journal of Psychology, 50, 395–403.

The perceptual information transmitted from a damaged cochlea to the brain is more poorly specified than information from an intact cochlea and requires more processing in working memory before language content can be decoded. In addition to making sounds audible, current hearing aids include several technologies that are intended to facilitate language understanding for persons with hearing impairment in challenging listening situ- ations. These include directional microphones, noise reduction, and fast-acting amplitude compression systems. However, the processed signal itself may challenge listening to the extent that with specific types of technology, and in certain listening situations, individual differences in cognitive processing resources may determine listening success. Here, current and developing digital hearing aid signal processing schemes are reviewed in the light of individual working memory (WM) differences. It is argued that signal processing designed to improve speech understanding may have both positive and negative consequences, and that these may depend on individual WM capacity.

Key words: Working memory, hearing aids, signal processing, cognition, noise reduction.

Thomas Lunner, PhD, Oticon A/S Research Centre Eriksholm, Kongevejen 243, DK-3070 Snekkersten, Denmark. Tel: +45 48 29 89 18; fax: +45 49 22 36 29; e-mail: [email protected]

INTRODUCTION

Advances in hearing aid technology are of great potential

benefit to persons with hearing impairment. It is estimated that

approximately 15% of the western population have a hearing

impairment of such an extent that they would benefit from

amplified hearing by way of hearing aids. Modern hearing aids

incorporate technologies such as multiple-band wide dynamic

range compression, directional microphones, and noise

reduction. Individual settings for most of these functions are

primarily based on pure-tone thresholds. Therefore, persons

with hearing impairment with the same audiogram will receive

similar hearing aid fitting even though they may have different

supra-threshold auditory abilities relating to different patholo-

gies or individual cognitive abilities. The research community

has acknowledged that successful (re)habilitation of persons

with hearing impairment must be individualized and based on

understanding of underlying mechanisms, especially the mech-

anisms of cochlear damage and language understanding. This

paper is based on recent data suggesting that ease of language

understanding is highly dependent on the individual’s working

memory (WM) capacity in challenging speech understanding

conditions, and focuses especially on a discussion of how

different types of signal processing concepts in hearing aids

may support or challenge the storage and processing functions

of WM.

The main issues delineated in this paper concern the

trade-off between individual WM capacity – seen from an

� 2009 The Authors. Journal compilation � 2009 The Scandinavian Psycho Garsington Road, Oxford OX4 2DQ, UK and 350 Main Street, Malden, MA

intra-individual as well as an inter-individual perspective – and

the benefits and costs involved in more advanced signal pro-

cessing, as well as factors that modulate the signal-cognition

interaction. The paper ends with a rather radical suggestion of

a concept that takes the individual storage and processing

function of WM into account to steer the function of the signal

processing in the hearing aid.

WORKING MEMORY AND INDIVIDUAL DIFFERENCES

This section is largely inspired by Pichora-Fuller (2007), and

sets up the framework of WM differences under listening con-

ditions that challenge cognitive capacity in different ways.

When listening becomes difficult, for example because of irrel-

evant sound sources interfering with the target signal or

because of a poorly specified input signal due to hearing

impairment, listening must rely more on prior knowledge and

context than would be the case when the incoming signal is

clear and undistorted. This shift from mostly bottom-up (sig-

nal-based) to mostly top-down (knowledge-based) processing

is accompanied by a sense of listening being more effortful.

In a review of different models of WM, Miyake and Shah

(1999) concluded that many fitted the following generic

description: working memory is those mechanisms or pro-

cesses that are involved in the control, regulation, and active

maintenance of task-relevant information in the service of

complex cognition, including novel as well as familiar, skilled

tasks.

logical Associations. Published by Blackwell Publishing Ltd., 9600 02148, USA. ISSN 0036-5564.

396 T. Lunner et al. Scand J Psychol 50 (2009)

The WM model for Ease of Language Understanding (ELU,

Rönnberg 2003; Rönnberg, Rudner, Foo & Lunner, 2008) pro-

poses that under favorable listening conditions, language input

can be rapidly and implicitly matched to stored phonological

representations in long-term memory, whereas under subopti-

mum conditions, it is more likely that this matching process

may fail. In such a mismatch situation, the model predicts that

explicit, or conscious, cognitive processes must be engaged to

decode the speech signal. Thus, under taxing conditions, lan-

guage understanding may be a function of explicit cognitive

capacity; whereas under less taxing conditions it may not.

WM has been proposed to consist of a number of different

components including processing buffers (Baddeley, 1986,

2000), and individual differences in WM function (e.g. Engle,

Cantor & Carullo, 1992) could relate to any of them. Indeed,

researchers have investigated a variety of properties that

contribute to individual differences in WM (e.g., resource

allocation, Just & Carpenter, 1992; buffer size, Cowan, 2001;

Wilken & Ma, 2004; processing capacity, Feldman Barrett,

Tugade & Engle, 2004; Halford, Wilson & Phillips, 1998). In

the following discussion it is assumed that, within the capacity

constraint, resources can be allocated to either processing or

storage, or both. A simple additive model is assumed;

C ¼ P þS ð1Þ

where C is the available individual WM capacity, P is the pro-

cessing component of WM, and S is the storage component of

WM. This is schematically illustrated in Fig. 1b, where the

black bars illustrate the processing (P) component, and the

grey bars illustrate the storage (S) component. For a given C,

the additive relationship defines how much of either P or S

that will be left if the other is used. If the processing and stor-

age demands of a particular task exceed available capacity this

may result in task errors, loss of information from temporary

storage (temporal decay of memories, forgetting) or slower

processing.

(a)

(b)

Fig. 1. Schematic representations of inter-individual differences in working memory capacity (a) suggesting that two individuals may differ in their working memory capacity, and (b) intra-individual differences suggesting that for a given individual the allocation of the person’s limited capacity to the processing and storage functions of working memory varies with task demands. (Adapted from Pichora- Fuller, 2007).

� 2009 The Authors. Journal compilation � 2009 The Scandinavian Psycho

For any given individual, the greater the demands made on

the processing function of WM, the fewer resources can be

allocated to its storage function. For example, distorting the

signal or reducing the signal-to-noise ratio (SNR) or the avail-

ability of supportive contextual cues (e.g., Pichora-Fuller,

Schneider & Daneman, 1995) would all increase processing

demands with possible consequent reduction of available stor-

age capacity. Thus, recall of words or sentences is better when

target speech can be clearly heard (Rabbitt, 1968; Tun &

Wingfield 1999; Wingfield & Tun 2001; Pichora-Fuller et al.,

1995).

Complex WM tasks require simultaneous storage (maintain-

ing information in an active state for later recall) and process-

ing (manipulating information for a current computation;

Daneman & Carpenter, 1980). In the reading span task, a WM

task based on sentence processing, the participant reads a sen-

tence and completes a task that requires trying to understand

the whole sentence (by reading it aloud, repeating it, or judg-

ing it for some property such as whether the sentence make

sense or not). Following the presentation of a set of sentences,

the respondent is asked to recall the target word (such as the

first or last word in the sentence) of each sentence in the set.

The number of sentences in the recall set is increased and

recall errors noted as a function of number of sentences in the

set (WM span, WMS). The span score typically reflects the

maximum number of target words that are correctly recalled.

Span size is significantly correlated with language comprehen-

sion (Daneman & Carpenter, 1980; Daneman & Merikle,

1996). WM span measured in this way can also vary within

individuals as a function of task (Fig. 1b).

WORKING MEMORY AND HEARING LOSS

Speech recognition performance is affected for people with

hearing impairment even under relatively favorable external

SNR conditions (e.g., Larsby, Hällgren, Lyxell & Arlinger,

2005; McCoy, Tun, Cox, Colangelo, Stewart & Wingfield,

2005; Plomp, 1988; van Boxtel, van Beijsterveldt, Houx,

Anteunis, Metsemakers & Jolles, 2000). For persons with

hearing loss, perceived listening effort (as assessed by ratings

of subjective effort in different situations), may indicate the

degree to which limited WM resources are allocated to per-

ceptual processing (Rudner, Lunner, Behrens, Sundewall

Thorén & Rönnberg, 2009). Higher levels of perceived effort

may indicate fewer resources for information storage, suggest-

ing that listeners who are hard of hearing would be poorer

than listeners with normal hearing on complex auditory tasks

involving storage. Indeed, results by Rabbitt (1990) suggest

that listeners who are hard of hearing allocate more informa-

tion processing resources to the task of initially perceiving

the speech input, leaving fewer resources for subsequent

recall.

Figure 2 shows results from an experiment by Lunner

(2003) with 72 patients who had similar levels of hearing loss

as indicated by pure-tone audiograms. The participants’

logical Associations.

Correlation: r = –0.61

0 10 20 30 40 50 60 70 80 90 100 Reading Span (% correct)

–8

–6

–4

–2

0

2

4

6

8

S R

T in

n o

is e

( d

B )

Fig. 2. Scatterplot and regression line showing correlation between reading span and speech recognition in noise (n = 72). Pearson correlations with 95% confidence limits for the correlation coefficient are shown. Low (negative) SRT means high performance in noise. (Replotted from Lunner, 2003).

Scand J Psychol 50 (2009) Cognition and hearing aids 397

hearing aids were adjusted to assure audibility of the target

signal and their speech reception thresholds (SRT) in noise

were determined. SRT was defined as the level at which 50%

of words presented were correctly recalled. Individual WM

capacity, as measured by the reading span test (Andersson,

Lyxell, Rönnberg & Spens, 2001; Daneman & Carpenter,

1980; Rönnberg, 1990), accounted for 40% of the inter-indi-

vidual variance. That is, WMS was a good predictor of SRT.

These findings have been confirmed in subsequent studies

(Akeroyd, 2008; Foo, Rudner, Rönnberg & Lunner, 2007;

Rudner, Foo, Sundewall-Thorén, Lunner & Rönnberg, 2008).

HEARING AID SIGNAL PROCESSING AND INDIVIDUAL

WM DIFFERENCES

Below, we review evidence indicating that some types of hear-

ing aid signal processing may release WM resources, resulting

in better storage capacity and faster information processing in

challenging listening situations. However, hearing aid signal

processing may also challenge listening by generating

unwanted processing artifacts by distorting the auditory scene

(i.e. the distinct auditory objects builds up the listening envi-

ronment, see e.g. Shinn-Cunningham, 2008), or generating

audible artifacts and other unintended side-effects such as dis-

tortions of the target signal waveform (Stone & Moore, 2004;

2008), thereby taxing WM resources. The trade-off between

WM benefits and signal processing artifacts may depend on

the individually available cognitive resources, and therefore

individual differences in cognitive processing resources may

determine listening success with specific types of technology.

Inter-individual differences in capacity limitations constrain-

ing WM processing and storage may explain why one listening

situation may be too challenging for one individual but not for

another. Increases in WM span post hearing-aid intervention

� 2009 The Authors. Journal compilation � 2009 The Scandinavian Psycho

(i.e. intra-individual improvements in WM storage) would sug-

gest that the intervention has resulted in listening becoming

easier with fewer WM processing resources needing to be

allocated.

Signal processing in hearing aids is designed to help users

specifically in challenging listening situations. Usually the

objective is, by some means, to remove signals that are less

important in a particular situation and/or to emphasize or

enhance signals that are more important. However, the conse-

quences for the individual in terms of communicative benefit

may depend on individual WM capacity. Several studies indi-

cate that pure tone hearing threshold elevation is the primary

determinant of speech recognition performance in quiet back-

ground conditions, e.g. in a conversation with one person or

listening to the television under otherwise undisturbed condi-

tions (see e.g., Dubno, Dirks & Morgan, 1984; Magnusson,

Karlsson & Leijon, 2001; Schum, Matthews & Lee, 1991).

Thus, in less challenging situations, individual differences in

WM are possibly of secondary importance for successful lis-

tening. The individual peripheral hearing loss is the main con-

straint on performance, and the most important objective for

the hearing aid signal processing is to make sounds audible.

This can be by means of slow-acting compression (e.g. Dillon,

1996; Lunner, Hellgren, Arlinger & Elberling, 1997). A slow-

acting compression system maintains near constant gain-fre-

quency response in a given speech/noise listening situation,

and thus preserves the differences between short-term spectra

in the speech signal. In less challenging listening situations,

greater WM capacity confers relatively little benefit and the

same is true of advanced signal processing designed to

enhance target speech and/or to reduce interfering noise. In

more challenging situations, however, signal processing

designed to enhance speech and/or to reduce noise may – or

may not – benefit the hearing aid user, depending on the

implementation.

Even though speech recognition performance may not

always be improved by the hearing aid signal processing,

reductions in subjectively rated listening effort may result (e.g.

Schulte, Vormann, Wagener, Büchler, Dillier, Dreschler,

Eneman et al., 2009). SRT in noise is typically negative (see

e.g. Fig. 2). Speech-to-noise ratios of 5dB or higher are

realistic values for real-life conversation situations, such as

conversing inside or outside urban homes (Pearsons, Bennett

& Fidell, 1977). In such listening situations, conventional SRT

tests are insensitive to signal processing improvements, and

other measures such as subjective rating of listening effort

(Rudner et al., 2009; Schulte et al., 2009) or increases in WM

span-scores post hearing-aid intervention may be a better

predictor of hearing aid signal processing effects.

Directional microphones

Modern hearing aids can usually be switched between omni-

directional and directional microphones. Directional micro-

phone systems are designed to take advantage of the spatial

logical Associations.

398 T. Lunner et al. Scand J Psychol 50 (2009)

differences between the relevant signal and noise. Directional

microphones are more sensitive to sounds coming from the

front than sounds coming from the back and the sides. The

assumption is that because people usually turn their heads to

face a conversational partner, frontal signals are most impor-

tant, while sounds from other directions are of less impor-

tance. Several algorithms have been developed to provide

maximum attenuation of moving or fixed noise source(s)

behind the listener (see e.g. van den Bogaert, Doclo, Wouters

& Moonen, 2008). Usually, switching between directional

microphone and omni-directional microphone takes place auto-

matically in situations that are determined by the SNR-estima-

tion algorithm to be beneficial for the particular type of

microphone. The directional microphone usually comes into

play when estimated SNR is below a given threshold value,

and the target signal is estimated to be coming from the frontal

position.

A review by Ricketts (2005) addressed the benefit of direc-

tional microphones compared to omni-directional, showing that

with the directional microphone, SNR improvement could be

as high as 6–7 dB, and was typically 3–4 dB, in certain noisy

environments. The noisy environments where directional bene-

fit was seen were characterized by (a) no more than moderate

reverberation, (b) the listener facing the sound source of inter-

est, and (c) the distance to this source being rather short. The

SRT in noise shows improvements in accordance with the

SNR improvements (Ricketts, 2005). Thus, at least in particular

situations, directional microphones give a clear and documented

benefit.

However, if the target is not in front or if there are multiple

targets, the attenuation of sources from directions other than

frontal by directional microphones may interfere with the audi-

tory scene (Shinn-Cunningham, 2008; Shinn-Cunningham &

Best, 2008). In natural communication, the listener often

switches attention to different locations. Therefore, omni-direc-

tional microphones may be preferred in situations requiring

frequent shifts of attention or monitoring of sounds at multiple

locations. Unexpected or unmotivated automatic switches

between directional and omni-directional microphones may

be cognitively disturbing if the switching interferes with the

listening situation (Shinn-Cunningham & Best, 2008). Van den

Bogaert et al. (2008) have shown that directional microphone

algorithms substantially interfere with localization of target and

noise sources, suggesting that directional microphones may, in

addition to attenuating lateral sources, distort natural monitor-

ing of sounds at multiple locations.

Sarampalis et al. (2009) investigated WM performance

under different SNRs, ranging from )2 dB to +2 dB, to simu- late the improvement in SNR by directional microphones com-

pared to omni-directional microphones. The WM test was a

dual-task paradigm with (a) a primary perceptual task involv-

ing repeating the last word of sentences presented over head-

phones, and (b) a secondary memory task involving recalling

these words after each set of eight sentences (Pichora-Fuller

et al., 1995). The sentences were high- and low-context

� 2009 The Authors. Journal compilation � 2009 The Scandinavian Psycho

sentences from the Revised Speech Perception in Noise Test (Bil-

ger, Nuetzel, Rabinowitz & Rzeczkowski, 1984). Performance on

the secondary (memory) task improved significantly in the +2dB

SNR condition which simulated directional microphones. The

directional microphone intervention may have freed some

WM resources, increasing storage capacity in the (tested) noisy

situations.

Inter-individual and intra-individual differences in WM

capacity may also play a role in determining the benefit of

directional microphones for a given individual in a given

situation. Consider, for example, Fig. 2, in a situation with

0 dB SNR (dash-dotted line). If we assume that the individual

SRT in noise reflects the SNR at which WM capacity is

severely challenged, Fig. 2 indicates that the WM capacity limit

is challenged at about )5 dB for a high WM capacity person. At 0 dB SNR, the person with high WM capacity probably

possesses the WM capacity to use the omni-directional micro-

phone, while at )5 dB this person may need to sacrifice the omni-directional benefits and use the directional microphone to

release WM resources. However, for the person with low WM

capacity, even the 0 dB situation probably challenges WM

capacity limits. Therefore, this person is probably best helped

by selecting the directional microphone at 0 dB to release WM

resources, thereby sacrificing the omni-directional benefits.

Thus, it may be the case that the choice of SNR at which the

directional microphone is invoked should be a trade-off

between omni-directional and directional benefits and individ-

ual WM capacity, and that inter-individual differences in WM

performance may be used to individually set the SNR thresh-

old at which the hearing aid automatically shifts from omni-

directional to directional microphone.

Noise reduction systems

Noise reduction systems, or more specifically, single micro-

phone noise reduction systems, are designed to separate target

speech from disturbing noise by using a separation algorithm

operating on the input. Different amplification is applied to the

separated estimates of speech and noise, thereby enhancing the

speech and/or attenuating the noise (e.g. Bentler & Chiou,

2006; Chung, 2004).

There are several approaches to obtaining separate estimates

of speech and noise signals. One approach applied in current

hearing aids is to use the modulation index (or modulation

depth) as a basis for the estimation. The idea is that speech

includes more level modulations than noise (see e.g., Plomp,

1994) and thus that the higher the modulation index the greater

the likelihood that a target signal has been identified. Algo-

rithms to calculate the modulation index usually operate in

several frequency bands. If a frequency band has a high modu-

lation index, it is classified as including speech and is given

more amplification, while frequency bands with less modula-

tion are classified as noise and thus attenuated (see e.g., Hol-

ube, Hamacher & Wesselkamp, 1999). Other noise reduction

approaches include the use of the level-distribution function

logical Associations.

Scand J Psychol 50 (2009) Cognition and hearing aids 399

for speech (Ludvigsen, 1997) or voice-activity detection by

synchrony detection (Schum, 2003). However, the relative esti-

mation of speech and noise components on a short-term basis

(milliseconds) is very difficult, and misclassifications may

occur. Therefore, commercial noise reduction systems in hear-

ing aids are typically very conservative in their estimation of

speech and noise components, and only give a rather long-term

estimation (seconds) of noise or speech. Such systems do not

seem to aid speech recognition in noise (Bentler & Chiou,

2006). Nevertheless, typical commercial noise reduction

systems do give a reduction in overall loudness of the noise

compared to the target signal, which is rated as improving

comfort (Schum, 2003) and thus may reduce the annoyance

and fatigue associated with using hearing aids.

Noise reduction systems with more aggressive forms of sig-

nal processing are described in the literature, including ‘‘spec-

tral subtraction’’ or weighting algorithms where the noise is

estimated either in brief pauses of the target signal or by mod-

eling the statistical properties of speech and noise (e.g.

Ephraim & Malah, 1984; Lotter & Vary 2003; Martin, 2001;

Martin & Breithaupt, 2003; for a review see Hamacher, Chal-

upper, Eggers et al., 2005). The estimates of speech and noise

are subtracted or weighted on a short-term basis in a number

of frequency bands, which gives a less noisy signal. However,

this comes at the cost of another type of distortion usually

called ‘‘musical noise’’ (Takeshi, Takahiro, Yoshihisa & Tets-

uya, 2003). This extraneous signal may increase cognitive load

during listening since it is a competing, and probably distract-

ing signal, the suppression of which may consume WM

resources. Thus, in optimizing noise reduction systems there is

a trade-off between the amount of noise-reduction and the

amount of distortion.

Sarampalis et al. (2006, 2008, 2009) investigated the WM

capacity of listeners with normal hearing and listeners with

mild to moderate sensorineural hearing loss, using the dual-

task paradigm described earlier. Auditory stimuli were pre-

sented with or without a short-term noise reduction scheme

based on the algorithm proposed by Ephraim & Malah (1984).

For people with normal hearing there was some recall

improvement with noise reduction in low-context sentences.

The authors interpreted this as demonstrating that the algorithm

mitigated some of the deleterious effects of noise by reducing

cognitive effort. However, the results for the listeners with

hearing impairment were not easily interpreted. More research

is needed with regard to individual WM differences and short-

term noise reduction systems to determine the circumstances

under which these systems may release WM resources.

Another recent approach to the separation of speech from

speech-in-noise is the use of binary time-frequency masks (e.g.

Wang, 2005; Wang, 2008; Wang, Kjems, Pedersen, Boldt &

Lunner, 2009). The aim of this approach is to create a binary

time-frequency pattern from the speech/noise mixture. Each

local time-frequency unit is assigned to either a 1 or a 0

depending on the local SNR. If the local SNR is favorable for

the speech signal this unit is assigned a 1, otherwise it is

� 2009 The Authors. Journal compilation � 2009 The Scandinavian Psycho

assigned a 0. This binary mask is then applied directly to the

original speech/noise mixture, thereby attenuating the noise

segments. A challenge for this approach is to find the correct

estimate of the local SNR.

Ideal binary masks (IBM) have been used to investigate the

potential of this technique for hearing impaired test subjects

(Anzalone, Calandruccio, Doherty & Carney, 2006; Wang,

2008; Wang et al., 2009). In IBM-processing, the local SNR is

known beforehand, which is not the case in a realistic situation

with non-ideal detectors of speech and noise signals. Thus,

IBM is not directly applicable in hearing aids. Wang et al.

(2009) evaluated the effects of IBM processing on speech

intelligibility for listeners with hearing impairment by assess-

ing the SRT in noise. For a cafeteria background, the authors

observed a 15.6 dB SRT reduction (improvement) for listeners

with hearing impairment, which is a very large effect. If indi-

vidual SRTs reflect the situation where the WM capacity is

severely challenged, applying IBM processing in difficult lis-

tening situations would release WM resources. However, IBM

may produce distortions that increase the cognitive load,

especially in realistic binary mask applications where the

speech and noise are not available separately, but have to be

estimated. Thus, a trade-off may have to be made between

noise reduction and distortion in a realistic noise reduction

system.

In situations where the listener’s cognitive system is unchal-

lenged, using a noise reduction system may be redundant or

even counterproductive, since distortion of the signal could

outweigh any possible gain in SNR. However, since realistic

short-term noise reduction schemes (including realistic binary

mask processing) will rely on a trade-off between amount of

noise reduction and minimization of processing distortions, the

use of such systems may be dependent on the inter-individual

WM differences, suggesting that persons with high WM capac-

ity may tolerate more distortions and thus more aggressive

noise reduction than persons with low WM capacity in a given

listening situation.

Fast acting wide dynamic range compression

A fast-acting wide dynamic range compression (WDRC) sys-

tem is usually called fast compression or syllabic compression

if it adapts rapidly enough to provide different gain-frequency

responses for adjacent speech sounds with different short-term

spectra. This contrasts with slow-acting WDRC (slow com-

pression or automatic gain control). These systems maintain

near constant gain-frequency response in a given speech/noise

listening situation, and thus preserve the differences between

short-term spectra in the speech signal. Hearing-aid compres-

sors usually have frequency-dependent compression ratios,

because hearing loss generally varies with frequency. However,

WDRC can be configured in many ways, with different goals

in mind (Dillon, 1996; Moore, 1998). In general, compression

may be applied in hearing aids for at least three different rea-

sons (e.g., Leijon & Stadler, 2008):

logical Associations.

400 T. Lunner et al. Scand J Psychol 50 (2009)

(1) To present speech at a comfortable loudness level, com-

pensating for variations in voice characteristics and speaker

distance.

(2) To protect the listener from transient sounds that would be

uncomfortably loud if amplified with the gain-frequency

response needed for conversational speech.

(3) To improve speech understanding by making also very

weak speech segments audible, while still presenting lou-

der speech segments at a comfortable level.

A fast compressor can to some extent meet all three purposes,

whereas a slow compressor alone can fulfill only the first

objective. Fast compression may have two opposing effects

with regard to speech recognition: (a) it can provide additional

amplification for weak speech components that might other-

wise be inaudible, and (b) it reduces spectral contrast between

speech sounds. It has yet to be fully investigated which of

these effects has the greatest impact on speech recognition in

noise for the individual, with regard to individual WM capac-

ity. The first studies that systematically investigated individual

differences in coping with the speed of compression were

those of Gatehouse, Naylor and Elberling (2003, 2006a,

2006b). These studies indicated that both cognitive capacity

and auditory ecology have explanatory value as regards indi-

vidual outcome of, for example, speech recognition in noise

and subjectively assessed listening comfort. In a study that rep-

licated some of the findings of the Gatehouse et al studies

(Fig. 3; Lunner & Sundewall-Thorén, 2007), the cognitive test

scores of listeners with hearing loss were significantly corre-

lated with the differential advantage of fast compression versus

slow compression in conditions of modulated noise. Other

studies have shown that cognitive performance is related to the

Correlation: r = 0.49

0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 Cognitive performance score (d')

–10

–8

–6

–4

–2

0

2

4

6

8

F a s t

m in

u s S

lo w

b e n

e fi

t (d

B )

Fig. 3. Scatter plot and regression line showing the Pearson correlation between the cognitive performance score and differential benefit in speech recognition in modulated noise of fast versus slow compression. A positive value on the Fast minus Slow benefit (dB) axis means that fast compression obtained better SRT in noise compared to slow compression. (Replotted from Lunner & Sundewall- Thorén, 2007).

� 2009 The Authors. Journal compilation � 2009 The Scandinavian Psycho

ability to cope with new compression settings (Foo et al.,

2007, Rudner et al., 2008).

Figure 3 shows a scatterplot and regression line showing the

Pearson correlation between the cognitive performance score

and differential benefit of fast versus slow compression in

speech recognition in modulated noise. The correlation in

Fig. 3 is plausibly explained as an interaction between cogni-

tive performance and fast compression as suggested by Naylor

and Johannesson (2009). These authors have shown that the

long-term SNR at the output of an amplification system that

includes amplitude compression may be higher or lower than

the long-term SNR at the input, dependent on interactions

between the actual long-term input SNR, the modulation char-

acteristics of the signal and noise being mixed, and the ampli-

tude compression characteristics of the system under test.

Specifically, fast compression in modulated noise may increase

output SNR at negative input SNRs, and decrease output SNR

at positive input SNRs. Such shifts in SNR between input and

output values may potentially affect perceptual performance

for users of compression hearing aids. The compression-related

SNR shift affects perceptual performance in the corresponding

direction (G. Naylor, R.B. Johannessen & F.M. Rønne, per-

sonal communication, December 2008); a person performing at

negative SNRs may therefore be able to understand speech

better with fast compression while the same may not be true

for a person performing at positive SNRs. Thus, the relative

SNR at which listening takes place is another factor which

determines if fast compression is beneficial or not. A person

with high WM capacity and SRT at a negative SNR would

probably benefit from fast compression in that particular situa-

tion, while a person with low WM capacity and SRT at a posi-

tive SNR might be put at a disadvantage.

COGNITION-DRIVEN SIGNAL PROCESSING

From the examples above it seems that inter-individual and

intra-individual WM differences should be taken into account

in the development of hearing-aid signal-processing algorithms

and when they are adjusted for the individual hearing-aid user.

Often it will be a case of balancing the trade-off between

opposing effects in relation to the individual’s WM capacity.

For directional microphones the trade-off is between omni-

directional and directional benefits; for realistic short-term

noise reduction schemes it is between amount of noise reduc-

tion and processing distortion and for fast-acting versus slow

compression it is between absolute performance levels in SNR

and the choice to invoke fast compression to improve output

SNR.

In less challenging situations, individual differences in WM

are possibly of secondary importance for successful listening.

The individual peripheral hearing loss is the main constraint

on performance, and the most important objective for the

design of hearing aid signal processing is to make

sounds audible (e.g. by slow acting compression, e.g. Dillon,

1996).

logical Associations.

Scand J Psychol 50 (2009) Cognition and hearing aids 401

In more challenging listening situations, hearing aid signal

processing systems such as directional microphones, noise

reduction systems, and fast compression should be activated

on an individual basis to release WM resources, taking into

account the above mentioned trade-offs between signal pro-

cessing benefits and drawbacks. Below, a new and rather radi-

cal concept is suggested where knowledge of individual WM

resources is combined with knowledge on hearing aids signal

processing concepts.

One way to conceptualize the hearing aid processing

requirements would be as a ‘‘hearing aid with cognition-driven

signal-processing’’, where the hearing aid signal processing is

designed to take individual cognitive capacity into account to

optimize speech understanding. The construction of such a

cognition-driven hearing aid requires monitoring of the indi-

vidual ‘‘cognitive workload’’ on a real-time basis, in order to

determine the level at which the listening situation starts to

challenge WM resources. WM resources are challenged differ-

ently depending on listening situation, and different individuals

may have different cognitive resources available to handle such

specific workloads. Therefore, there is a need to develop

monitoring methods for estimating cognitive workload. Two

different lines of research can be foreseen; indirect estimates

of cognitive workload and direct estimates of cognitive

workload.

Indirect estimates of cognitive workload would use some

form of cognitive model that is continuously updated with

environment detectors that monitor the listening environment

(e.g., level detectors, SNR detectors, speech activity detectors,

reverberation detectors), as well as the conversational situa-

tion; the identity, mood and behavior of the conversational

partner as well as the purpose of the communication (social,

information exchange) and feedback pattern. The cognitive

model should produce at least two states, indicating cognitive

High load or cognitive Low load. If cognitive High load is

detected, hearing aid signal processing systems, such as direc-

tional microphones, noise reduction systems and fast compres-

sion, should be invoked to release cognitive resources. The

cognitive model needs to be calibrated with the individual

cognitive capacity (e.g., WM capacity, verbal information pro-

cessing speed), and connections between listening environ-

ment monitors, hearing aid processing system, and cognitive

capacities have to be established. Inspiration might be found

in the ease of language understanding (ELU) model of Rönn-

berg et al. (2008), which has a framework for suggesting

when a listener’s WM system switches from effortless implicit

(bottom-up) processing to effortful explicit (top-down) pro-

cessing.

However, a more direct way to assess cognitive workload

would be through physically measurable correlates (e.g. Kra-

mer, 1991). Given direct estimates of cognitive load, measures

of cognitive High and Low load could be established. How-

ever, relations between environment characteristics, signal

processing features and cognitive relief would still have to be

established. A straightforward, but technically challenging

� 2009 The Authors. Journal compilation � 2009 The Scandinavian Psycho

example of a direct estimate of cognitive High and Low load

could be obtained by electroencephalographic measurements

(EEG; Gevins, Smith, McEvoy & Yu, 1997). A wearable sys-

tem has been proposed by Lan, Erdogmus, Adami, Mathan &

Pavel (2007), which could be used to produce a cognitive state

classification system based on EEG measurements. This could

possibly be used to control the parameters of hearing aid sig-

nal processing algorithms to individually reduce cognitive

‘‘workload’’ in challenging listening situations.

In summary, the concept of cognition-driven hearing aid sig-

nal processing is at the meeting point between the audiological

and cognitive psychology disciplines, and mutual research is

of great benefit to the development of our understanding of

how hearing aid signal processing interacts with cognitive abil-

ities. In the long term, a cognition-driven hearing aid could be

beneficial not only for optimizing signal processing but also

for minimizing the negative impact of sensory impairment on

cognitive function.

We would like to thank Stig Arlinger, Kathleen Pichora-Fuller, Graham Naylor, and two anonymous reviewers for their helpful and insightful comments on earlier versions of this manuscript.

REFERENCES

Akeroyd, M. A. (2008). Are individual differences in speech reception related to individual differences in cognitive ability? A survey of twenty experimental studies with normal and hearing-impaired adults. International Journal of Audiology, 47(Suppl. 2), S125– S143.

Andersson, U., Lyxell, B., Rönnberg, J. & Spens, K.-E. (2001). Cogni- tive correlates of visual speech understanding in hearing-impaired individuals. Journal of Deaf Studies and Deaf Education, 6, 103– 116.

Anzalone, M. C., Calandruccio, L., Doherty, K. A. & Carney, L. H. (2006). Determination of the potential benefit of time-frequency gain manipulation. Ear and Hearing, 27, 480–492.

Baddeley, A. D. (1986). Working memory. Oxford: Oxford University Press.

Baddeley, A. D. (2000). The episodic buffer: A new component of working memory? Trends in Cognitive Science, 4(11), 417–423.

Bentler, R. & Chiou, L.-K. (2006). Digital noise reduction: An over- view. Trends in Amplification, 10(2), 67–82.

Bilger, R. C., Nuetzel, J. M., Rabinowitz, W. M. & Rzeczkowski, C. (1984). Standardization of a test of speech perception in noise. Journal of Speech and Hearing Research, 27, 32–48.

Chung, K. (2004). Challenges and recent developments in hearing aids. Part I. Speech understanding in noise, microphone technologies and noise reduction algorithms. Trends in Amplification, 8(3), 83–124.

Cowan, N. (2001). The magical number 4 in short-term memory: A reconsideration of mental storage capacity. Behavioral and Brain Sciences, 24, 87–185.

Daneman, M. & Carpenter, P. A. (1980). Individual differences in inte- grating information between and within sentences. Journal of Experi- mental Psychology: Learning, Memory and Cognition, 9, 561–584.

Daneman, M. & Merikle, P. M. (1996). Working memory and lan- guage comprehension: A meta-analysis. Psychonomic Bulletin and Review, 3(4), 422–433.

Dillon, H. (1996). Compression? Yes, but for low or high frequencies, for low or high intensities, and with what response times? Ear and Hearing, 17, 287–307.

logical Associations.

402 T. Lunner et al. Scand J Psychol 50 (2009)

Dubno, J. R., Dirks, D. D. & Morgan, D. E. (1984). Effects of age and mild hearing loss on speech recognition in noise. Journal of the Acoustical Society of America, 76(1), 87–96.

Engle, R. W., Cantor, J. & Carullo, J. J. (1992). Individual differences in working memory and comprehension: A test of four hypotheses. Journal of Experimental Psychology: Learning, Memory, and Cog- nition, 18, 972–992.

Ephraim, Y. & Malah, D. (1984). Speech enhancement using a mini- mum mean-square error short-time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech and Signal Processing, 32(6), 1109–1121.

Feldman Barrett, L., Tugade, M. M. & Engle, R. W. (2004). Individual differences in working memory capacity and dual-process theories of the mind. Psychological Bulletin, 130(4), 553–573.

Foo, C., Rudner, M., Rönnberg, J. & Lunner, T. (2007). Recognition of speech in noise with new hearing instrument compression release settings requires explicit cognitive storage and processing capacity. Journal of the American Academy of Audiology, 18, 553– 566.

Gatehouse, S., Naylor, G. & Elberling, C. (2003). Benefits from hear- ing aids in relation to the interaction between the user and the envi- ronment. International Journal of Audiology, 42(Suppl. 1), S77– S85.

Gatehouse, S., Naylor, G. & Elberling, C. (2006a). inear and nonlinear hearing aid fittings – 1. Patterns of benefit. International Journal of Audiology, 45, 130–152.

Gatehouse, S., Naylor, G. & Elberling, C. (2006b). Linear and non-lin- ear hearing aid fittings – 2. Patterns of candidature. International Journal of Audiology, 45, 153–171.

Gevins, A., Smith, M. E., McEvoy, L. & Yu, D. (1997). High resolu- tion EEG mapping of cortical activation related to working mem- ory: effects of task difficulty, type of processing, and practice. Cerebral Cortex, 7(4), 374–385.

Halford, G. S., Wilson, W. H. & Phillips, S. (1998). Processing capac- ity defined by relational complexity: Implications for comparative, developmental, and cognitive psychology. Behavioral and Brain Sciences, 21, 803–865.

Hamacher, V., Chalupper, J., Eggers, J., Fischer, E., Kornagel, U., Puder, H. & Rass, U. (2005). Signal processing in high-end hearing aids: State of the art, challenges, and future trends. EURASIP Journal on Applied Signal Processing, 18, 2915–2929.

Holube, I., Hamacher, V. & Wesselkamp, M. (1999). Hearing instru- ments: Noise reduction strategies, in Proceedings of the 18th Dan- avox Symposium: Auditory Models and Non-linear Hearing Instruments, Kolding, Denmark, September.

Just, M. A. & Carpenter, P. A. (1992). A capacity theory of compre- hension - individual differences in working memory. Psychological Review, 99, 122–149.

Kramer, A. F. (1991). Physiological metrics of mental workload: A review of recent progress. In D. L. Damos (Ed.), Multiple-task per- formance (pp. 279–328). London: Taylor & Francis.

Lan, T., Erdogmus, D., Adami, A., Mathan, S. & Pavel, M. (2007). Channel selection and feature projection for cognitive load estima- tion using ambulatory EEG. Computational Intelligence and Neuro- science, Volume 2007, Article ID 74895, 1–12.

Larsby, B., Hällgren, M., Lyxell, B. & Arlinger, S. (2005). Cognitive per- formance and perceived effort in speech processing tasks: Effects of different noise backgrounds in normal-hearing and hearing impaired subjects. International Journal of Audiology, 44(3), 131–143.

Leijon, A. & Stadler, S. (2008). Fast amplitude compression in hear- ing aids improve audibility but degrades speech information trans- mission. Internal report 2008-11; Sound and Image Processing Lab., School of Electrical Engineering, KTH, SE-10044, Stock- holm, Sweden.

� 2009 The Authors. Journal compilation � 2009 The Scandinavian Psycho

Lotter, T. & Vary, P. (2003). Noise reduction by maximum a posteriori spectral amplitude estimation with super Gaussian speech model- ing, in Proceedings of the International Workshop on Acoustic Echo and Noise Control (IWAENC ‘03) (pp. 83–86), Kyoto, Japan, September.

Ludvigsen, C. (1997). Schaltungsanordnung für die automatische Regelung von Hörhilfsgeräten. Europäische Patentschrift, EP 0 732 036 B1.

Lunner, T. (2003). Cognitive function in relation to hearing aid use. International Journal of Audiology, 42(Suppl. 1), S49–S58.

Lunner, T., Hellgren, J., Arlinger, S. & Elberling, C. (1997). A digital filterbank hearing aid: Three DSP algorithms – user preference and performance. Ear and Hearing, 18, 373–387.

Lunner, T. & Sundewall-Thorén, E. (2007). Interactions between cog- nition, compression, and listening conditions: Effects on speech-in- noise performance in a two-channel hearing aid. Journal of the American Academy of Audiology, 18, 539–552.

McCoy, S. L., Tun, P. A., Cox, L. C., Colangelo, M., Stewart, R. A. & Wingfield, A. (2005). Hearing loss and perceptual effort: Down- stream effects on older adults’ memory for speech. Quarterly Jour- nal of Experimental Psychology A, 58, 22–33.

Magnusson, L., Karlsson, M. & Leijon, A. (2001). Predicted and mea- sured speech recognition performance in noise with linear amplifi- cation. Ear and Hearing, 22(1), 46–57.

Martin, R. (2001). Noise power spectral density estimation based on optimal smoothing and minimum statistics. IEEE Transactions on Speech and Audio Processing, 9(5), 504–512.

Martin, R. & Breithaupt, C. (2003). Speech enhancement in the DFT domain using Laplacian speech priors, in Proceedings of the Inter- national Workshop on Acoustic Echo and Noise Control (IWAENC ‘03) (pp. 87–90), Kyoto, Japan, September.

Miyake, A. & Shah, P. (1999). Models of working memory. Cam- bridge: Cambridge University Press.

Moore, B. C. J. (1998). A comparison of four methods of implement- ing automatic gain control (AGC) in hearing aids. British Journal of Audiology, 22, 93–104.

Naylor, G. & Johannesson, R. B. (2009). Long-term Signal-to- Noise Ratio (SNR) at the input and output of amplitude compres- sion systems. Journal of the American Academy of Audiology, 20(3), 161–171.

Pearsons, K. S., Bennett, R. L. & Fidell, S. (1977). Speech levels in various environments (EPA-600/1-77-025). Washington, DC: Envi- ronmental Protection Agency.

Pichora-Fuller, M. K. (2007). Audition and cognition: What audiolo- gists need to know about listening. In C. Palmer & R. Seewald (eds.) Hearing care for adults (pp. 71–85). Stäfa, Switzerland: Pho- nak.

Pichora-Fuller, M. K., Schneider, B. A. & Daneman, M. (1995). How young and old adults listen to and remember speech in noise. Jour- nal of the Acoustical Society of America, 97, 593–608.

Plomp, R. (1988). Auditory handicap of hearing impairment and the limited benefit of hearing aids. Journal of the Acoustical Society of America, 63(2), 533–549.

Plomp, R. (1994). Noise, amplification, and compression: Consider- ations for three main issues in hearing aid design. Ear and Hear- ing, 15, 2–12.

Rabbitt, P. (1968). Channel-capacity, intelligibility and immediate memory. Quarterly Journal of Experimental Psychology, 20, 241– 248.

Rabbitt, P. (1990). Mild hearing loss can cause apparent memory fail- ures which increase with age and reduce with IQ. Acta Oto-laryng- ologica, 111(Suppl. 476), 167–176.

Ricketts, T. A. (2005). Directional hearing aids: Then and now. Jour- nal of Rehabilitation Research and Development, 42 (4), 133–144.

logical Associations.

Scand J Psychol 50 (2009) Cognition and hearing aids 403

Rönnberg, J. (1990). Cognitive and communicative function: The effects of chronological age and ‘‘handicap age’’. European Jour- nal of Cognitive Psychology, 2, 253–273.

Rönnberg, J. (2003). Cognition in the hearing impaired and deaf as a bridge between signal and dialogue: A framework and a model. International Journal of Audiology, 42, S68–S76.

Rönnberg, J., Rudner, M., Foo, C. & Lunner, T. (2008). Cognition counts: A working memory system for ease of language under- standing (ELU). International Journal of Audiology., 47(Suppl. 2), S171–S177.

Rudner, M., Foo, C., Sundewall-Thorén, E., Lunner, T. & Rönnberg, J. (2008). Phonological mismatch and explicit cognitive processing in a sample of 102 hearing aid users. International Journal of Audiol- ogy, 47(Suppl. 2), S163–S170.

Rudner, M., Lunner, T., Behrens, T., Sundewall Thorén, E. & Rönn- berg, J. (2009). Self-rated effort, cognition and aided speech recog- nition in noise. EFAS.

Sarampalis, A., Kalluri, S., Edwards, B. & Hafter, E. (2006). Cognitive effects of noise reduction strategies. International Hearing Aid Research Conference (IHCON), Lake Tahoe, CA, August.

Sarampalis, A., Kalluri, S., Edwards, B. & Hafter, E. (2008). Under- standing speech in noise with hearing loss: measures of effort. International Hearing Aid Research Conference (IHCON), Lake Tahoe, CA, August 13-17.

Sarampalis, A., Kalluri, S., Edwards, B. & Hafter, E. (2009). Objective measures of listening effort: Effects of background noise and noise reduction. Journal of Speech, Language, and Hearing Research doi: 10.1044/1092-4388(2009/08-0111).

Schulte, M., Vormann, M., Wagener, K., Büchler, M., Dillier, N., Dreschler, W., Eneman, K. et al (2009). Listening effort scaling and preference rating for hearing aid evaluation. HearCom Work- shop on Hearing Screening and new Technologies. http:// hearcom.eu/about/DisseminationandExploitation/Workshop.html

Schum, D. J. (2003). Noise-reduction circuitry in hearing aids: (2) Goals and current strategies. The Hearing Journal, 56(6), 32–40.

Schum, D. J., Matthews, L. J. & Lee, F. S. (1991). ctual and predicted word-recognition performance of elderly hearing-impaired listeners. Journal of Speech Hearing Research, 34, 636–642.

Shinn-Cunningham, B. G. (2008). Object-based auditory and visual attention. Trends in Cognitive Sciences, 12(5), 182–186.

Shinn-Cunningham, B. G. & Best, V. (2008). Selective attention in nor- mal and impaired hearing. Trends in Amplification, 12(4), 283–299.

� 2009 The Authors. Journal compilation � 2009 The Scandinavian Psycho

Stone, M. A. & Moore, B. C. J. (2004). Side effects of fast-acting dynamic range compression that affect intelligibility in a compet- ing-speech task. Journal of the Acoustical Society of America, 116(4), 2311–2323.

Stone, M. A. & Moore, B. C. J. (2008). Effects of spectro-temporal modulation changes produced by multi-channel compression on intelligibility in a competing-speech task. Journal of the Acoustical Society of America, 123(2), 1063–1076.

Takeshi, H., Takahiro, M., Yoshihisa, I. & Tetsuya, H. (2003). Musical noise reduction using an adaptive filter. Acoustical Society of Amer- ica Journal, 114(4), 2370–2370.

Tun, P. A. & Wingfield, A. (1999). One voice too many: Adult age differences in language processing with different types of distract- ing sounds. Journals of Gerontology Series B: Psychological Sci- ences and Social Sciences, 54(5), 317–327.

van Boxtel, M. P., van Beijsterveldt, C. E., Houx, P. J., Anteunis, L. J., Metsemakers, J. F. & Jolles, J. (2000). Mild hearing impair- ment can reduce verbal memory performance in a healthy adult population. Journal of Clinical Experimental Neuropsychology, 22, 147–154.

van den Bogaert, T., Doclo, S., Wouters, J. & Moonen, M. (2008). The effect of multimicrophone noise reduction systems on sound source localization by users of binaural hearing aids. Journal Acoustical Society of America, 124(1), 484–97.

Wang, D. L. (2005). On ideal binary mask as the computational goal of auditory scene analysis. In P. Divenyi (Ed.), Speech separation by humans and machines (pp. 181–197). Norwell, MA: Kluwer Academic.

Wang, D. L. (2008). Time-frequency masking for speech separation and its potential for hearing aid design. Trends in Amplification, 12, 332–353.

Wang, D. L., Kjems, U., Pedersen, M. S., Boldt, J. B. & Lunner, T. (2009). Speech intelligibility in background noise with ideal binary time-frequency masking. Journal of the Acoustical Society of Amer- ica, 125(4), 2336–2347.

Wilken, P. & Ma, W. J. (2004). A detection theory account of change detection. Journal of Vision, 4, 1120–1135.

Wingfield, A. & Tun, P. A. (2001). Spoken language comprehension in older adults: Interactions between sensory and cognitive change in normal aging. Seminars in Hearing, 22(3), 287–301.

Received 22 April 2009, accepted 30 April 2009

logical Associations.