Skip to main navigation Skip to search Skip to main content

Why are listeners hindered by talker variability?

Research output: Contribution to journalReview articlepeer-review

11 Scopus citations

Abstract

Though listeners readily recognize speech from a variety of talkers, accommodating talker variability comes at a cost: Myriad studies have shown that listeners are slower to recognize a spoken word when there is talker variability compared with when talker is held constant. This review focuses on two possible theoretical mechanisms for the emergence of these processing penalties. One view is that multitalker processing costs arise through a resource-demanding talker accommodation process, wherein listeners compare sensory representations against hypothesized perceptual candidates and error signals are used to adjust the acoustic-to-phonetic mapping (an active control process known as contextual tuning). An alternative proposal is that these processing costs arise because talker changes involve salient stimulus-level discontinuities that disrupt auditory attention. Some recent data suggest that multitalker processing costs may be driven by both mechanisms operating over different time scales. Fully evaluating this claim requires a foundational understanding of both talker accommodation and auditory streaming; this article provides a primer on each literature and also reviews several studies that have observed multitalker processing costs. The review closes by underscoring a need for comprehensive theories of speech perception that better integrate auditory attention and by highlighting important considerations for future research in this area.

Original languageEnglish
Pages (from-to)104-121
Number of pages18
JournalPsychonomic Bulletin and Review
Volume31
Issue number1
DOIs
StatePublished - Feb 2024

Keywords

  • Attention
  • Auditory streaming
  • Normalization
  • Speech perception
  • Talker variability

Fingerprint

Dive into the research topics of 'Why are listeners hindered by talker variability?'. Together they form a unique fingerprint.

Cite this