On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception.
MedLine Citation:
PMID: 11572372
Owner: NLM
Status: MEDLINE
Abstract/OtherAbstract:
Studies in neurophysiology and in psychophysics provide evidence for the existence of temporal integration mechanisms in the auditory system. These auditory mechanisms may be viewed as "detectors," parametrized by their cutoff frequencies. There is an interest in quantifying those cutoff frequencies by direct psychophysical measurement, in particular for tasks that are related to speech perception. In this study, the inherent difficulties in synthesizing speech signals with prescribed temporal envelope bandwidth at the output of the listener's cochlea have been identified. In order to circumvent these difficulties, a dichotic synthesis technique is suggested with interleaving critical-band envelopes. This technique is capable of producing signals which generate cochlear temporal envelopes with prescribed bandwidth. Moreover, for unsmoothed envelopes, the synthetic signal is perceptually indistinguishable from the original. With this technique established, psychophysical experiments have been conducted to quantify the upper cutoff frequency of the auditory critical-band envelope detectors at threshold, using high-quality, wideband speech signals (bandwidth of 7 kHz) as test stimuli. These experiments show that in order to preserve speech quality (i.e., for inaudible distortions), the minimum bandwidth of the envelope information for a given auditory channel is considerably smaller than a critical-band bandwidth (roughly one-half of one critical band). Difficulties encountered in using the dichotic synthesis technique to measure the cutoff frequencies relevant to intelligibility of speech signals with fair quality levels (e.g., above MOS level 3) are also discussed.
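To give a rough sense of scale for the "one-half of one critical band" result, the short sketch below estimates a critical bandwidth at a few center frequencies and halves it. The abstract does not say which critical-band formula the study used, so the Zwicker-Terhardt analytic approximation is an assumption made here for illustration only. The function names and the `fraction` parameter are likewise hypothetical, and the printed values should not be read as the paper's measured thresholds.

```python
def critical_bandwidth_hz(center_hz: float) -> float:
    """Approximate critical bandwidth in Hz at a given center frequency.

    Assumption: Zwicker-Terhardt analytic approximation,
    CB = 25 + 75 * (1 + 1.4 * f_kHz**2) ** 0.69.
    The abstract does not state which critical-band scale was used.
    """
    f_khz = center_hz / 1000.0
    return 25.0 + 75.0 * (1.0 + 1.4 * f_khz ** 2) ** 0.69


def envelope_cutoff_hz(center_hz: float, fraction: float = 0.5) -> float:
    """Illustrative envelope cutoff: a fraction of the critical bandwidth.

    fraction=0.5 mirrors the abstract's "roughly one-half of one
    critical band"; it is a rough figure, not a fitted parameter.
    """
    return fraction * critical_bandwidth_hz(center_hz)


if __name__ == "__main__":
    # Center frequencies chosen arbitrarily within a 7 kHz speech band.
    for fc in (250.0, 500.0, 1000.0, 2000.0, 4000.0):
        cb = critical_bandwidth_hz(fc)
        cutoff = envelope_cutoff_hz(fc)
        print(f"fc = {fc:6.0f} Hz  CB ~ {cb:6.1f} Hz  half-CB cutoff ~ {cutoff:6.1f} Hz")
```

Under this approximation the critical bandwidth grows with center frequency, so a half-band envelope cutoff would be correspondingly wider in high-frequency channels than in low-frequency ones.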
Authors:
O Ghitza
Publication Detail:
Type: Journal Article
Journal Detail:
Title: The Journal of the Acoustical Society of America
Volume: 110
ISSN: 0001-4966
ISO Abbreviation: J. Acoust. Soc. Am.
Publication Date: 2001 Sep
Date Detail:
Created Date: 2001-09-26
Completed Date: 2001-10-11
Revised Date: 2007-11-15
Medline Journal Info:
Nlm Unique ID: 7503051
Medline TA: J Acoust Soc Am
Country: United States
Other Details:
Languages: eng
Pagination: 1628-40
Citation Subset: IM
Affiliation:
Media Signal Processing Research, Agere Systems, Murray Hill, New Jersey 07974, USA.
MeSH Terms
Descriptor/Qualifier:
Acoustic Stimulation / methods
Auditory Perception*
Cochlea / physiology
Computer Simulation
Functional Laterality
Hair Cells, Auditory, Inner / physiology
Humans
Models, Biological
Psychophysics / methods
Speech Intelligibility
Speech Perception*
Time Factors
From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine