Document Detail


On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception.
MedLine Citation:
PMID:  11572372     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
Studies in neurophysiology and in psychophysics provide evidence for the existence of temporal integration mechanisms in the auditory system. These auditory mechanisms may be viewed as "detectors," parametrized by their cutoff frequencies. There is an interest in quantifying those cutoff frequencies by direct psychophysical measurement, in particular for tasks that are related to speech perception. In this study, the inherent difficulties in synthesizing speech signals with prescribed temporal envelope bandwidth at the output of the listener's cochlea have been identified. In order to circumvent these difficulties, a dichotic synthesis technique is suggested with interleaving critical-band envelopes. This technique is capable of producing signals which generate cochlear temporal envelopes with prescribed bandwidth. Moreover, for unsmoothed envelopes, the synthetic signal is perceptually indistinguishable from the original. With this technique established, psychophysical experiments have been conducted to quantify the upper cutoff frequency of the auditory critical-band envelope detectors at threshold, using high-quality, wideband speech signals (bandwidth of 7 kHz) as test stimuli. These experiments show that in order to preserve speech quality (i.e., for inaudible distortions), the minimum bandwidth of the envelope information for a given auditory channel is considerably smaller than a critical-band bandwidth (roughly one-half of one critical band). Difficulties encountered in using the dichotic synthesis technique to measure the cutoff frequencies relevant to intelligibility of speech signals with fair quality levels (e.g., above MOS level 3) are also discussed.
Authors:
O Ghitza
Publication Detail:
Type:  Journal Article    
Journal Detail:
Title:  The Journal of the Acoustical Society of America     Volume:  110     ISSN:  0001-4966     ISO Abbreviation:  J. Acoust. Soc. Am.     Publication Date:  2001 Sep 
Date Detail:
Created Date:  2001-09-26     Completed Date:  2001-10-11     Revised Date:  2007-11-15    
Medline Journal Info:
Nlm Unique ID:  7503051     Medline TA:  J Acoust Soc Am     Country:  United States    
Other Details:
Languages:  eng     Pagination:  1628-40     Citation Subset:  IM    
Affiliation:
Media Signal Processing Research, Agere Systems, Murray Hill, New Jersey 07974, USA.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:
Acoustic Stimulation / methods
Auditory Perception*
Cochlea / physiology
Computer Simulation
Functional Laterality
Hair Cells, Auditory, Inner / physiology
Humans
Models, Biological
Psychophysics / methods
Speech Intelligibility
Speech Perception*
Time Factors

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Minimum spectral contrast needed for vowel identification by normal hearing and cochlear implant lis...
Next Document:  A probabilistic union model with automatic order selection for noisy speech recognition.