Document Detail


Speech recognition with altered spectral distribution of envelope cues.
MedLine Citation:
PMID:  10491708     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
Recognition of consonants, vowels, and sentences was measured in conditions of reduced spectral resolution and distorted spectral distribution of temporal envelope cues. Speech materials were processed through four bandpass filters (analysis bands), half-wave rectified, and low-pass filtered to extract the temporal envelope from each band. The envelope from each speech band modulated a band-limited noise (carrier bands). Analysis and carrier bands were manipulated independently to alter the spectral distribution of envelope cues. Experiment I demonstrated that the location of the cutoff frequencies defining the bands was not a critical parameter for speech recognition, as long as the analysis and carrier bands were matched in frequency extent. Experiment II demonstrated a dramatic decrease in performance when the analysis and carrier bands did not match in frequency extent, which resulted in a warping of the spectral distribution of envelope cues. Experiment III demonstrated a large decrease in performance when the carrier bands were shifted in frequency, mimicking the basal position of electrodes in a cochlear implant. And experiment IV showed a relatively minor effect of the overlap in the noise carrier bands, simulating the overlap in neural populations responding to adjacent electrodes in a cochlear implant. Overall, these results show that, for four bands, the frequency alignment of the analysis bands and carrier bands is critical for good performance, while the exact frequency divisions and overlap in carrier bands are not as critical.
Authors:
R V Shannon; F G Zeng; J Wygonski
Related Documents :
2331798 - Are specific proteins implicated in the learning process of imprinting?
11538228 - Laboratory studies of the infrared spectral properties of co in astrophysical ices.
8445128 - Derived band auditory brain-stem response estimates of traveling wave velocity in human...
18532118 - Accurate estimation of the duration of tonal signals emitted by marine mammals.
20452128 - The effect of solar cycles on human lifespan in the 50 united states: variation in ligh...
7501378 - Microanalytic acoustical voice characteristics of near-total laryngectomy.
Publication Detail:
Type:  Journal Article; Research Support, U.S. Gov't, P.H.S.    
Journal Detail:
Title:  The Journal of the Acoustical Society of America     Volume:  104     ISSN:  0001-4966     ISO Abbreviation:  J. Acoust. Soc. Am.     Publication Date:  1998 Oct 
Date Detail:
Created Date:  1999-10-28     Completed Date:  1999-10-28     Revised Date:  2007-11-14    
Medline Journal Info:
Nlm Unique ID:  7503051     Medline TA:  J Acoust Soc Am     Country:  UNITED STATES    
Other Details:
Languages:  eng     Pagination:  2467-76     Citation Subset:  IM    
Affiliation:
House Ear Institute, Los Angeles, California 90057, USA. Shannon@hei.org
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:
Adult
Attention*
Cochlear Implants
Female
Humans
Male
Middle Aged
Perceptual Distortion*
Phonetics
Pitch Perception
Prosthesis Design
Reference Values
Semantics
Sound Spectrography*
Speech Acoustics
Speech Perception*
Time Perception
Grant Support
ID/Acronym/Agency:
DC01526/DC/NIDCD NIH HHS

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Syllabic strength and lexical boundary decisions in the perception of hypokinetic dysarthric speech.
Next Document:  Temporal and spatio-temporal vibrotactile displays for voice fundamental frequency: an initial evalu...