Document Detail


Single band amplitude envelope cues as an aid to speechreading.
MedLine Citation:
PMID:  1775660     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
Amplitude envelopes derived from speech have been shown to facilitate speech-reading to varying degrees, depending on how the envelope signals were extracted and presented and on the amount of training given to the subjects. In this study, three parameters related to envelope extraction and presentation were examined using both easy and difficult sentence materials: (1) the bandwidth and centre frequency of the filtered speech signal used to obtain the envelope; (2) the bandwidth of the envelope signal determined by the lowpass filter cutoff frequency used to "smooth" the envelope fluctuations; and (3) the carrier signal used to convey the envelope cues. Results for normal hearing subjects following a brief visual and auditory-visual familiarization/training period showed that (1) the envelope derived from wideband speech does not provide the greatest benefit to speechreading when compared to envelopes derived from selected octave bands of speech; (2) as the bandwidth centred around the carrier frequency increased from 12.5 to 1600 Hz, auditory-visual (AV) performance obtained with difficult sentence materials improved, especially for envelopes derived from high-frequency speech energy; (3) envelope bandwidths below 25 Hz resulted in AV scores that were sometimes equal to or worse than speechreading alone; (4) for each filtering condition tested, there was at least one bandwidth and carrier condition that produced AV scores that were significantly greater than speechreading alone; (5) low-frequency carriers were better than high-frequency or wideband carriers for envelopes derived from an octave band of speech centred at 500 Hz; and (6) low-frequency carriers were worse than high-frequency or wideband carriers for envelopes derived from an octave band centred at 3150 Hz. These results suggest that amplitude envelope cues can provide a substantial benefit to speechreading for both easy and difficult sentence materials, but that frequency transposition of these signals to regions remote from their "natural" spectral locations may result in reduced performance.
Authors:
K W Grant; L D Braida; R J Renn
Publication Detail:
Type:  Journal Article; Research Support, U.S. Gov't, P.H.S.    
Journal Detail:
Title:  The Quarterly journal of experimental psychology. A, Human experimental psychology     Volume:  43     ISSN:  0272-4987     ISO Abbreviation:  Q J Exp Psychol A     Publication Date:  1991 Aug 
Date Detail:
Created Date:  1992-03-02     Completed Date:  1992-03-02     Revised Date:  2007-11-14    
Medline Journal Info:
Nlm Unique ID:  8107269     Medline TA:  Q J Exp Psychol A     Country:  ENGLAND    
Other Details:
Languages:  eng     Pagination:  621-45     Citation Subset:  IM    
Affiliation:
Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:
Adult
Attention*
Humans
Lipreading*
Perceptual Distortion
Perceptual Masking
Phonetics*
Pitch Perception*
Psychoacoustics
Speech Perception*
Grant Support
ID/Acronym/Agency:
DC 00010/DC/NIDCD NIH HHS; DC 00117/DC/NIDCD NIH HHS

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Formant transition duration and amplitude rise time as cues to the stop/glide distinction.
Next Document:  Crossmodal integration in the identification of consonant segments.