Document Detail

Distortion product otoacoustic emissions evoked by tone complexes.
Jump to Full Text
MedLine Citation:
PMID:  20838846     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
Distortion product otoacoustic emissions (DPOAEs) are traditionally evoked by two-tone stimuli. In this study, emission data from Mongolian gerbils are reported that were obtained with stimuli consisting of six to 10 tones. The stimuli were constructed by replacing one of the tones of a tone pair by a narrowband multitone complex. This produced rich spectra of the ear canal sound pressure in which many of the third-order DPOAEs originated from the interaction of triplets of stimulus components. A careful choice of the stimulus frequencies ensured that none of these DPOAE components coincided. Three groups of DPOAEs are reported, two of which are closely related to DPOAEs evoked by tone pairs. The third group has no two-tone equivalent and only arises when using a multitone stimulus. We analyzed the relation between multitone-evoked DPOAEs and DPOAEs evoked by tone pairs, and explored the new degrees of freedom offered by the multitone paradigm.
Authors:
Sebastiaan W F Meenderink; Marcel van der Heijden
Related Documents :
9390786 - Nonlinearity of mechanoelectrical transduction of outer hair cells as the source of non...
8977146 - Development of cochlear active mechanisms in humans differs between gender.
15807046 - Sensitivity of distortion product otoacoustic emissions in noise-exposed chinchillas.
17736496 - Quasars: millisecond-of-arc structure revealed by very-long-baseline interferometry.
10655056 - Mutation of a nuclear receptor gene, nr2e3, causes enhanced s cone syndrome, a disorder...
1861616 - Availability and category-frequency estimation.
Publication Detail:
Type:  Journal Article; Research Support, Non-U.S. Gov't     Date:  2010-09-14
Journal Detail:
Title:  Journal of the Association for Research in Otolaryngology : JARO     Volume:  12     ISSN:  1438-7573     ISO Abbreviation:  J. Assoc. Res. Otolaryngol.     Publication Date:  2011 Feb 
Date Detail:
Created Date:  2011-01-05     Completed Date:  2011-05-04     Revised Date:  2011-07-25    
Medline Journal Info:
Nlm Unique ID:  100892857     Medline TA:  J Assoc Res Otolaryngol     Country:  United States    
Other Details:
Languages:  eng     Pagination:  29-44     Citation Subset:  IM    
Affiliation:
Department of Neuroscience, Erasmus MC, Dr. Molewaterplein 50, Rotterdam, 3015 GE, The Netherlands.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:
Acoustic Stimulation
Animals
Cochlea / physiology*
Cochlear Microphonic Potentials
Female
Gerbillinae
Male
Models, Biological*
Sound*
Comments/Corrections

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Full Text
Journal Information
Journal ID (nlm-ta): J Assoc Res Otolaryngol
ISSN: 1525-3961
ISSN: 1438-7573
Publisher: Springer-Verlag, New York
Article Information
Download PDF
© The Author(s) 2010
Received Day: 22 Month: 6 Year: 2010
Accepted Day: 11 Month: 8 Year: 2010
Electronic publication date: Day: 14 Month: 9 Year: 2010
pmc-release publication date: Day: 14 Month: 9 Year: 2010
Print publication date: Month: 2 Year: 2011
Volume: 12 Issue: 1
First Page: 29 Last Page: 44
ID: 3015028
PubMed Id: 20838846
Publisher Id: 233
DOI: 10.1007/s10162-010-0233-4

Distortion Product Otoacoustic Emissions Evoked by Tone Complexes
Sebastiaan W. F. MeenderinkAff1
Marcel van der HeijdenAff1 Address: +31-10-7043558 +31-10-7044734 http://neuro.nl/research
Department of Neuroscience, Erasmus MC, Dr. Molewaterplein 50, Rotterdam, 3015 GE The Netherlands

Introduction

To cope with a dynamic range exceeding 100 dB, several stages of auditory processing employ compressive transformations. In the cochlea, for example, the growth rate of mechanical responses to pure tones can be as low as 0.2 dB per decibel increase of the stimulus intensity (Rhode 2007). This mechanical nonlinear transformation has side effects. The incoming signals are distorted and additional, nonstimulus components are produced.

When the stimulus consists of two pure tones having frequencies f1 and f2, their intermodulation results in multiple distortion products (DPs) of which the third-order (“cubic”) difference tones (CDTs) at frequencies 2f1 − f2 and 2f2 − f1 are best known. Distortion products are often audible; reports of these “subjective tones” date back several centuries (see Plomp 1965, for a historical perspective). Their audibility suggests that they are associated with propagating components in the inner ear, which is supported by psychophysical (Smoorenburg 1972) and electrophysiological studies (Goldstein and Kiang 1968; Kim et al. 1980).

Under favorable stimulus conditions, DPs return to the ear canal, where they can be recorded with a sensitive microphone. Reports of acoustic byproducts of nonlinear auditory processing date back many decades (Wever and Lawrence, 1954; von Békésy, 1960), but their systematic study started with the pioneering work on otoacoustic emissions by Kemp and colleagues (reviewed in Kemp 2008). Distortion products measured acoustically in the ear canal are called distortion-product otoacoustic emissions.

Virtually all studies of DPOAEs employ two stimulus tones to evoke them. The few studies that did address three-tone DPOAEs merely pointed out their occurrence (Kemp and Brown 1986; Kemp 1998), or analyzed them in the narrow context of low-frequency biasing (Bian and Scherrer 2007; Marquardt et al. 2007) or time-varying suppression (Meenderink and van der Heijden 2010). It occurred to us that there is no a priori reason for such constraints, and that the systematic study of multitone (>2) emissions may enlarge the scope of DPOAE research beyond what is possible with tone pairs. Mathematically, third-order distortions arise from those terms in the Fourier expansion of the distorted stimulus that involve the interaction of three stimulus components (Schroeder 1975). If f, g, and h are stimulus frequencies, such third-order terms produce distortions at frequencies ± f ± g ± h. Since f, g, and h need not be all different, these third-order terms include harmonics 3f, (i.e., f = g = h), as well as CDTs at 2f − g (for which f = h). Thus, third-order DPs evoked by tone pairs are a limited subset of the more general case of interacting triplets of tones.

This study explores the generation of third-order DPOAEs by stimuli that contain more than two frequencies. We replaced one of the primaries of a tone pair by a narrowband tone complex, resulting in a stimulus that contained multiple tone triplets at once. The number of third-order DPs grows as the third power of the number of primary components, and it can easily happen that several DPOAE components coincide. We avoided such coincidences by a careful choice of the primary frequencies. The resulting uniqueness of third-order DPOAE frequencies greatly facilitated their analysis. We observed three groups of DPOAEs evoked by the new stimulus paradigm, two of which are closely related to DPOAEs evoked by tone pairs. The third group does not have a two-tone equivalent. We analyzed the relation between multitone-evoked DPOAEs and DPOAEs evoked by tone pairs, and explored the new degrees of freedom offered by the multitone paradigm.


Materials and Methods
Animal Preparation

Recordings were made from adult Mongolian gerbils (Meriones unguiculatus; three males and 12 females; 21 ears; body weight, 46–66 g). Animals were anesthetized by intraperitoneal injection of ketamine/xylazine solution (effective dosage, 80 and 12 μg/g body weight, respectively). Supplementary anesthesia was administered subcutaneously at ∼1/3 of the initial dose in 1-h intervals. A small metal rod was attached to the dorsal surface of the skull and was used to fix the head of the animal. Body temperature was maintained at 37°C using a thermocontrolled heating pad. The pinna and cartilaginous ear canal were removed and a custom-built probe was sealed over the bony ear canal using Vaseline. In eight of the animals a small hole was drilled in the wall of the inferior posterior mastoid chamber of the ipsilateral bulla to prevent pressure buildup in the middle ear cavity. We did not notice apparent differences between recordings with or without such venting, and no distinction between the data from these two conditions is made. Animal procedures were in accordance with guidelines provided by the animal committee of the Erasmus MC.

Stimuli

To evoke otoacoustic emissions, acoustic stimuli that consisted of multiple frequency components (each with a random starting phase) were used. For convenience of description, the components are separated into two constituents: (1) a single frequency component and (2) a tone complex consisting of M tones. In this study, M varied between 5 and 9. The frequency of the single component will be denoted by f; the frequencies of the tone complex by g1, g2...gM. The stimulus frequencies were chosen such that all possible difference and sum frequencies were unique (Victor et al. 1977; van der Heijden and Joris 2003; 2006). Following van der Heijden and Joris (2003), we will refer to this stimulus property as “zwuis”. Nonlinear interaction between any combinations of the stimulus components can give rise to third-order DPOAEs, and different groups of third-order DPOAEs will be identified and analyzed in Results.

The primary frequencies were further restricted as described in Table 1 (the terminology used in this table anticipates the naming of the DPOAE groups described in Results). This restriction served to avoid the coincidence of DP components across groups. Combined, the restrictions in the choice of stimulus frequencies (“zwuis” and “periodicity”) ensure that all third-order DPOAEs of the type gi ± gj ± f are unique and never equal to a primary component. Consequently, based on its frequency, each DPOAE component can be unambiguously identified with the unique triplet (f,gi,gj) of stimulus components whose nonlinear interaction produced it.

The tone complex and the single frequency component were presented over separate D/A channels (Tucker-Davis Technologies RP2.1). To minimize transients, stimuli were gated using a raised cosine window (10–90% in 10 ms). The output of each channel was fed through a stereo power amplifier (TDT SA1) and broadcast from a separate driver (2× TDT CF1 or 2× Visaton FRS-7). These drivers where connected to the recording probe by means of plastic tubes. The correct sound pressure levels were attained by calibrating the drivers in situ while taking the probe transfer characteristics into account. The recording probe contained a 1/2″ pressure-microphone (GRAS 40AG) that was used to record the ear canal sound pressure synchronously with stimulus delivery. The recorded signal was band-pass filtered (0.02–100 kHz, NEXUS 2690), acquired via an A/D channel (TDT RP2.1), and stored on computer disk for offline analysis. Signal generation and data acquisition were both done at a rate of 48.8 kHz.

Data Analysis

All stimulus frequencies were chosen such that an integer number of sample points held an even number of periods for these tones (see Table 1). As a consequence, these periodic segments (typically 97,656 samples, i.e., ∼2 s) also held an even number of periods for all third-order DPOAEs. This allowed for averaging of a single, continuous recording by breaking it down into its periodic segments, and eliminated any spectral leakage when extracting frequency components using Fourier analysis (Papoulis 1962).

After excluding the first and last periodic segment of each recording (to exclude the ramps and transient phenomena), the remaining signal was averaged over the periodic segments and the magnitude and phase spectra were calculated via Fourier analysis. No form of artifact rejection was used. The stimulus design resulted in the even components of these spectra to be commensurate with the stimulus (and any DPOAEs evoked by it). The odd components of the spectra, which are not commensurate with the stimulus, were used to estimate the noise floor. Additional recordings were made in an artificial ear to check for distortion in the hardware. For the stimuli used in this study, these system distortions were well below the DPOAEs, often not exceeding the noise floor.

The phase of each DPOAE component was expressed in cycles re. the phase of the stimulus components producing it as determined from the microphone signal. Thus, the phase [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {\Phi_{\rm{dp}}}(f,{g_i},{g_j}) $$\end{document}] of a DP component having frequency fdp = gi ± gj ± f becomes

[Formula ID: Equ1]
[\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {\Phi_{\rm{dp}}}(f,{g_i},{g_j}) = {\varphi_{\rm{dp}}} - ({\varphi_i}\pm {\varphi_j}\pm {\varphi_f}) $$ \end{document}]
where [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {\varphi_{...}} $$\end{document}] denotes the phase values extracted from the Fourier spectrum of the recording.

The relation between DPOAE phase and stimulus frequency was analyzed by fitting the phase data using two linear models. Any DPOAE components not exceeding the noise floor were excluded from the fitting procedure. Weight factors were assigned to the remaining DPOAE components to prevent the phases of weak components (close to the noise floor) from dominating the data fits. The weight factor W was

[Formula ID: Equ2]
[\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ W = 1/{\sigma^2} $$\end{document}]
where σ2 is the variance in amplitude of a tonal component caused by the addition of noise. This variance was evaluated using numerical simulations of the effect of adding random components (with phase drawn from a uniform distribution, and amplitude drawn from a Rayleigh distribution) on the phase of single spectral components. The simulations yielded a polynomial description of the variance σ2
[Formula ID: Equ3]
[\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \sigma ^{2} = 23.6*{\text{NSR}}^{{\text{6}}} - 57.1*{\text{NSR}}^{{\text{5}}} - 52.4*{\text{NSR}}^{{\text{4}}} \,\, - 20.8*{\text{NSR}}^{{\text{3}}} + 4.06*{\text{NSR}}^{2} + 5.88*{\text{NSR}} $$\end{document}]
where NSR denotes the reciprocal of the signal-to-noise amplitude ratio.

The first model was a straight line of DPOAE phase Φdp against DPOAE frequency fdp

[Formula ID: Equ4]
[\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {\Phi_{\rm{dp}}} = {\Phi_0} - \tau {f_{\rm{dp}}} $$\end{document}]

The phase offset Φ0 and group delay τ were estimated by minimizing the weighted sum of squared differences between data and model,

[Formula ID: Equ5]
[\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \sum\limits_k {{W_k}{{(\Phi_k^{\rm{model}} - \Phi_k^{\rm{data}})}^2}} $$\end{document}]

The second model, which was used in the case of two-dimensional representations of the DPOAE phase data (see Fig. 5), is given by

[Formula ID: Equ6]
[\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {\Phi_{\rm{dp}}}(f,{g_i},{g_j}) = {\Phi_0} + {\tau_{ + }}({g_i} + {g_j}) + {\tau_{-} }({g_i} - {g_j}) $$\end{document}]

Notice that Eq. 6 does not explicitly include a term for the single stimulus frequency f, because f was not varied within data sets that were fitted. The phase offset Φ0 and group delays τ+ and τ were estimated by minimizing the weighted sum of squared residuals (Eq. 5). Group delays will be reported by their mean ±95% confidence interval. All stimulus generation, data acquisition, and offline analysis were done via custom software in MATLAB.


Results

Figure 1 shows amplitude spectra of the ear canal sound pressure when either the lower primary (A) or the upper primary (B) consisted of the tone complex. Nonlinear interaction of the single stimulus tone with one or two components from the tone complex produced three readily identified groups of third-order DPOAEs, with each DP frequency resulting from a unique set of three stimulus frequencies. The DPOAEs are of biological origin: they exceed the estimated noise floor as well as the system distortion.

The three groups of third-order DPOAEs evoked by a stimulus that consists of a single primary at frequency f and a tone complex having M components at frequencies g1, g2gM, are:

  1. The “far” group: M DPOAEs having frequencies 2f − gi, where i = 1…M; (left-pointing triangles in Fig. 1).
  2. The “near” group: 1/2M(M + 1) DPOAEs having frequencies gi + gj − f, where i,j = 1…M; ij (circles for i = j, and right-pointing triangles when i < j in Fig. 1).
  3. The “sideband” group: M(M−1) DPOAEs having frequencies f + gi − gj, with i,j = 1…M; ij (diamonds in Fig. 1).

Other third-order DPOAEs also arise from such a stimulus (e.g., 3f, gi + gj − gk), but we restrict our analysis to those components that involve both the single stimulus component f and one or two components from the tone complex g. In the next sections, each of the three groups will be considered.

The Far Group

In the frequency spectrum, these DPOAEs (left-pointing triangles in Fig. 1) occur at the “far side” of the tone complex (at the opposite side of the single tone as the tone complex). Using a tone complex having M components results in a far group that consists of M DPOAEs, at frequencies ffar(i) = 2fgi, i = 1…M. Each DPOAE thus results from the interaction of only two stimulus components, the single-tone primary and one frequency component from the tone complex. This is analogous to DPOAEs evoked by two-tone stimuli, but instead of sweeping one primary across frequencies to evoke subsequent DPOAEs, all the different stimulus frequencies are here presented simultaneously as a tone complex. More specifically, the far group resembles 2f1 − f2 DPOAEs when f2 is replaced by the tone complex (f < g; Fig. 1B), or 2f2 − f1 DPOAEs when f1 is replaced by the tone complex (f > g; Fig. 1A).

We tested the similarity between the far group and DPOAEs evoked by two-tone stimuli by also recording the latter emissions using either a fixed-f2 or a fixed-f1 paradigm (i.e., one stimulus tone was fixed in frequency, while the other was varied across recordings). DPOAEs from the two-tone recordings were then compared to the far group obtained from a single recording in which all of the stimulus components were presented at once. As an example, Figure 2A shows the superimposed amplitude spectra of 10 recordings. In nine of these recordings, two-tone stimuli (light gray) were used, in which the upper stimulus component f2 was fixed, while the frequency of the lower component f1 was varied across recordings (fixed-f2 paradigm). In the 10th recording, all stimulus frequency components were presented simultaneously (dark gray). The intensity of each component in the tone complex was reduced relative to the f1 level in the two-tone paradigm in such a way that the total power of the tone complex equaled the power of each single f1 component in the fixed-f2 paradigm. The amplitude and phase data for the DPOAEs from these recordings are shown in more detail in Figure 2B, C, respectively (triangles for far group, circles for DPOAEs from consecutive two-tone stimuli). Notice that, although the amplitude of each DPOAE component in the far group is smaller than that resulting from the two-tone stimulus, their total power (horizontal dashed line in Fig. 2B) is nearly identical to the power of each of the latter DPOAEs. The phase data exhibit very similar group delays, but show a small vertical offset (∼0.02 cycles) between the two types of recordings. Such an offset was, however, not observed systematically across recordings. Comparisons between the far group and two-tone DPOAEs from similar recordings in four animals are made in Figure 3 (triangles) as scatter plots for the group delay (A) and DPOAE power (B). Depending on the relative primary frequencies, the far group resembled either the 2f1f2 (when f < g) or the 2f2f1 (when f > g) DPOAEs. There appears to be no need to distinguish between these two situations.

The general observation is that the far group is practically identical to DPOAEs evoked by consecutive two-tone stimuli. Thus, for this group of DPOAEs there is little difference between sweeping one primary across a set of frequencies (fixed-f1 or fixed-f2 paradigm) and presenting the whole set together as a tone complex.

The Near Group

Spectrally, this group of emissions occurs at the side of the tone complex opposite of the single stimulus tone. In analogy with the far group we will refer to it as the “near group.” The frequencies of this group are given by [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {f_{\rm{near}}}\left( {i,j} \right) = {g_i} + {g_j} - f $$\end{document}] . Figure 4 gives an example for which f > g (see Fig. 1B for an example with f < g).

This DPOAE group contains more frequency components than there are stimulus components. In the example of Figure 4, a tone complex that consisted of nine components evoked a total of 36 unique DPOAE components in the near group. Also, the near group appears to be composed of two subsets that systematically differ in their amplitudes by ∼6 dB (compare close circle and triangles in Fig. 4A).

Both the number of DPOAE components and the apparent existence of two subsets can be explained by realizing that the generation of each DPOAE component involves three stimulus components: the single stimulus component f, and two components (gi, gj) from the tone complex. The single frequency component f is constant within the near group, so all possible combinations are represented by the square matrix (gi, gj). Graphically, these combinations (gi, gj) are given by the lattice points in Figure 5A. The subset of M combinations for which i = j produce DPOAE components (circles) involving only one frequency component from the tone complex. This subgroup is the “mirror image” of the far group of DPOAEs. The remaining 1/2M(M−1) combinations for which i ≠ j (triangles in Fig. 5A) produce the additional frequency components of the near group of DPOAEs. Their generation involves two different components from the tone complex. Notice that not all possible combinations result in a unique DPOAE component. Since gi + gj = gj + gi, only half of all i ≠ j combinations result in a unique frequency. In combinatorial terms, both gi + gj and gj + gi contribute to the same DPOAE component. This combinatorial effect corresponds to the coefficient 2 of the xy term in the binomial expansion of (x+y)2, and causes the DPOAEs for which i ≠ j to have their amplitudes doubled (+6 dB) re. the amplitude of DPOAEs for which i = j. Apart from this combinatorial factor 2 in amplitude, there is no fundamental difference between the two groups, as is shown by the phase data (Fig. 4B) in which all near components are well described by a single function of DPOAE frequency.

To investigate the analogy between the near group and DPOAEs evoked by two-tone stimuli, recordings identical to those described for Figure 2 were made: in a series of two-tone recordings, DPOAEs were obtained using either a fixed-f2 or a fixed-f1 paradigm. These two-tone data were compared with a single recording in which all stimulus components were presented at once. Results are given in Figure 3 (circles) together with the data for the far group. As before, the grouping of all data around the diagonals suggests that the DPOAEs evoked by the two stimulus protocols are essentially the same.

The similarity between the DPOAEs obtained using a tone complex or a series of two-tone recordings again shows that there are no essential differences between DPOAEs recorded sequentially and simultaneously. They are generated by the same nonlinear mechanism, and there are no indications that they propagate along different cochlear paths.

Given the close similarity between the near and far groups and consecutive two-tone stimuli, the multitone data may be considered as 2f1 − f2 or 2f2 − f1 DPOAEs as if recorded using a fixed-f1 or a fixed-f2 paradigm. Specifically, the near group resembles either 2f1 − f2’s with f2 fixed (f > g) or 2f2 − f1’s with f1 fixed (f < g), while the far group corresponds to either 2f2 − f1’s with f2 fixed (f > g) or 2f1 − f2’s with f1 fixed (f < g). Using this interpretation of the multitone data, DPOAE group delays were calculated for these four different {DPOAE, stimulus} configurations (Fig. 6A, B). For the “2f1 − f2” DPOAEs, the fixed-f1 paradigm yielded systematically larger group delays than the fixed-f2 paradigm. A comparison of the former group delays with 2f1f2 group delays from two-tone data of the gerbil (adapted from Faulstich and Kössl 2000), shows that the absolute group delays as well as the trend for these delays to decrease with increasing frequency are very similar, confirming the similarity between the two-tone and multitone DPOAEs. In contrast, group delays for the “2f2 − f1” DPOAEs were similar for the fixed-f1 and fixed-f2 paradigm, and are comparable to the {“2f1 − f2”/fixed-f2} group delays.

For the near group of DPOAEs, the zwuis character of the stimulus becomes indispensable (see Materials and Methods). It ensures that all possible combinations gi + gj are unique, so that each DPOAE frequency gi + gjf can be unambiguously attributed to a unique pair (gi, gj) of stimulus components from the tone complex. This is illustrated in Figure 5B, in which the (gi,gj)-matrix of Figure 5A is transformed (rotated and translated) into the equivalent matrix ([gi + gjf],[gigj]), where gi + gj − f equals the DPOAE frequency. The zwuis character of the stimulus ensures that none of the symbols are vertically aligned: projecting them on the abscissa (i.e., DPOAE frequency) will never result in two coinciding points. This is necessary to disentangle the different contributions of the stimulus components to the individual DPOAEs.

The wealth of components in the near group offers an extra “degree of freedom” in the analysis of phase data. The presence of i ≠ j components enables an analysis of DPOAE phase that extends beyond the dependence on DPOAE frequency analyzed thus far. The additional degree of freedom is illustrated by representing the DPOAE components of the near group in the two-dimensional format of Figure 5B. In this representation, the dependence of DPOAE phase on DPOAE frequency corresponds to variation along horizontal lines (i.e., parallel to the abscissa). In contrast, variation along vertical lines indicates a dependence on stimulus frequencies in a way that does not correspond to changes in emission frequency.

Figure 7A shows the phase data from Figure 4B in the two-dimensional representation in the form of a contour plot. It is important to keep in mind that the phase data shown here were obtained from a single recording. In discussing them, it is hard to avoid phrases like “variations in stimulus frequencies”, which might suggest the use of frequency sweeps. In reality, all stimulus components were presented together, and the “variations” refer to comparisons across DPOAE components produced by different triplets drawn from the M + 1 stimulus components.

The contours in Figure 7A are iso-phase lines; the maximum variation of DPOAE phase occurs along the direction perpendicular to them. The vertical orientation of the contours thus indicates that DPOAE phase changes with variation along the abscissa. Thus, DPOAE phase only depends on DPOAE frequency. In contrast, variations in stimulus frequencies (gi, gj) that keep the DPOAE frequency fixed (variations along the ordinate) have no systematic effect on DPOAE phase. This observation is quantified by fitting the two-dimensional phase data with the model given in Eq. 6 (see Materials and Methods), which yielded τ+ = 665 ± 16 μs, and τ = 16 ± 26 μs (Fig. 7B). Notice that τ+ is the coefficient describing the phase variations with DPOAE frequency, i.e., the slope of the frequency-phase plot of Figure 4B. As explained above, τ+ matches the group delay obtained with a two-tone stimulation paradigm. In contrast, τ is the coefficient for those variations in the frequencies of the tone complex g that keep the (gi + gj), and thus DPOAE frequency, fixed.

Figure 7C is a scatter plot of similarly calculated group delays for 998 recordings (830 when f > g; 168 when f < g) from 15 animals. These data show that DPOAE phase for the near group is well described by their variation with DPOAE frequency. This holds true for both stimulus configurations f > g and f < g. Variations in stimulus frequency that do not correspond with changes in DPOAE frequency (i.e., varying gi − gj while fixing gi + gj) have no explanatory contribution to the observed DPOAE phase. This observation is consistent with the good fit of the straight line to the phase-versus-frequency data (Fig. 4B), which leaves little residual variance to be accounted for.

In summary, the near group results from the combination of either two or three stimulus components. The “combinatorial effects” of these components explain both the number of DPOAE components and the emergence of two subgroups whose magnitudes are 6 dB apart. There is little difference between sweeping one primary across a set of frequencies and presenting the whole set together as a tone complex. In this respect the near group is similar to the far group. The “off-diagonal” (i ≠ j) components permit an analysis of the DPOAE phase that goes beyond their dependence on DPOAE frequency. This analysis shows that, within the range of frequencies tested, the phase of DPOAE components only varies with those variations in stimulus frequencies that correspond to changes in the DPOAE frequency.

The “Sideband Group”

The third and final group of DPOAEs does not have a two-tone equivalent; it only arises when the stimulus consists of more than two frequency components. It consists of a set of sidebands around the single stimulus component f (e.g., diamonds in Fig. 1A, B) at frequencies f + gi − gj, with i ≠ j.

As before, the origin of the DPOAE components (in terms of the stimulus components from the tone complex) is illustrated by considering the matrix (gi,gj) (Fig. 5C). Each i ≠ j combination results in a DPOAE component, and the zwuis character of the stimulus ensures that these components are all unique (see Fig. 5D). Because (gi − gj) ≠ (gj − gi), the combinatorial effect described for the near group does not apply here: no subgroups of DPOAEs arise. DPOAE components for which i = j coincide with the stimulus component f, and cannot be resolved in the spectra of the recorded signals. Thus an M-component tone complex results in M2M = M(M−1) different DPOAE components within the sideband group.

Since the DPOAE frequencies of the sideband group depend on the frequency difference gi − gj, a collective frequency shift of the entire tone complex g does not affect the frequencies of the DPOAEs. Recently, we used this property to study reverse intracochlear propagation of otoacoustic emissions in the gerbil ear (Meenderink and van der Heijden 2010). In the phase analysis of the sideband group, we distinguish the two cases f > g (single tone above tone complex) and f < g (single tone below tone complex).

DPOAE Phase; f > g

As an example, Figure 8A shows phase data for the sideband group of DPOAEs as a function of DPOAE frequency. These data correspond to the amplitude spectrum in Figure 4A, and were obtained using a stimulus for which f > g. A straight line (Eq. 4) fitted to these data yields a group delay of 536 ± 13 μs. As was the case for the near group, the phase data can be rearranged in a two-dimensional format (Fig. 8B) with DPOAE frequency along the abscissa and non-DPOAE frequency variations represented along the ordinate. Similar to the near group, the contour lines are vertical, indicating that the variation in DPOAE phase only occurs with those changes in stimulus frequency that cause the DPOAE frequency to change. Fitting Eq. 6 to these data (Fig. 8C) yielded τ+ = 22 ± 14 μs and τ = 536 ± 13 μs. Note that compared to the near group, the roles of τ+ and τ are reversed: this time it is the τ coefficient that represents the group delay along the DPOAE frequency axis. A scatter plot (Fig. 8D) of group delays for phase data obtained from all 15 animals (N = 1,065) generalizes this observation. DPOAE phase varies systematically with DPOAE frequency (captured in τ), whereas non-DPOAE-related frequency variations (quantified by τ+) do not improve the explanatory power of the linear model Eq. 6 for stimulus conditions with f > g.

DPOAE Phase; f < g

From the DPOAE amplitude spectra, no differences are apparent between the sideband DPOAEs obtained with f > g (Fig. 1A, diamonds) versus those evoked with f < g (Fig. 1B, close diamonds). The phase data, however, do reveal a contrast between the two stimulus configurations. As an example, Figure 9A, B show amplitude and phase spectra obtained with f < g. As before, the phase data change with DPOAE frequency, and this trend is largely captured by a straight line fit to these data. On closer inspection, however, the phase also show a systematic patterning along near-vertical lines; a trend that cannot be explained by their dependence on DPOAE frequency alone. Rearranging the phase data in the two-dimensional format as described earlier (see Fig. 5D) clarifies the nature of the dependence (Fig. 9C). This time, the contours are not vertical, but tilted clockwise. The tilting indicates that, although a major portion of the phase variation is explained by the DPOAE frequency, phase is also affected by the stimulus frequencies in a different way. This is quantified by fitting Eq. 6 to these data, which yielded τ+ = −113 ± 20 μs and τ = 642 ± 19 μs. The resulting plot (Fig. 9D) clearly shows non-vertical contours that reflect the nonzero group delay τ+. Analysis of a total of 298 recordings obtained in 13 gerbils generalizes this observation. Unlike the corresponding analysis for the f > g case (Fig. 8D), the scatter plot of the group delays τ+ and τ (Fig. 9E) is no longer distributed around the line τ+ = 0. Instead, it shows a systematic trend towards negative group delays for the non-DPOAE-related frequency components (τ+ < 0). In this respect, these DPOAEs (i.e., sideband group obtained with f < g) are different from all other groups, including the sideband DPOAEs obtained with f > g.

Theoretical Analysis of the Stimulus Dependence of DPOAE Phase: Residual Phase Effects

The multitone stimuli provided an extra degree of freedom in the analysis of DPOAE phase beyond the straightforward, one-dimensional graphs of DPOAE phase versus DPOAE frequency. In the two-dimensional phase plots (Figs. 79), the phase gradients in the horizontal direction correspond to the straightforward dependence on DPOAE frequency, which is the dominant effect in all of the phase data of the near and sideband groups. The two-dimensional phase data, however, also allowed the assessment of phase gradients in the vertical direction, which may be interpreted as data obtained with a “fixed-fdp” recording paradigm. These gradients allowed us to assess whether DPOAE phase changes systematically when the stimulus frequencies (gi, gj) are “varied” in a way that keeps the DPOAE frequency fixed. (They are not really varied, because the various frequencies g1gM are all presented simultaneously. For ease of explanation, however, we will discuss the data as if collected while varying (gi,gj) according to a fixed-fdp sweep.)

Because the dependence on DPOAE frequency (i.e., the phase gradient along the horizontal direction) is the main effect in our data (often it is also the only dependence observed), we will term the second type of DPOAE phase dependency the residual phase dependency. It is an interesting type of dependence, because travel times common to all components (stimulus and DPOAEs alike) do not contribute to it; they are absorbed into the main effect. As explained below, this makes the residual phase dependence a sensitive probe of the details of DPOAE generation.

Consider a generic model of DPOAE generation, in which the phase of a stimulus component f at a certain location x the cochlea is described by a function Φ(f,x), and the generation site of a third-order DP component fdp is XG(f1,f2,f3; fdp). No specific assumptions are made on the functions Φ and XG. In the Appendix it is shown that, to first-order approximation, the phase of the DP at its place of generation is given by

[Formula ID: Equ7]
[\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {\Phi_{\rm{dp}}} = {\Phi_0} + \kappa \sum\limits_{k = 1}^3 {{f_k}\frac{{\partial {X_{\rm{G}}}}}{{\partial {f_k}}}} + \kappa {f_{\rm{dp}}}\frac{{\partial {X_{\rm{G}}}}}{{\partial {f_{\rm{dp}}}}} - \tau {f_{\rm{dp}}} $$\end{document}]
where κ and τ, and Φ0 are constants. Consider now a particular joint variation of the stimulus frequencies
[Formula ID: Equa]
[\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \left( {{f_1},{f_2},{f_3}} \right) \to ({f_1} + \Delta {f_1},{f_2} + \Delta {f_2},{f_3} + \Delta {f_3}) $$\end{document}]
that leaves fdp fixed, i.e. Δfdp = 0. For such fixed-fdp variations, the only term contributing to variations in Φdp is the second term of Eq. 7, resulting in
[Formula ID: Equ8]
[\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \Delta {\Phi_{\rm{dp}}} = \kappa \sum\limits_{k = 1}^3 {\Delta {f_k}\frac{{\partial {X_{\rm{G}}}}}{{\partial {f_k}}}} = \kappa \Delta {X_G} $$\end{document}]

Therefore, what we have called the residual phase dependence, directly corresponds to shifts ΔXG of the cochlear location XG at which the DP is generated. If the generation site is unchanged (ΔXG = 0), fixed-fdp “sweeps” will leave DPOAE phase unchanged. Conversely, when a residual phase effect is observed, it signals a shift of the generation site of the DPOAE.

With our particular stimulus design, two-dimensional phase representations were possible for two DPOAE types (near and sideband groups) in two stimulus configurations (f > g and f < g). The analysis of DPOAE phase in the resulting four situations is summarized in Table 2. The only case showing a systematic residual phase effect was the sideband DPOAE when f > g (lower right entry of the table). In order to relate this observation to the theoretical analysis of the residual phase effect, it is necessary to consider how, for each of the four situations listed in Table 2, the site of DPOAE generation is affected by the fixed-fdp “sweeps”. To this end, we make the following assumption: When the nonlinear interaction of three stimulus frequencies (f1,f2,f3), produces a DP at frequency fdp, the site of DP generation corresponds to the region with characteristic frequency fmax = max(f1,f2,f3,fdp). In other words, DP generation is at the most basal best site among the four frequency components. We will refer to this as the assumption of the most basal generation site.

The assumption of the most basal generation site is a generalization of the evidence that, for two-tone stimuli, the 2f1 − f2 DP is generated near the f2 site, whereas the 2f2 − f1 DP is generated near its own best site (Brown and Kemp 1984; Martin et al. 1987). The assumption is further motivated by the extremely steep high-frequency flank of mechanical tuning (Robles and Ruggero 2001), implying that tones (whether presented acoustically or generated inside the cochlea) barely propagate in cochlear regions more apical than their own best site.

The schematic diagrams in Figure 10 depict the overlapping cochlear excitation patterns for each of the four different situations of Table 2. The generation site of DPOAEs (as predicted from the assumption of most basal generation) is labeled XG in each case. The fixed-fdp “sweeps” that underlie the analysis of the residual phase effect are indicated by the arrows placed over the peaks. These arrows indicate the joint shift in primaries gi and gj that keep fdp fixed. Notice that the requirement of a fixed fdp calls for different types of joint shifts. For the near group (upper two panels), where [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {f_{\rm{dp}}} = {g_i} + {g_j} - f $$\end{document}] , primaries gi and gj must move in opposite directions; for the sideband group (lower two panels), where [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {f_{\rm{dp}}} = f + {g_i} - {g_j} $$\end{document}] , the primaries gi and gj must move in the same direction. A straightforward application of the above assumption of the most basal generation site now immediately leads to the result that the only situation in which a fixed-fdp “sweep” causes a shift of the intracochlear location of DP generation XG, is the sideband group with f < g (lower right panel of Fig. 10). This is consistent with our observation that the sideband group with f < g is the only one showing a residual phase effect (Table 2).


Discussion

We explored the spectrum of third-order DPOAEs evoked by multitone stimuli in the gerbil ear. Multitone (>2 primaries) stimuli were used in previous work on DPOAEs, but in the majority of those studies the DPOAEs considered were still generated by two primaries; the third tone was only used to interact with the generation of the two-tone DPOAEs (Brown and Kemp 1984; Meenderink and Narins 2007). Genuine three-tone DPs have been recognized and observed in their own right, both as “additional DPs” (in widely separated primaries) and as “sidebands” around one stimulus tone (when the other two primaries were close; e.g., Kemp and Brown 1986; Kemp 1998). They have also occurred as the spectral side-effect of low-frequency biasing of two-tone DPOAEs (Bian and Scherrer 2007; Marquardt et al. 2007) or dynamic suppression of stimulus frequency OAEs (Meenderink and van der Heijden 2010). Unlike previous work, the aim of the present study was to integrate three-tone DPOAEs into a unifying framework alongside conventional DPOAEs.

Our stimuli were constructed by replacing one of the primaries of a customary two-tone stimulus by a narrowband tone complex. This produced rich spectra of the ear canal sound pressure that contained a large number of third-order DPOAEs. The analysis of this spectrum was greatly facilitated by a particular stimulus design in which an irregular spacing of the components prevents multiple DPOAEs from coinciding (Victor et al. 1977; van der Heijden and Joris 2003; 2006).

The richness of the DPOAE spectrum renders the conventional names (2f1 − f2; 2f2 − f1;…) insufficient. We identified and analyzed three groups of third-order DPOAEs, two of which (the “near” and “far” groups) are closely related to the well-known CDTs evoked by tone pairs; and one of which (the “sideband” group) manifested itself as sidebands around the single-tone primary. The sideband group has no two-tone evoked counterpart. We showed that a consideration of the basic properties of third-order distortions is sufficient to explain the occurrence of these three groups as well as their basic properties, such as the number of components of each group and the emergence of two sets of magnitudes in the near group.

We explored the connection between the near and far groups evoked by tone complexes on the one hand, and the customary DPOAEs evoked by tone pairs on the other. Their frequency-phase curves, group delays, and magnitude spectra were found to be very similar (Figs. 2, 3, and 6). We conclude that the general features of these DPOAEs are the same, regardless of whether one of the tones of a pair is swept in frequency (the customary approach), or whether the different “swept” frequencies are presented simultaneously in the form of a tone complex (the approach introduced in this study). Such an equivalence of subsequent and simultaneous stimulation modes is not a priori obvious for a nonlinear system like the inner ear. Incidentally, simultaneous presentation does not necessarily speed up the measurements as the same response energy is now subdivided among the multiple DP components giving a lower signal to noise ratio.

Overall, the third-order DPOAEs evoked by the interaction of triplets of primary components appear to be a natural extension of those evoked by tone pairs. The use of tone complexes, however, also opens up modes of analysis that go beyond two-tone paradigms. The sideband group of DPOAEs realizes a relation between stimulus frequencies and DPOAE frequencies that is very different from the two-tone paradigm. It allows for much wider frequency spacing between primaries than is possible for the generation of two-tone DPOAEs. In a recent study (Meenderink and van der Heijden 2010), we have employed a particular realization of the sideband group, in which the tone complex was more than an octave below the single-tone primary, to analyze cochlear travel times.

More generally, the analysis of three independent interacting primaries creates “extra degrees of freedom” compared to two-tone paradigms. By representing DPOAE phase (near and sideband groups) in a two-dimensional format (Figs. 7A, 8B, and 9C), we tested whether variation of DPOAE phase was exhaustively described by its dependence on DPOAE frequency (which is given by a fixed relation between the stimulus frequencies), or whether the other degrees of freedom co-determined DPOAE phase. For the latter phenomenon, in which DPOAE phase changes while DPOAE frequency is constant, we introduced the term “residual phase effects”.

The two-dimensional phase analysis (Figs. 7B, 8D, and 9E) showed that, in all except one situation, DPOAE phase exclusively varies along the DPOAE-frequency direction. The one exception that showed residual phase dependence was the sideband group emerging when the tone complex was above the single primary (Table 2). A preliminary theoretical analysis showed that, under general assumptions of DPOAE generation, this is precisely the only situation in which the intracochlear generation site of the DPOAEs is variable while DPOAE frequency is fixed (Fig. 10).

Further quantitative modeling of the residual phase effect is beyond the scope of this explorative report. It seems reasonable, however, to link our observations with the known contrast between f1- and f2-sweeps on the phase of 2f1 − f2 emissions (e.g., Shera et al. 2000). Several studies (Moulin and Kemp 1996a,b; O Mahoney and Kemp, 1995; Schneider et al. 1999; Whitehead et al. 1996), including the present one (Fig. 6), have shown that for the 2f1 − f2 CDT, a fixed-f1 recording paradigm yields larger group delays than a fixed-f2 paradigm. Their different group delays (evaluated from phase versus 2f1 − f2 curves) imply that fixed-(2f1 − f2) sweeps, in which f1 and f2 are jointly swept while keeping 2f1 − f2 fixed, will still produce a phase variation. In the terminology introduced in this study, this is a case of a residual phase effect. It has been explained by the assumption that 2f1 − f2 emissions are generated near the f2 region (Brown and Kemp 1984; Martin et al. 1987), which moves during the fixed-(2f1 − f2) sweep. The complementary observation that 2f2 − f1 group delays are independent of sweep paradigm is similarly explained by assuming that it is generated near its own peak region (Martin et al. 1987). With this assumption, a joint sweep of f1 and f2 that leaves 2f2 − f1 unchanged will have no effect on DPOAE phase: the residual phase effect is absent. We have generalized these observations on the location of DPOAE generation in the “assumption of the most basal generation site” introduced in our theoretical analysis (last section of Results). This explanation parallels the description of DP generation in terms of “wave-fixed” and “place-fixed” sources (reviewed in Kemp 2008). Apparently, these concepts, which were formulated to analyze two-tone DPOAEs, are also applicable to the sideband DPOAEs reported in the present study.

Rather than introducing new names and types of emissions (see also Shera and Guinan 1999), we would like to emphasize the potential synthesizing power of multitone DPOAEs. By definition, third-order distortions emerge by the interaction of three primaries. It is perfectly valid for two of the three interacting tones to coincide, but there is no compelling reason to restrict the study of DPOAEs to this rather limited subclass of distortion products. In fact, such a restriction to two-tone DPOAEs is potentially biasing and may obscure the underlying unity of the family of distortion products. Any realistic nonlinear mechanism that generates a 2f1f2 component will also generate a 2f2f1 component. In this respect, lower and upper CDTs are really two sides of the same coin. Unfortunately, the typical primary ratio of ∼1.2 results in a sizeable frequency separation between the lower and upper CDTs (ratio of ∼1.75). This frequency difference creates a marked contrast in their propagation toward the ear canal owing to the tonotopic organization the cochlea. Therefore, this type of two-tone data emphasizes the contrast between the lower and upper CDTs, and underexposes the underlying unity. This unity is evident from the fact that extensive (f1, f2) area sweeps show a continuum of DPOAEs rather than two disjoint regions (Knight and Kemp 2000; Schneider et al. 2003).

The data of the present study demonstrate that it is well possible to generalize beyond two-tone stimuli, provided some caution is observed in the stimulus design. The capability of a multitone paradigm to reveal the unity of third-order DPOAEs is perhaps best illustrated by a recording in which the single tone was placed in the middle of the multitone primary (Fig. 11). The stimulus design still makes it possible to distinguish the near, far and sideband groups based on the unique triplets of primaries producing each DPOAE component, and the phases of the groups are distinguished by the different symbols in Figure 11B. It is clear, however, that the distinction between the groups has become purely formal. Owing to their overlapping frequency range, the groups merge into a single family of third-order DPOAEs spanning a bandwidth of ∼1.1 kHz over which they show a systematic phase variation. This DPOAE spectrum bridges the gap between lower and upper CDTs, perhaps illustrating that the nomenclature of DPOAEs (2f1f2, near group…) is often more a reflection of the experimental methodology than of the mechanisms that shape them.


This work was supported by NWO-VENI grant 863.08.003 to SWFM. Portions of this work were presented at the 10th International Workshop on the Mechanics of Hearing, Keele University, 2008.

Open Access This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.


References
Bian L,Scherrer NM. Low-frequency modulation of distortion product otoacoustic emissions in humansJ Acoust Soc AmYear: 20071221681169210.1121/1.276446717927428
Brown AM,Kemp DT. Suppressibility of the 2f1-f2 stimulated acoustic emissions in gerbil and manHear ResYear: 198413293710.1016/0378-5955(84)90092-36706860
Faulstich M,Kössl M. Evidence for multiple DPOAE components based upon group delay of the 2f1-f2 distortion in the gerbilHear ResYear: 20001409911010.1016/S0378-5955(99)00189-610675638
Goldstein JL,Kiang NYS. Neural correlates of the aural combination tone 2f1-f2Proc IEEEYear: 19685698199210.1109/PROC.1968.6449
Kemp DT,Brown AM. Allen JB,Hall JL,Hubbard A,Neely ST,Tubis AWideband analysis of otoacoustic intermodulationPeripheral auditory mechanismsYear: 1986New YorkSpringer306313
Kemp DT. Berlin CIOtoacoustic emissions: distorted echoes of the cochlea's traveling waveOtoacoustic Emissions: basic science and clinical applicationsYear: 1998San DiegoSingular160
Kemp DT. Manley GA,Fay RR,Popper ANOtoacoustic emissions: concepts and originsActive processes and otoacoustic emissionsYear: 2008New YorkSpringer139
Kim DO,Molnar CE,Matthews JW. Cochlear mechanics: nonlinear behavior in two-tone responses as reflected in cochlear-nerve-fiber responses and in ear-canal sound pressureJ Acoust Soc AmYear: 1980671704172110.1121/1.3842977372925
Knight RD,Kemp DT. Indications of different distortion product otoacoustic emission mechanisms from a detailed f1, f2 area studyJ Acoust Soc AmYear: 200010745747310.1121/1.42835110641654
O’Mahoney CF,Kemp DT. Distortion product otoacoustic emission delay measurement in human earsJ Acoust Soc AmYear: 19959737313735
Marquardt T,Hensel J,Mrowinski D,Scholz G. Low-frequency characteristics of human and guinea pig cochleaeJ Acoust Soc AmYear: 20071213628363810.1121/1.272250617552714
Martin GK,Lonsbury-Martin BL,Probst R,Scheinin SA,Coats AC. Acoustic distortion products in rabbit ear canal. II. Sites of origin revealed by suppression contours and pure tone exposuresHear ResYear: 19872819120810.1016/0378-5955(87)90049-93654389
Meenderink SWF,Narins PM. Suppression of distortion product otoacoustic emissions in the anuran earJ Acoust Soc AmYear: 200712134435110.1121/1.238245817297789
Meenderink SWF,Heijden M. Reverse cochlear propagation in the intact cochlea of the gerbil: evidence for slow traveling wavesJ NeurophysiolYear: 20101031448145510.1152/jn.00899.200920089817
Moulin A,Kemp DT. Multicomponent acoustic distortion product otoacoustic emission phase in humans. I. General characteristicsJ Acoust Soc AmYear: 19961001617163910.1121/1.4160638817891
Moulin A,Kemp DT. Multicomponent acoustic distortion product otoacoustic emission phase in humans. II. Implications for distortion product otoacoustic emissions generationJ Acoust Soc AmYear: 19961001640166210.1121/1.4160648817892
Papoulis A. Fourier integral and its applicationsYear: 1962New YorkMcGraw-Hill
Plomp R. Detectability of threshold for combination tonesJ Acoust Soc AmYear: 1965371110112310.1121/1.190953214339721
Rhode WS. Basilar membrane mechanics in the 6–9 kHz region of sensitive chinchilla cochleaeJ Acoust Soc AmYear: 20071212792280410.1121/1.271839717550178
Robles L,Ruggero MA. Mechanics of the mammalian cochleaPhysiol RevYear: 2001811305135211427697
Schneider S,Prijs VF,Schoonhoven R. Group delays of distortion product otoacoustic emissions in the guinea pigJ Acoust Soc AmYear: 19991052722273010.1121/1.42689010335624
Schneider S,Prijs VF,Schoonhoven R. Amplitude and phase of distortion product otoacoustic emissions in the guinea pig in an (f1, f2) area studyJ Acoust Soc AmYear: 20031133285329610.1121/1.156875312822801
Schroeder MR. Amplitude behavior of the cubic difference toneJ Acoust Soc AmYear: 19755872873210.1121/1.3807211184846
Shera CA,Guinan JJ Jr. Evoked otoacoustic emissions arise by two fundamentally different mechanisms: a taxonomy for mammalian OAEsJ Acoust Soc AmYear: 199910578279810.1121/1.4269489972564
Shera CA,Talmadge CL,Tubis A. Interrelations among distortion-product phase-gradient delays: their connection to scaling symmetry and its breakingJ Acoust Soc AmYear: 20001082933294810.1121/1.132323411144585
Smoorenburg GF. Audibility region of combination tonesJ Acoust Soc AmYear: 19725260361410.1121/1.1913151
Heijden M,Joris PX. Cochlear phase and amplitude retrieved from the auditory nerve at arbitrary frequenciesJ NeurosciYear: 20032321242128
Heijden M,Joris PX. Panoramic measurements of the apex of the cochleaJ NeurosciYear: 200626114621147310.1523/JNEUROSCI.1882-06.200617079676
Victor JD,Shapley RM,Knight BW. Nonlinear analysis of cat retinal ganglion cells in the frequency domainProc Natl Acad Sci U S AYear: 1977743068307210.1073/pnas.74.7.3068268656
Békésy G. Experiments in hearingYear: 1960New YorkMcGraw-Hill
Wever EG,Lawrence M. Physiological AcousticsYear: 1954PrincetonPrinceton University Press
Whitehead ML,Stagner BB,Martin GK,Lonsbury-Martin BL. Visualization of the onset of distortion-product otoacoustic emissions, and measurement of their latencyJ Acoust Soc AmYear: 19961001663167910.1121/1.4160658817893
Appendix

Denote the phase of the cochlear vibration evoked by a single tone by

[Formula ID: Equ9]
A1  [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \Phi \left( {x;f} \right), $$\end{document}]
where f is the frequency of the tone and x is the cochlear location. No specific assumptions are made regarding this function, except sufficient smoothness. Consider a triplet of stimulus tones at frequencies f1, f2, and f3, which need not be all different. The frequency fdp of a third-order DP produced by the nonlinear interaction of the three tones is
[Formula ID: Equ10]
A2  [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {f_{\rm{dp}}} = \sum\limits_{k = 1}^3 {{w_k}{f_k}}, $$\end{document}]
where wk = ±1 are weight factors. For the three DPOAE groups considered in this study (e.g., far, near, and sideband groups), the weight factors always add up to unity:
[Formula ID: Equ11]
A3  [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \sum\limits_{k = 1}^3 {{w_k} = 1} . $$\end{document}]

With the assumption that a DP component is generated at a single cochlear location XG, its phase Φdp at this location becomes

[Formula ID: Equ12]
A4  [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {\Phi_{\rm{dp}}} = \sum\limits_{k = 1}^3 {{w_k}\Phi \left( {{X_G};{f_k}} \right)} . $$\end{document}]

In general, XG will be a function of all three primary frequencies and the DP frequency, fdp:

[Formula ID: Equb]
[\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {X_{\rm{G}}} = {X_{\rm{G}}}({f_1},{f_2},{f_3};{f_{\rm{dp}}}). $$\end{document}]

For sufficiently narrowband signals, the phase of a frequency component f close to f0 at a place x close to x0 is well described by the first-order approximation

[Formula ID: Equ13]
A5  [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \Phi (x,f) \cong \Phi \left( {{x_0};{f_0}} \right) + \kappa \left( {x - {x_0}} \right) - \tau \left( {f - {f_0}} \right), $$\end{document}]
where the constants [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \kappa = \partial \Phi /\partial x({x_0},{f_0}) $$\end{document}] and [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \tau = - \partial \Phi /\partial f({x_0},{f_0}) $$\end{document}] are the average wave number (inverse wavelength) and group delay, respectively. Similarly, if the primary frequencies and DP frequency are sufficiently close together, one may approximate
[Formula ID: Equ14]
A6  [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {X_{\rm{G}}}({f_1},{f_2},{f_3};{f_{\rm{dp}}}) \cong {X_{\rm{G}}}({f_0}) + \sum\limits_{k = 1}^3 {({f_k} - {f_0})\frac{{\partial {X_{\rm{G}}}}}{{\partial {f_k}}}({f_0})} + ({f_{\rm{dp}}} - {f_0})\frac{{\partial {X_{\rm{G}}}}}{{\partial {f_{\rm{dp}}}}}({f_0}), $$\end{document}]
where XG(f0) is shorthand for XG(f0,f0,f0;f0).

The combination of A4, A5, and A6 yields

[Formula ID: Equ15]
A7  [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ {\Phi_{\rm{dp}}} = {\Phi_0} + \kappa \sum\limits_{k = 1}^3 {{f_k}\frac{{\partial {X_{\rm{G}}}}}{{\partial {f_k}}}} + \kappa {f_{\rm{dp}}}\frac{{\partial {X_{\rm{G}}}}}{{\partial {f_{\rm{dp}}}}} - \tau {f_{\rm{dp}}}, $$\end{document}]
where various constant terms were absorbed into Φ0.

Equation A7 shows that variations in primary frequency fk affect DP phase in two possible ways. Variations in fk that cause fdp to change result in a “direct” effect on Φdp via the last term. This simply reflects the average group delay τ of the collective incoming traveling waves toward the generation site. It is the main effect of changing the primary frequencies, which incorporates the travel times toward the generation site. The main effect does not require the generation site XG to shift with the variation in stimulus frequencies. The other two contributions to changes in Φdp, the second and third terms of A7, occur through shifts of the generation site XG. Both terms contain the factor κ, the wave number of the traveling waves at the generation site. The third term describes the changes of XG with fdp; this contribution merges into the main effect of the fourth term. The second term is the only term that can cause phase changes mediated by fixed-fdp variations of the stimulus frequencies. This establishes the link between residual phase effects and shifts of the location of DP generation.


Figures

[Figure ID: Fig1]
FIG. 1 

DPOAEs evoked by multitone stimuli. Amplitude spectra of ear canal sound pressure are shown. Either the lower (A) or the upper (B) primary of the two-tone stimulus was replaced by a tone complex. The stimulus (squares) consisted of a seven-tone complex (g1 to g7) and a single tone (f). Three groups of third order DPOAEs that result from the interaction between f and g can be identified: the far group at 2f − gi (left-pointing triangles), the near group at frequencies gi + gj − f (right-pointing triangles for i < j and closed circles for i = j), and the sideband group at f + gi − gj (diamonds). The estimated noise floor (line) and the recorded system distortion (open circles) are also given. Stimulus parameters: f = 6.6 kHz; Lf = 60 dB SPL; Lg = 50 dB SPL per component; seven components in tone complex; g = 5…5.2 kHz in A and g = 7.2…7.6 kHz in B.



[Figure ID: Fig2]
FIG. 2 

Comparison between DPOAEs evoked with either a single multitone stimulus or a series of two-tone stimuli. A Ten superimposed amplitude spectra obtained either with nine separate two-tone stimuli (light gray) or with a nine-tone zwuis-complex (dark gray). The dashed box around the DPOAEs is shown again in (B), with the corresponding phase data in (C). Circles represent the 2f2 − f1 DPOAEs from the consecutive two-tone stimuli; triangles, the far group. The horizontal dashed line in B indicates the total power of the DPOAEs in the far group. Phase data were fitted with straight lines (dashed lines), yielding group delays of 372 ± 12 μs and 356 ± 17 μs for the 2f2 − f1 DPOAEs (circles) and the far group (triangles), respectively. Stimulus parameters: f (or f2) = 7.7 kHz; g (or f1) = 6.1…6.6 kHz (nine components); Lf (or L2) = 70 dB SPL; Lg = 60 dB SPL per component; L1 = 70 dB SPL.



[Figure ID: Fig3]
FIG. 3 

Comparison of DPOAEs obtained with two-tone sweeps and multitone stimuli. DPOAEs from four animals are displayed in scatter plots of group delay (A) and power (B). Data were obtained either from a single recording using a tone complex (ordinate) or from several consecutive two-tone recordings that visited the frequencies of the tone complex one-by-one (abscissa). In B, the total power ΣP of the DPOAE group is plotted against the average power [\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \hat{P} $$\end{document}] of the two-tone evoked DPOAEs. See Figure 2 for an individual example. Data for the far group (triangles) and near group (circles) are shown. Dotted and dashed lines in A indicate equality ±100 μs; in B, equality ±3 dB.



[Figure ID: Fig4]
FIG. 4 

Near group of DPOAEs. A Power spectrum of the ear canal sound pressure. DPOAE components of the near group are given by gi + gj − f (triangles for i < j and circles for i = j). Notice the systematic, ∼6 dB, difference in magnitude between the two subgroups. B Corresponding phase data. A straight line fitted to these data (dashed line) yielded a group delay of 666 ± 16 μs. Stimulus parameters: f = 12 kHz; g = 9…9.5 kHz (nine components); Lf = 65 dB SPL; Lg = 55 dB SPL per component.



[Figure ID: Fig5]
FIG. 5 

(color online) Schematic diagrams illustrating how pairs of components from the tone complex combine to produce DPOAEs in the near group (upper row) and sideband group (bottom row). Each DPOAE results from a unique pair [gi , gj] of tone complex components, represented by the two-dimensional, square matrix. For the near group (A), a transformation of this matrix by a 135° counterclockwise rotation (see inset), and by a translation over the distance −f, yields the equivalent matrix [gi + gj − f, gi − gj] (B). In this representation the abscissa equals DPOAE frequency, and the ordinate represents stimulus frequency variations that leave DPOAE frequency fixed. For the sideband group (C), the translation over distance f combined with a 45° counterclockwise rotation yields matrix [f + gi − gj, gi + gj] (D). This transformation has the same effect: the abscissa again equals DPOAE frequency, while the ordinate represents the variations that leave DPOAE frequency fixed. Note that the zwuis character of the stimulus ensures that none of the symbols are aligned vertically in panels B and D. The (color) grading of the symbols has no particular meaning, but is an aid to track the rotations.



[Figure ID: Fig6]
FIG. 6 

Group delays for the near and far groups of DPOAEs. Based on their similarity with DPOAEs from two-tone sweeps (see text), the data were split into effective 2f1 − f2 group delays (A) and effective 2f2 − f1 group delays (B). Within each of these groups, and as indicated in the graph, the markers distinguish effective fixed-f2 and effective fixed-f1 group delays, corresponding to (f > g) and (f < g), respectively. For comparison, genuine two-tone data from the literature are shown in A (line). These 2f1 − f2 group delays (Faulstich and Kössl 2000), were obtained from gerbils using a fixed-f2 paradigm.



[Figure ID: Fig7]
FIG. 7 

A Two-dimensional representation of DPOAE phase for the near group. The phases of Figure 4B are replotted as a contour plot, using the two-dimensional representation of stimulus frequencies of Figure 5B. The abscissa gives DPOAE frequency fdp; the ordinate, fixed-fdp variations. B Results from the two-dimensional model (Eq. 6) fitted to these phase data (τ+ = 665 ± 16 μs; τ = 16 ± 26 μs). Contours in A and B are drawn at 0.1-cycle intervals. Stimulus parameters: f = 12 kHz; g = 9…9.5 kHz (nine components); Lf = 65 dB SPL; Lg = 55 dB SPL per component. C Scatter plot of τ+ and τ calculated for 998 recordings from 15 animals (830 when f > g; 168 when f < g). DPOAE phase for the near group is well described by DPOAE frequency, irrespective of the stimulus configuration.



[Figure ID: Fig8]
FIG. 8. 

Phase data for the sideband group obtained with f > g. A DPOAE phase plotted against DPOAE frequency. A straight line was fitted to these data (dashed line), yielding a group delay of 536 ± 13 μs. In B, the same data are plotted as a contour plot, using the two-dimensional representation of Fig. 5D. The abscissa gives DPOAE frequency fdp; the ordinate, fixed-fdp frequency variations. C Results from the two-dimensional model (Eq. 6) fitted to these phase data (τ+ = 22 ± 14 μs; τ = 536 ± 13 μs). Contours in B and C are drawn at 0.1-cycle intervals. The corresponding magnitude spectrum is shown in Fig. 4A. Stimulus parameters: f = 12 kHz; g = 9…9.5 kHz (nine components); Lf = 65 dB SPL; Lg = 55 dB SPL per component. D Scatter plot of τ and τ+ calculated for 1,065 recordings from 15 animals, all with f > g. DPOAE phase for the f > g sideband group is well described by DPOAE frequency; fixed-fdp frequency variations have no systematic effect on DPOAE phase.



[Figure ID: Fig9]
FIG. 9 

Sideband DPOAEs obtained with f < g. A Amplitude spectrum of ear-canal sound pressure. B Corresponding phase data for the sideband group of DPOAEs. Fitting a straight line to these data yielded a group delay of 641 ± 32 μs. The line fit (dashed line) captures the major trend in these data, but misses the systematic near-vertical patterning in the data. C Two-dimensional representation of the same phase data and D the result from fitting the linear model of Eq. 6 to these phase data (τ+ = −113 ± 20 μs; τ = 642 ± 19 μs). Contours in C and D are drawn at 0.1-cycle intervals. Stimulus parameters: f = 5.3 kHz; g = 6.3...6.8 kHz (nine components); Lf = 65 dB SPL; Lg = 55 dB SPL per component. E Scatter plot of τ and τ+ calculated for 298 recordings from 13 animals, all with f < g. DPOAE phase for the sideband group is not exhaustively described by DPOAE frequency fdp; the data show a systematic effect of fixed-fdp frequency variations.



[Figure ID: Fig10]
FIG. 10 

Schematic diagrams of the cochlear excitation patterns for different three-tone stimuli and the surmised locations of DP generation. Af > g; near group; Bf < g, near group; Cf > g, sideband group; Df < g, sideband group. For a given stimulus (solid lines; shaded areas), DP generation is postulated to occur at the best site of the highest frequency component (f, gi, gj, or fdp) involved, as indicated in the lower part of each panel by XG (gray). A shift of the stimulus frequencies (gi, gj) that leaves the DP frequency unchanged (dashed lines) results in a shifted DP generation site for the sideband group when f < g (D): in this example, it shifts to a more basal location. No such shift of the generation site XG occurs for any of the other {stimulus, DP} configurations (A, B, C).



[Figure ID: Fig11]
FIG. 11 

Merging of DPOAE groups for f ≅ g. A DPOAE spectrum evoked by a special stimulus in which the single stimulus tone f (closed square; vertical arrowhead) was placed within the frequency range of the tone complex g (open squares), resulting in the overlap of the three DPOAE groups (diamonds sideband group; circles near group; open squares far group). The stimulus design (see Table 1) ensures that DPOAE components never coincide, despite their overlapping frequency ranges. Phase data (B) were analyzed both separately per group and collectively, yielding group delays of 652 ± 37 μs (sideband); 667 ± 44 μs (near); 699 ± 62 μs (far); and 655 ± 29 μs (all groups pooled). Stimulus parameters: f = 6.0 kHz; g = 5.8...6.3 kHz (five components); Lf = 50 dB SPL; Lg = 50 dB SPL per component.



Tables
[TableWrap ID: Tab1] TABLE 1 

Relation between the number of periods for the stimulus components and the different groups of DPOAEs that result from them


Frequency Number of periods
g; components from tone complex 8 k + 4
f; single component 8 k + 2
2f − gi; ‘far’ group 8 k
gi + gj − f; ‘near’ group 8 k + 6
f + gi − gj; ‘sideband’ group 8 k + 2
gi + gj − gk; within-complex OAEs 8 k + 4

All stimulus components were chosen to have an even number of periods (rows 1–2, with integer k) over an integer number of sample points (typically 97,656). This results in the third-order DPOAEs of the different groups (rows 3–5; see text) to be periodic over the same number of samples, and never coincide with any stimulus component. The choice for the number of periods also rules out any coincidences across members of the three groups and the fourth group (bottom row) of the third-order DPOAEs arising from interaction of triplets (gi, gj, gk) within the tone complex


[TableWrap ID: Tab2] TABLE 2 

Population data of the residual phase effect for the four different {stimulus, DPOAE}-configurations


f > g f < g
Near (τ) −6 ± 51 μs (N = 830) −3 ± 45 μs (N = 168)
Sideband (τ+) 8 ± 40 μs (N = 1,065) −115 ± 114 μs (N = 298)

Each cell displays mean (±standard deviation) of the fixed-fdp group delay (see text). The numbers between parentheses give the number of recordings. Only for the sideband group, recorded with f < g, does the residual phase contribute to the observed DPOAE phase variation



Article Categories:
  • Article

Keywords: Keywords DPOAEs, otoacoustic, gerbil, Meriones unguiculatus, tone complex, zwuis.

Previous Document:  Improved horizontal directional hearing in bone conduction device users with acquired unilateral con...
Next Document:  Automatic extraction of concepts to extend RadLex.