Abstract
By Fourier's theorem1, signals can be decomposed into a sum of sinusoids of different frequencies. This is especially relevant for hearing, because the inner ear performs a form of mechanical Fourier transform by mapping frequencies along the length of the cochlear partition. An alternative signal decomposition, originated by Hilbert2, is to factor a signal into the product of a slowly varying envelope and a rapidly varying fine time structure. Neurons in the auditory brainstem3,4,5,6 sensitive to these features have been found in mammalian physiological studies. To investigate the relative perceptual importance of envelope and fine structure, we synthesized stimuli that we call ‘auditory chimaeras’, which have the envelope of one sound and the fine structure of another. Here we show that the envelope is most important for speech reception, and the fine structure is most important for pitch perception and sound localization. When the two features are in conflict, the sound of speech is heard at a location determined by the fine structure, but the words are identified according to the envelope. This finding reveals a possible acoustic basis for the hypothesized ‘what’ and ‘where’ pathways in the auditory cortex7,8,9,10.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 51 print issues and online access
$199.00 per year
only $3.90 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Fourier, J. B. J. La théorie analytique de la chaleur. Mém. Acad. R. Sci. 8, 581–622 (1829).
Hilbert, D. Grundzüge einer allgemeinen Theorie der linearen Integralgleichungen (Teubner, Leipzig, 1912).
Rhode, W. S., Oertel, D. & Smith, P. H. Physiological response properties of cells labeled intracellularly with horseradish peroxidase in cat ventral cochlear nucleus. J. Comp. Neurol. 213, 448–463 (1983).
Joris, P. X. & Yin, T. C. Envelope coding in the lateral superior olive. I. Sensitivity to interaural time differences. J. Neurophysiol. 73, 1043–1062 (1995).
Yin, T. C. & Chan, J. C. Interaural time sensitivity in medial superior olive of cat. J. Neurophysiol. 65, 465–488 (1990).
Langner, G. & Schreiner, C. E. Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms. J. Neurophysiol. 60, 1799–1822 (1988).
Rauschecker, J. P. & Tian, B. Mechanisms and streams for processing of “what” and “where” in auditory cortex. Proc. Natl Acad. Sci. USA 97, 11800–11806 (2000).
Recanzone, G. H. Spatial processing in the auditory cortex of the macaque monkey. Proc. Natl Acad. Sci. USA 97, 11829–11835 (2000).
Tian, B., Reser, D., Durham, A., Kustov, A. & Rauschecker, J. P. Functional specialization in rhesus monkey auditory cortex. Science 292, 290–293 (2001).
Maeder, P. P. et al. Distinct pathways involved in sound recognition and localization: a human fMRI study. NeuroImage 14, 802–816 (2001).
Dodge, C. & Jerse, T. A. Computer Music: Synthesis, Composition, and Performance (Schirmer Books, New York, 1997).
Depalle, P., Garcia, G. & Rodet, X. A virtual castrato (!?). Proc. Int. Computer Music Conf., Aarhus, Denmark 357–360 (1994).
Shen, C., Smith, Z. M., Oxenham, A. J. & Delgutte, B. Auditory Chimera Demo; available at 〈http://epl.meei.harvard.edu/~bard/chimera〉 (2001).
Licklider, J. Effect of amplitude distortion upon the intelligibility of speech. J. Acoust. Soc. Am. 18, 429–434 (1946).
Saberi, K. & Perrot, D. R. Cognitive restoration of reversed speech. Nature 398, 760 (1999).
Nilsson, M., Soli, S. D. & Sullivan, J. A. Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise. J. Acoust. Soc. Am. 95, 1085–1099 (1994).
Shannon, R. V., Zeng, F.-G., Kamath, V., Wygonski, J. & Ekelid, M. Speech recognition with primarily temporal cues. Science 270, 303–304 (1995).
Ghitza, O. On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception. J. Acoust. Soc. Am. 110, 1628–1640 (2001).
Wightman, F. L. & Kistler, D. J. The dominant role of low-frequency interaural time differences in sound localization. J. Acoust. Soc. Am. 91, 1648–1661 (1992).
Henning, G. B. & Ashton, J. The effect of carrier and modulation frequency on lateralization based on interaural phase and interaural group delay. Hear. Res. 4, 184–194 (1981).
Bernstein, L. R. & Trahiotis, C. Lateralization of low-frequency complex waveforms: The use of envelope-based temporal disparities. J. Acoust. Soc. Am. 77, 1868–1880 (1985).
Wilson, B. S. et al. Better speech recognition with cochlear implants. Nature 352, 236–238 (1991).
Rubinstein, J. T., Wilson, B. S., Finley, C. C. & Abbas, P. J. Pseudospontaneous activity: stochastic independence of auditory nerve fibers with electrical stimulation. Hear. Res. 127, 108–118 (1999).
Litvak, L., Delgutte, B. & Eddington, D. Auditory nerve fiber responses to electrical stimulation: modulated and unmodulated pulse trains. J. Acoust. Soc. Am. 110, 368–379 (2001).
Greenwood, D. D. A cochlear frequency-position function for several species—29 years later. J. Acoust. Soc. Am. 87, 2592–2604 (1990).
Ville, J. Théorie et applications de la notion de signal analytique. Cables Transmission 2, 61–74 (1948).
Troullinos, G., Ehlig, P.,,Chirayil, R., Bradley, J. & Garcia, D. in Digital Signal Processing Applications with the TMS320 Family (ed. Papamichalis, P.) 221–330 (Texas Instruments, Dallas, 1990).
Hartmann, W. M. & Johnson, D. Stream segregation and peripheral channeling. Music Percept. 9, 155–184 (1991).
Acknowledgements
We thank C. Shen for assistance with data analysis and J. R. Melcher and L. D. Braida for comments on an earlier version of the manuscript. A.J.O. is currently a fellow at the Hanse Institute for Advanced Study in Delmenhorst, Germany. This work was supported by grants from the National Institutes of Health (NIDCD).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Rights and permissions
About this article
Cite this article
Smith, Z., Delgutte, B. & Oxenham, A. Chimaeric sounds reveal dichotomies in auditory perception. Nature 416, 87–90 (2002). https://doi.org/10.1038/416087a
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1038/416087a
This article is cited by
-
The importance of temporal-fine structure to perceive time-compressed speech with and without the restoration of the syllabic rhythm
Scientific Reports (2023)
-
Hearing as adaptive cascaded envelope interpolation
Communications Biology (2023)
-
Advantages of Pulse Rate Compared to Modulation Frequency for Temporal Pitch Perception in Cochlear Implant Users
Journal of the Association for Research in Otolaryngology (2022)
-
The Intelligibility of Time-Compressed Speech Is Correlated with the Ability to Listen in Modulated Noise
Journal of the Association for Research in Otolaryngology (2022)
-
Non-invasive suppression of essential tremor via phase-locked disruption of its temporal coherence
Nature Communications (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.