Multiple Sound Source Localisation in Reverberant Environments Inspired by the Auditory Midbrain

Liu, Jindong; Perez-Gonzalez, David; Rees, Adrian; Erwin, Harry; Wermter, Stefan

doi:10.1007/978-3-642-04274-4_22

Multiple Sound Source Localisation in Reverberant Environments Inspired by the Auditory Midbrain

Jindong Liu¹⁸,
David Perez-Gonzalez¹⁹,
Adrian Rees¹⁹,
Harry Erwin¹⁸ &
…
Stefan Wermter¹⁸

Conference paper

2022 Accesses
1 Citations
3 Altmetric

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5768))

Abstract

This paper proposes a spiking neural network (SNN) of the mammalian auditory midbrain to achieve binaural multiple sound source localisation. The network is inspired by neurophysiological studies on the organisation of binaural processing in the medial superior olive (MSO), lateral superior olive (LSO) and the inferior colliculus (IC) to achieve a sharp azimuthal localisation of sound sources over a wide frequency range in a reverberant environment. Three groups of artificial neurons are constructed to represent the neurons in the MSO, LSO and IC that are sensitive to interaural time difference (ITD), interaural level difference (ILD) and azimuth angle respectively. The ITD and ILD cues are combined in the IC to estimate the azimuth direction of a sound source. To deal with echo, we propose an inter-inhibited onset network in the IC, which can extract the azimuth information from the direct path sound and avoid the effects of reverberation. Experiments show that the proposed onset cell network can localise two sound sources efficiently taking into account the room reverberation.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bronkhorst, A., Plomp, R.: Effect of multiple speechlike maskers on binaural speech recognition in normal and impaired hearing. The Journal of the Acoustical Society of America 92, 3132–3139 (1992)
Article Google Scholar
Blauert, J.: Spatial Hearing: The Psychophysics of Human Sound Localization. MIT Press, Cambridge (1997)
Google Scholar
Jeffress, L.: A place theory of sound localization. J. Comp. Physiol. Psychol. 41, 35–39 (1948)
Article Google Scholar
Moore, B.: An Introduction to the Psychology of Hearing. Academic Press, San Diego (2003)
Google Scholar
Yin, T.: Neural mechanisms of encoding binaural localization cues in the auditory brainstem. Integrative Functions in the Mammalian Auditory Pathway I, 99–159 (2002)
Article Google Scholar
Willert, V., Eggert, J., Adamy, J., Stahl, R., Koerner, E.: A probabilistic model for binaural sound localization. IEEE Trans. Syst. Man Cybern. Part B Cybern. 36(5), 982–994 (2006)
Article Google Scholar
Voutsas, K., Adamy, J.: A biologically inspired spiking neural network for sound source lateralization. IEEE Trans. Neural Networks 18(6), 1785–1799 (2007)
Article Google Scholar
Litovsky, R., Colburn, H., Yost, W., Guzman, S.: The precedence effect. Journal of the Acoustical Society of America 106(4 I), 1633–1654 (1999)
Article Google Scholar
Palomäki, K., Brown, G., Wang, D.: A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation. Speech Communication 43(4), 361–378 (2004)
Article Google Scholar
Sivaramakrishnan, S., Oliver, D.: Distinct K Currents Result in Physiologically Distinct Cell Types in the Inferior Colliculus of the Rat. Journal of Neuroscience 21(8), 2861 (2001)
Google Scholar
Slaney, M.: An efficient implementation of the patterson-holdsworth auditory filter bank. Apple Computer Technical Report 35 (1993)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computing and Technology, University of Sunderland, Sunderland, SR6 0DD, United Kingdom
Jindong Liu, Harry Erwin & Stefan Wermter
Institute of Neuroscience, The Medical School, Newcastle University, NE2 4HH, United Kingdom
David Perez-Gonzalez & Adrian Rees

Authors

Jindong Liu
View author publications
You can also search for this author in PubMed Google Scholar
David Perez-Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Adrian Rees
View author publications
You can also search for this author in PubMed Google Scholar
Harry Erwin
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Wermter
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Elettronica, Politecnico di Milano, Piazza L. da Vinci 32, 20133, Milano, Italy
Cesare Alippi
Department of Electrical and Computer Engineering, University of Cyprus, 75 Kallipoleos Street, 1678, Nicosia, Cyprus
Marios Polycarpou , Christos Panayiotou & Georgios Ellinas , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, J., Perez-Gonzalez, D., Rees, A., Erwin, H., Wermter, S. (2009). Multiple Sound Source Localisation in Reverberant Environments Inspired by the Auditory Midbrain. In: Alippi, C., Polycarpou, M., Panayiotou, C., Ellinas, G. (eds) Artificial Neural Networks – ICANN 2009. ICANN 2009. Lecture Notes in Computer Science, vol 5768. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04274-4_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-04274-4_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04273-7
Online ISBN: 978-3-642-04274-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics