Speech Enhancement Based on the Response Features of Facilitated EI Neurons

Cavalcante, André B.; Mandic, Danilo P.; Rutkowski, Tomasz M.; Barros, Allan Kardec

doi:10.1007/11679363_73

André B. Cavalcante²⁰,
Danilo P. Mandic²¹,
Tomasz M. Rutkowski²² &
…
Allan Kardec Barros²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3889))

Included in the following conference series:

International Conference on Independent Component Analysis and Signal Separation

2974 Accesses

Abstract

A real-time approach for the enhancement of speech at zero degree azimuth is proposed. This is achieved inspired by the response features of the “Facilitated EI neurons”. This way, frequency segregation through a bandpass filter bank is followed by “supression analysis” which inhibits sources that are not at “facilitated” positions. Unlike with the existing approaches for the solution of cocktail party problem, where the performance under low SNR (signal-to-noise ratio) reverberation conditions is severely limited, the proposed approach has the capability to circumvent these problems. This is quantified through both objective and subjective performance measures and supported by real world simulation examples.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cherry, E.C.: Some experiments on the recognition of speech, with one and two ears. Journal of the Acoustic Society of America 25, 975–979 (1953)
Article Google Scholar
Hyvarinen, A., Oja, E.: A fast fixed-point algorithm for independent component analysis. Neural Computation 9, 1483–1492 (1997)
Article Google Scholar
de Cheveigne, A.: The auditory system as a separation machine. In: Proceedings of International Symposium of Hearing (2000)
Google Scholar
Barros, A.K., Ohnishi, N.: Single channel speech enhancement by efficient coding. Signal Processing 85, 1805–1812 (2005)
Article MATH Google Scholar
Lewicki, M.: Efficient coding of natural sounds. Nature Neuroscience 5(4), 356–363 (2002)
Article Google Scholar
Roman, N., Wang, D., Brown, G.: Speech segregation based on sound localization. Journal of Acoustical Society of America 114(1), 2236–2252 (2003)
Article Google Scholar
Virgag, N.: Single channel speech enhancement based on masking properties of the human auditory system. IEEE Transactions on Signal Processing 7, 126–137 (1999)
Google Scholar
Barros, A.K., Rutkowski, T.M., Itakura, F., Ohnishi, N.: Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets. IEEE Transactions on Neural Networks 13(4) (2002)
Google Scholar
Pollak, G., Burger, R., Park, T., Klug, A., Bauer, E.: Roles of inhibition for transforming binaural properties in the brainstem auditory system. Hearing Research 168(1–2), 60–78 (2002)
Article Google Scholar
Pollak, G., Burger, R., Klug, A.: Dissecting the circuitry of the auditory system. Trends in Neuroscience 26, 33–39 (2004)
Article Google Scholar
Grantham, G.: Discrimination of dynamic interaural intensity differences. Journal of Acoustical Society of America 76(1), 71–76 (1984)
Article Google Scholar
Allen, J., Berkley, D.: Image method for efficiently simulating small-room acoustics. Journal of Acoustical Society of America 65, 943–950 (1979)
Article Google Scholar
Hansen, J., Pellom, B.L.: An effective qualiy evaluation protocol for speech enhancement algorithms. In: Proceedings of ICSLP 1998, vol. 7, pp. 2819–2822 (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Laboratory for Biological Information Processing, Universidade Federal do Maranhāo, Brazil
André B. Cavalcante & Allan Kardec Barros
Department of Electrical and Electronic Engineering, Imperial College London, United Kingdom
Danilo P. Mandic
Laboratory for Advanced Brain Signal Processing, Brain Science Institute Riken, Japan
Tomasz M. Rutkowski

Authors

André B. Cavalcante
View author publications
You can also search for this author in PubMed Google Scholar
Danilo P. Mandic
View author publications
You can also search for this author in PubMed Google Scholar
Tomasz M. Rutkowski
View author publications
You can also search for this author in PubMed Google Scholar
Allan Kardec Barros
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Siemens Corporate Research, 755 College Road East, 08540, Princeton, NJ, USA
Justinian Rosca
Department of CSEE, Oregon Health and Science University, Portland, Oregon, USA
Deniz Erdogmus
Dep. of Electrical and Computer Engineering, University of Florida, Gainesville, Florida, USA
José C. Príncipe
McMaster University, 1280 Main Street West, L8S 4K1, Hamilton, Ontario, Canada
Simon Haykin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cavalcante, A.B., Mandic, D.P., Rutkowski, T.M., Barros, A.K. (2006). Speech Enhancement Based on the Response Features of Facilitated EI Neurons. In: Rosca, J., Erdogmus, D., Príncipe, J.C., Haykin, S. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2006. Lecture Notes in Computer Science, vol 3889. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11679363_73

Download citation

DOI: https://doi.org/10.1007/11679363_73
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32630-4
Online ISBN: 978-3-540-32631-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics