Abstract
A real-time approach for the enhancement of speech at zero degree azimuth is proposed. This is achieved inspired by the response features of the “Facilitated EI neurons”. This way, frequency segregation through a bandpass filter bank is followed by “supression analysis” which inhibits sources that are not at “facilitated” positions. Unlike with the existing approaches for the solution of cocktail party problem, where the performance under low SNR (signal-to-noise ratio) reverberation conditions is severely limited, the proposed approach has the capability to circumvent these problems. This is quantified through both objective and subjective performance measures and supported by real world simulation examples.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cherry, E.C.: Some experiments on the recognition of speech, with one and two ears. Journal of the Acoustic Society of America 25, 975–979 (1953)
Hyvarinen, A., Oja, E.: A fast fixed-point algorithm for independent component analysis. Neural Computation 9, 1483–1492 (1997)
de Cheveigne, A.: The auditory system as a separation machine. In: Proceedings of International Symposium of Hearing (2000)
Barros, A.K., Ohnishi, N.: Single channel speech enhancement by efficient coding. Signal Processing 85, 1805–1812 (2005)
Lewicki, M.: Efficient coding of natural sounds. Nature Neuroscience 5(4), 356–363 (2002)
Roman, N., Wang, D., Brown, G.: Speech segregation based on sound localization. Journal of Acoustical Society of America 114(1), 2236–2252 (2003)
Virgag, N.: Single channel speech enhancement based on masking properties of the human auditory system. IEEE Transactions on Signal Processing 7, 126–137 (1999)
Barros, A.K., Rutkowski, T.M., Itakura, F., Ohnishi, N.: Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets. IEEE Transactions on Neural Networks 13(4) (2002)
Pollak, G., Burger, R., Park, T., Klug, A., Bauer, E.: Roles of inhibition for transforming binaural properties in the brainstem auditory system. Hearing Research 168(1–2), 60–78 (2002)
Pollak, G., Burger, R., Klug, A.: Dissecting the circuitry of the auditory system. Trends in Neuroscience 26, 33–39 (2004)
Grantham, G.: Discrimination of dynamic interaural intensity differences. Journal of Acoustical Society of America 76(1), 71–76 (1984)
Allen, J., Berkley, D.: Image method for efficiently simulating small-room acoustics. Journal of Acoustical Society of America 65, 943–950 (1979)
Hansen, J., Pellom, B.L.: An effective qualiy evaluation protocol for speech enhancement algorithms. In: Proceedings of ICSLP 1998, vol. 7, pp. 2819–2822 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cavalcante, A.B., Mandic, D.P., Rutkowski, T.M., Barros, A.K. (2006). Speech Enhancement Based on the Response Features of Facilitated EI Neurons. In: Rosca, J., Erdogmus, D., Príncipe, J.C., Haykin, S. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2006. Lecture Notes in Computer Science, vol 3889. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11679363_73
Download citation
DOI: https://doi.org/10.1007/11679363_73
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32630-4
Online ISBN: 978-3-540-32631-1
eBook Packages: Computer ScienceComputer Science (R0)