doi:10.1016/j.specom.2005.08.005
Copyright © 2005 Elsevier B.V. All rights reserved.
A noise-estimation algorithm for highly non-stationary environments
Sundarrajan Rangachari and Philipos C. Loizou
, 
Department of Electrical Engineering, University of Texas at Dallas, P.O. Box 830688, EC 33 Richardson, TX 75083-0688, USA
Received 17 December 2004;
revised 11 August 2005;
accepted 24 August 2005.
Available online 21 September 2005.
References and further reading may be available for this article. To view references and further reading you must
purchase this article.
Abstract
A noise-estimation algorithm is proposed for highly non-stationary noise environments. The noise estimate is updated by averaging the noisy speech power spectrum using time and frequency dependent smoothing factors, which are adjusted based on signal-presence probability in individual frequency bins. Signal presence is determined by computing the ratio of the noisy speech power spectrum to its local minimum, which is updated continuously by averaging past values of the noisy speech power spectra with a look-ahead factor. The local minimum estimation algorithm adapts very quickly to highly non-stationary noise environments. This was confirmed with formal listening tests which indicated that the proposed noise-estimation algorithm when integrated in speech enhancement was preferred over other noise-estimation algorithms.
Keywords: Speech enhancement; Noise estimation; Non-stationary noise
Fig. 1. Flow diagram of proposed noise-estimation algorithm.
Fig. 2. Plot of noisy speech power spectrum and local minimum using (3) for a speech degraded by babble noise at 5 dB SNR at frequency bin k = 5.
Fig. 3. Top panel: Plot of estimated speech-presence probability based on the ratio Sr(λ, k). Bottom panel: spectrogram of the clean signal.
Fig. 4. Plot of true noise spectrum and the estimated noise spectrum using our proposed method for a speech degraded by babble noise at 5 dB SNR and single frequency f = 250 Hz.
Fig. 5. Comparison between the noise spectrum (for f = 1 kHz) estimated using the proposed algorithm (thick line) and Martin’s (Martin, 2001) (dashed line) algorithm for a sentence corrupted by car noise (t < 1.8 s) followed by a sentence corrupted by multi-talker babble (t > 1.8 s).
Fig. 6. Top panel: Plot of true noise spectrum and estimated noise spectrum using the proposed method for a noisy speech signal (5 dB SNR) at f = 250 Hz. Bottom panel: Plot of true noise spectrum and estimated noise spectrum using (Doblinger, 1995). Arrows indicate regions where noise is overestimated.
Fig. 7. Comparison of estimated noise spectrum (f = 500 Hz) of proposed method (dashed line) with that of Hirsch and Ehrlicher (1995) (solid line) for a noisy speech of SNR 20 dB (t < 1.8 s) followed by a noisy speech of SNR 5 dB (t > 1.8 s).
Fig. 8. Spectrograms of speech enhanced using Martin’s (2001) noise-estimation method (panel c), Cohen’s method (Cohen (2003)) (panel d) and the proposed noise-estimation method (panel e). Spectrograms of the clean and noisy speech signals are given in panels (a) and (b) respectively. Arrows in panels (c) and (d) at t > 3.8 s show the presence of residual noise due partly to the inability of the noise-estimation algorithms to track the sudden appearance of high-frequency noise in the last sentence (sentence 3). In contrast, as shown in panel (e), the residual noise is greatly reduced with the proposed noise-estimation algorithm.
Table 1.
Percent preference for the proposed method compared to other methods for single and mixed type noise

Table 2.
The normalized mean squared error (MSE) between the estimated and true noise spectra for various methods

Table 3.
Objective evaluation and comparison of the proposed noise-estimation algorithm in terms of segmental SNR values (dB) and LLR values
