Low Delay Noise Reduction and Dereverberation for Hearing Aids (pdf)

Article PDF cannot be displayed. You can download it here:

https://link.springer.com/content/pdf/10.1155%2F2009%2F437807.pdf

Low Delay Noise Reduction and Dereverberation for Hearing Aids

Hindawi Publishing Corporation EURASIP Journal on Advances in Signal Processing Volume 2009, Article ID 437807, 9 pages doi:10.1155/2009/437807 Research Article Low Delay Noise Reduction and Dereverberation for Hearing Aids Heinrich W. Löllmann (EURASIP Member) and Peter Vary Institute of Communication Systems and Data Processing, RWTH Aachen University, 52056 Aachen, Germany Correspondence should be addressed to Heinrich W. Löllmann, Received 11 December 2008; Accepted 16 March 2009 Recommended by Heinz G. Goeckler A new system for single-channel speech enhancement is proposed which achieves a joint suppression of late reverberant speech and background noise with a low signal delay and low computational complexity. It is based on a generalized spectral subtraction rule which depends on the variances of the late reverberant speech and background noise. The calculation of the spectral variances of the late reverberant speech requires an estimate of the reverberation time (RT) which is accomplished by a maximum likelihood (ML) approach. The enhancement with this blind RT estimation achieves almost the same speech quality as by using the actual RT. In comparison to commonly used post-filters in hearing aids which only perform a noise reduction, a significantly better objective and subjective speech quality is achieved. The proposed system performs time-domain filtering with coeﬃcients adapted in the non-uniform (Bark-scaled) frequency-domain. This allows to achieve a high speech quality with low signal delay which is important for speech enhancement in hearing aids or related applications such as hands-free communication systems. Copyright © 2009 H. W. Löllmann and P. Vary. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. 1. Introduction Algorithms for the enhancement of acoustically disturbed speech signals have been the subject of intensive research over the last decades, cf., [1–3]. The wide-spread use of mobile communication devices and, not at least, the introduction of digital hearing aids have contributed significantly to the interest in this field. For hearing impaired people, it is especially diﬃcult to communicate with other persons in noisy environments. Therefore, speech enhancement systems have become an integral component of modern hearing aids. However, despite significant progress, the development of speech enhancement systems for hearing aids is still a very challenging problem due to the demanding requirements regarding computational complexity, signal delay and speech quality. A common approach is to use a beamformer with two or three closely spaced microphones followed by a post-filter, e.g., [4, 5]. An adaptive beamformer is often used, implemented by first- or second- order diﬀerential microphone arrays or a generalized sidelobe canceller (GSC), respectively, e.g., [5]. Due to the use of small microphone arrays, only a limited noise suppression can be achieved by this, especially for diﬀuse noise fields. Therefore, the output signal of the beamformer is further processed by a (Wiener) post-filter to achieve an improved noise suppression, e.g., [4–7]. A related approach is to use an extension of the GSC structure termed as speech distortion weighted multichannel Wiener filter [8, 9]. This approach allows to balance the tradeoﬀ between speech distortions and noise reduction and is more robust towards reverberation than a common GSC. So far, such systems achieve only a very limited suppression of speech distortions due to room reverberation. Such impairments are caused by the multiple reflections and diﬀraction of the sound on walls and objects of a room. These multiple echoes add to the direct sound at the receiver and blur its temporal and spectral characteristics. As a consequence, reverberation and background noise reduce listening comfort and speech intelligibility, especially for hearing impaired persons [10, 11]. Therefore, algorithms for a joint suppression of background noise and reverberation eﬀects are of special interest for speech enhancement in hearing instruments. However, many proposals are less suitable for this application. For example, dereverberation algorithms based on linear prediction such as [12] achieve mainly a reduction of early reflections and do not consider additive noise, 2 EURASIP Journal on Advances in Signal Processing while algorithms based on a time-averaging [13] exhibit a high signal delay. Coherence-based speech enhancement algorithms such as [14] or [15] can suppress background noise and reverberation, but they are rather ineﬀective if only two closely spaced microphones can be used. This problem can be alleviated to some extend by a noise classification and binaural processing [16] which, however, requires two hearing aid devices connected by a wireless data link. A single-channel algorithm for speech dereverberation and noise reduction has been proposed recently in [17]. However, this algorithm is less suitable for hearing aids due to its high computational complexity and signal delay as well as its strong speech distortions. A more powerful approach for noise reduction and dereverberation is to use blind source separation (BSS), e.g., [18]. Such algorithms do not require a priori knowledge about the microphone positions or source locations. However, they depend on a full data link between the hearing aid devices and possess a high computational complexity. Therefore, further work remains to be done to integrate such algorithms into common hearing instruments [19]. In this contribution, a single-channel speech enhancement algorithm is proposed, which is more suitable for current hearing aid devices. It performs a suppression of background noise and late reverberant speech by means of a generalized spectral subtraction. The devised (post-)filter exhibits a low signal delay, which is important in hearing aids, e.g., to avoid comb filter eﬀects. The calculation of the late reverberant speech energy requires (only) an estimate of the reverberation time (RT), which is accomplished by a maximum likelihood (ML) approach. Thus, no explicit speech modeling is involved in the dereverberation process as, e.g., in [20] such that an estimation of speech model parameters is not needed here. The paper is organized as follows. In Section 2, the underlying signal model is introduced. The overall system for low delay speech enhancement is outlined in Section 3. The calculation of the spectral weights for noise reduction and dereverberation is treated in Section 4. An important issue is the determination of the spectral variances of the late reverberant speech, which in turn is based on an estimation of the RT. These issues are treated in Sections 4.2 and 4.3. The performance of the new system is analyzed in Section 5, and the main results are summarized i (...truncated)