Speech enhancement based on statistical modeling of teager energy operated perceptual wavelet packet coefficients and adaptive thresholding function

BUET ILS
BUET Institutional Repository: Home
→
Dissertations/Theses
→
Dissertations/Theses - Department of Electrical and Electronic Engineering
→
View Item

dc.contributor.advisor	Shahnaz, Dr. Celia
dc.contributor.author	Tauhidul Islam, Md.
dc.date.accessioned	2016-11-27T05:45:25Z
dc.date.available	2016-11-27T05:45:25Z
dc.date.issued	2014-07
dc.identifier.uri	http://lib.buet.ac.bd:8080/xmlui/handle/123456789/4067
dc.description.abstract	In order to handle the practical situations of real-life applications, a speech enhance- ment method is needed to be capable of producing optimum results with improved overall speech quality with maximized intelligibility particularly under low levels of SNRs. For solving this open problem, this thesis presents a speech enhancement approach, where an adaptive threshold is statistically determined using the Tea- ger energy (TE) operated perceptual wavelet packet (PWP) coefficients of noisy speech. A frame of noisy speech signal is analyzed first in PWP transform domain to obtain a set of PWP coefficients. TE operation is performed on the PWP coeffi- cients to increase the separability between clean speech and noise coefficients. The TE operated PWP coefficients with better time and frequency resolution are then used to determine an appropriate adaptive threshold based on different statistical models, namely Gaussian, Laplace, Rayleigh, Poisson and Student t distributions. The threshold thus obtained is applied upon the PWP coefficients by employing a custom thresholding function, which is designed based on the presence of noise in the noisy speech signal. A couple of custom thresholding functions designed in this thesis can be viewed as a linear combination of the modified hard or μ-law thresh- olding function and the semisoft thresholding function. The enhanced speech frame is synthesized by performing the inverse PWP transform on the thresholded PWP coefficients obtained using the statistically determined threshold and the designed custom thresholding function. The final enhanced speech signal is reconstructed by using the standard overlap-and-add method. Extensive Simulations using NOIZEUS database are carried out considering the presence of car and multi-talker babble noises to evaluate the performance of the proposed method in terms of standard ob- jective metrics and subjective listening tests. It is shown that the proposed method outperforms the reported state-of the-art methods with superior efficacy at high as well as low levels of SNRs.	en_US
dc.language.iso	en	en_US
dc.publisher	Department of Electrical and Electronic Engineering (EEE)	en_US
dc.subject	Speech processing systems	en_US
dc.title	Speech enhancement based on statistical modeling of teager energy operated perceptual wavelet packet coefficients and adaptive thresholding function	en_US
dc.type	Thesis-MSc	en_US
dc.contributor.id	0411062259	en_US
dc.identifier.accessionNumber	114113
dc.contributor.callno	623.822/TAU/2014	en_US