DSpace Repository

Pitch extraction of noisy speech using dominant frequency of the harmonic speech model

Show simple item record

dc.contributor.advisor Kamrul Hasan, Dr. Md.
dc.contributor.author Celia Shahnaz
dc.date.accessioned 2015-10-14T06:25:47Z
dc.date.available 2015-10-14T06:25:47Z
dc.date.issued 2002-12
dc.identifier.uri http://lib.buet.ac.bd:8080/xmlui/handle/123456789/1010
dc.description.abstract This thesis work is concerned with the development of a new method for pitch extraction from the observations of speech heavily degraded by white noise. The extraction of pitch at a very low SNR is still a challenging problem for researchers and no effective method has yet been reported. To estimate pitch all the existing autocorrelation based techniques, invariably use the location of the second largest peak at the speech signal periodicity relative to the strongest peak at the true origin of the autocorrelation function. At a very low SNR, this approaches are likely to give gross pitch error due to the presence of spurious peaks obscuring the pitch peak. Thus the conventional methods fail to estimate pitch below a certain positive value of SNR with acceptable accuracy. In this research work, unlike conventional approaches, we propose a noise robust method for pitch estimation using the dominant harmonic (DH) in the harmonic sinusoidal speech model. To estimate the DH accurately under strong noisy condition, a cosine autocorrelation model (CAM) of the noise-free speech signal is proposed. Moreover, a sinusoidal autocorrelation model (SAM) of the clean-speech using an approximate all-pole speech model is derived for more accurate estimation of the DH. The harmonic number of the DH required to estimate pitch is then determined based on function optimization. This utilizes the inner product of a variable period short-length impulse train with the half-wave rectified weighted autocorrelation function of the pre-filtered noisy speech (PFNS) signal. As the DH is an integer multiple of pitch, the period of the impulse train is tuned as a function of the DH. Numerical analysis reveals that the traditional idea of "reliable pitch estimation depending solely on the correct determination of the second largest peak of the autocorrelation function" can be relaxed. In this work pitch estimation accuracy is highly dependent on the accuracy of estimation of the DH, a task which seems to be easier than the conventional methods. The simulation results show that the novel method presented in this work can estimate pitch with higher accuracy even at an SNR as low as -5 dB as compared to other recently reported results. en_US
dc.language.iso en en_US
dc.publisher Department of Electrical and Electronic Engineering en_US
dc.subject Pitch extraction of noisy speech en_US
dc.subject Frequency of the harmonic speech model en_US
dc.title Pitch extraction of noisy speech using dominant frequency of the harmonic speech model en_US
dc.type Thesis-MSc en_US
dc.contributor.id 100006201 P en_US
dc.identifier.accessionNumber 97098
dc.contributor.callno 623.8043/CEL/2002 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search BUET IR


Advanced Search

Browse

My Account