DSpace Repository

Noisy speech enhancement in wavelet domain based on generative adversarial network

Show simple item record

dc.contributor.advisor Shahnaz, Dr. Celia
dc.contributor.author Tahseen Minhaz, Ahmed
dc.date.accessioned 2019-07-10T04:35:42Z
dc.date.available 2019-07-10T04:35:42Z
dc.date.issued 2019-01-02
dc.identifier.uri http://lib.buet.ac.bd:8080/xmlui/handle/123456789/5277
dc.description.abstract In this thesis, a speech enhancement method based on generative adversarial network in wavelet domain is presented. A deep neural network based generator model is designed to provide an estimate of the clean speech coefficients from the noisy speech coefficients. A discriminator network is also designed that assesses the outputs from the generator and provide feedback on how close this clean estimates are to the real data distribution. Generator learns from this feedback and updates the function to provide better estimates of the clean speech coefficients, which is used to produce an enhanced speech frame. The complete network is trained using speech signals from a publicly available dataset. The proposed method outperforms some recent methods of speech enhancement under different noisy conditions at different levels of SNR in terms of objective performance metrics, spectrogram analysis and subjective evaluation. en_US
dc.language.iso en en_US
dc.publisher Department of Electrical and Electronic Engineering (EEE), BUET en_US
dc.subject Speech processing systems en_US
dc.title Noisy speech enhancement in wavelet domain based on generative adversarial network en_US
dc.type Thesis-MSc en_US
dc.contributor.id 0416062234 en_US
dc.identifier.accessionNumber 117000
dc.contributor.callno 623.822/TAH/2019 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search BUET IR


Advanced Search

Browse

My Account