Method of detection and classification of nasalized vowels based on acoustic features derived from both magnitude and phase spectra

BUET ILS
BUET Institutional Repository: Home
→
Dissertations/Theses
→
Dissertations/Theses - Department of Electrical and Electronic Engineering
→
View Item

dc.contributor.advisor	Shahnaz, Dr. Celia
dc.contributor.author	Shamima Najnin
dc.date.accessioned	2016-08-06T04:29:58Z
dc.date.available	2016-08-06T04:29:58Z
dc.date.issued	2012-07
dc.identifier.uri	http://lib.buet.ac.bd:8080/xmlui/handle/123456789/3595
dc.description.abstract	The performance of the detection and classi cation of nasalized vowels from the mixture of oral and nasalized vowels largely depend on the acoustic features by which the task is performed. In this thesis, sets of acoustic features are derived from both tha magnitude and phase spectra of the vowels to evaluate the performance. It is shown through detail acoustic analysis on nasalized vowels that, although ad- ditional formants at various frequency locations are introduced, a new resonance in the low frequency region around 250 Hz is introduced and found to remain consistent irrespective of male or female speakers in the modi ed group delay function. By ex- ploiting this fact, which is veri ed on the band limited modi ed group delay function capable of resolving two closely spaced formants occuring in case of nasalized vowels, an acoustic parameter RMGD is derived. Utilizing RMGD, the idea of detecting nasalized vowels and determining the degree of nasality with respect to the adjacent nasal consonants of the vowel is evolved. It is argued that vowel can be nasalized with at least one adjacent nasal consonant even if the nasal consonant is pre-vocalic and the vowel is more nasalized in pre-nasal position than in post-nasal position. It is also found that vowel with nasal consonants on both side do not guarantee to be more nasalized vowel compared to the vowel with one adjacent nasal conso- nant. By utilizing the fact of changing nasality with the number of adjacent nasal consonant, the detection and classi cation of non-nasalized and contextually nasal- ized vowels is formulated as a four class problem and solved based on a threshold or classi er based scheme and found superior in detecting and classifying nasalized vowels than some of the existing methods. Mel-ferequency Cepstral coe cients are widely used features for the detection task. Conventionally, features for detecting and classifying nasalized vowels are derived considering magnitude spectrum only, ignoring the phase spectrum. Exploiting the power spectrum and the group delay function of a band limited vowel, the product spectrum is de ned thus incorporat- ing the information of both magnitude and phase spectra. The product spectrum is then di erentiated with respect to frequency to obtain di erential product spectrum that is argued to provide more noise robustness in the presence of noise. Assuming the noise reduction capability of the autocorrelation sequence power and product spectrum of the band limited autocorrelation sequence of the vowel are developed. Simulation results show that Mel-frequency cepstral coe cients derived from the product spectrum, di erential product spectrum, power and product spectrum of the autocorrelation sequence consitently outperforms the some of the convetional approaches in the task of detecting and classifying nasalized vowels in both clean and di erent noisy conditions.	en_US
dc.language.iso	en	en_US
dc.publisher	Department of Electrical and Electronic Engineering (EEE)	en_US
dc.subject	Speech recognition	en_US
dc.title	Method of detection and classification of nasalized vowels based on acoustic features derived from both magnitude and phase spectra	en_US
dc.type	Thesis-MSc	en_US
dc.contributor.id	1009062020 F	en_US
dc.identifier.accessionNumber	111201
dc.contributor.callno	006.454/SHA/2012	en_US