DSpace Repository

Bangla voice command recognition with context specific optimization

Show simple item record

dc.contributor.advisor Adnan, Dr. Muhammad Abdullah
dc.contributor.author Nafis Sadeq
dc.date.accessioned 2021-08-17T10:26:05Z
dc.date.available 2021-08-17T10:26:05Z
dc.date.issued 2021-02-27
dc.identifier.uri http://lib.buet.ac.bd:8080/xmlui/handle/123456789/5761
dc.description.abstract Voice command recognition task commonly involves an Automatic Speech Recognition (ASR) system with context-specific optimization. Automatic Speech Recognition system development involves corpus resource development such as phoneme list, text corpus, word dictionary, phonetic dictionary, and speech corpus. These corpus resources are used to train speech recognition models. The performance of the speech recognition systems can be further improved by exploiting user and device-specific contexts. Context information for a specific smartphone user includes contact names, installed apps, songs, media files, location, recent search history, the content of the screen user is looking at, etc. The context information changes frequently so it is desired that the contextual model will be updated on-the-fly within the device. Traditional speech recognition systems usually consist of several individual components such as an acoustic model, a language model, a pronunciation dictionary, etc. So context-specific optimization can be achieved by tuning a particular component like the language model. Recently, end-to-end speech recognition architectures have been very effective in many speech recognition tasks. Incorporating context-specific optimization with the latest end-to-end speech recognition architectures requires a different approach. In this work, we focus on Bangla voice command recognition. We develop an ASR system for voice command recognition tasks and improve the performance further using context-specific optimization. In our work, we develop each linguistic resource in a way that considers language-specific characteristics of Bangla. We enrich our speech corpus with both domain-specific and domain-independent speech data. We also experiment with traditional and end-to-end speech recognition architectures. We propose a novel approach for context-specific optimization of voice commands. We also explore several other approaches for improving ASR performance such as synthetic speech corpus development and semi-supervised speech recognition. en_US
dc.language.iso en en_US
dc.publisher Department of Computer Science and Engineering(CSE), BUET en_US
dc.subject Automatic speech recognition en_US
dc.title Bangla voice command recognition with context specific optimization en_US
dc.type Thesis-MSc en_US
dc.contributor.id 1018052033 en_US
dc.identifier.accessionNumber 117756
dc.contributor.callno 006.454/NAF/2021 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search BUET IR


Advanced Search

Browse

My Account