DSpace Repository

Design of an enhanced speech authentication system over mobile devices

Show simple item record

dc.contributor.advisor Ali, Dr. Md. Liakot
dc.contributor.author Anwar, Moshin Uddin
dc.date.accessioned 2019-03-27T06:12:55Z
dc.date.available 2019-03-27T06:12:55Z
dc.date.issued 2018-04-30
dc.identifier.uri http://lib.buet.ac.bd:8080/xmlui/handle/123456789/5152
dc.description.abstract Increasing efficiency in biometric authentication via speech recognition and identification and its use in mobile devices has been one of the most invested researches worldwide in computing industry. Since the very initial state of using speech recognition algorithms towards very recent time even in the current year 2018, different strategies and combinations have been used to optimize the result in order to surpass human recognition capacity and achieve even more! For long many years, ¬various speech signal processing techniques have been experimented and optimized using expectation maximization or gradient descent optimization or their variations across end-to-end speech feature extraction and recognition scheme, but the result was below the satisfactory limit despite multitude of time, cost and effort have been invested. Very recently, huge improvement of computing power of devices, made it possible to use complex multi-layered neural network technologies (i.e., deep learning or deep neural network) such as convolutional net, long short term memory, bidirectional recurrent neural network as well as complex statistical or evolutionary strategies and its variations to optimize further the results reducing the error rates. To this end, using series of combination of various deep learning algorithms across end-to-end speech features and language modelling it has been possible by some big companies and join venture investments to attain a somewhat notable achievement: that the experiment just surpassed the human efficiency. But, still we have been far way behind the recognition efficiency to be more promising, to identify a practically useful and achievable optimal solution which can equally perform in noisy environments and mutations of speech features. This thesis work has emphasized mostly on how to devise an efficient technique that would reduce the time, cost and complexity of such huge efforts so far done so that future improvements can be made on this optimum path. To this end, it has been identified that text independent speech recognition can be efficiently trained, if deep learning technology with the guidance of genetic algorithm (GA) through intelligently choosing hyper-parameters of the networks can be adopted. It has been experimented that series of iterations to estimate and re-estimate the hyper-parameters can lead to a better and optimal solution with extremely less time and cost. It can be calculated that the runtime is O(No. of generations) instead of O(variations ^ network parameters), to save time extremely compared to legacy processes of selecting series of deep learning networks. As a way forward, we have suggested more automated parameter fixing followed by automated iterations can be a future attempt for such implementation. en_US
dc.language.iso en en_US
dc.publisher Institute of Information and Communication Technology en_US
dc.subject Speech recognition en_US
dc.title Design of an enhanced speech authentication system over mobile devices en_US
dc.type Thesis-MSc en_US
dc.contributor.id 0411312028P en_US
dc.identifier.accessionNumber 116948
dc.contributor.callno 006.454/MOH/2018 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search BUET IR


Advanced Search

Browse

My Account