String regularities and degenerate strings

BUET ILS
BUET Institutional Repository: Home
→
Dissertations/Theses
→
Dissertations/Theses - Department of Computer Science and Engineering
→
View Item

dc.contributor.advisor	Rahman, Dr. M. Sohel
dc.contributor.author	Faizul Bari, Md.
dc.date.accessioned	2016-03-13T04:12:21Z
dc.date.available	2016-03-13T04:12:21Z
dc.date.issued	2009-12
dc.identifier.uri	http://lib.buet.ac.bd:8080/xmlui/handle/123456789/2547
dc.description.abstract	Finding patterns and characterizing regularities in strings are in the heart of research in the field of stringology. These problems are interesting as fundamental computer science problems and often turn out to be important and useful in many applications. In case of string regularities periods, repeats, and covers are the most common forms and there are several efficient algorithms for computing regularities in a string. In this thesis, we focus on a generalization of the regular string commonly known as degenerate strings. A degenerate string can be represented as a sequence T T[lJT[2] ... T[nj, where T[iJ <;; E for each i, and E is a given alphabet. It is an extended notion of the regular string which is very much applicable especially in the context of computational biology. Pattern matching problem for degenerate string has also been investigated in the literature albeit with little success. The lack of any efficient algorithms for degenerate pattern matching may be attributed at least partially to the lack of fruitful research on regularities of degenerate strings. Very recently, the issue of regularities in degenerate string has received some attention. In this thesis,we study the problem of string regularities for degenerate strings. We present two novel and efficient algorithms for computing cover array for degenerate strings. Both of our algorithms run in O(n2) time and space. Using string combinatorics and probability analysis we have further proved that expected number of both borders and covers for a degenerate string is bounded by a constant. By using this probabilistic analysis, we have also shown that the average running time and space complexity of our algorithms are O(n).	en_US
dc.language.iso	en	en_US
dc.publisher	Department of Computer Science and Engineering, BUET	en_US
dc.subject	Computer algorithms	en_US
dc.title	String regularities and degenerate strings	en_US
dc.type	Thesis-MSc	en_US
dc.contributor.id	100705050 P	en_US
dc.identifier.accessionNumber	107481
dc.contributor.callno	005.1/FAI/2009	en_US