| dc.contributor.advisor | Latiful Hoque, Dr. Abu Sayed Md. | |
| dc.contributor.author | Masumuzzaman Bhuiyan, Mohammad | |
| dc.date.accessioned | 2016-03-13T09:58:59Z | |
| dc.date.available | 2016-03-13T09:58:59Z | |
| dc.date.issued | 2007-09-15 | |
| dc.identifier.uri | http://lib.buet.ac.bd:8080/xmlui/handle/123456789/2568 | |
| dc.description.abstract | Loss-less data compression is potentially attractive in database application for storage reduction and performance improvement. The existing compression architectures work well for small memory resident database. Some other techniques use disk-based compression and therefore, can suppo11 large database. But all these systems can execute a limited number of queries. Moreover, they cannot perform queries based on multiple tables. We have developed a disk based compression architecture that uses dictionary based compression. Each column is stored separately in compressed form. String data are compressed and numeric data mayor may not be compressed based on the discretion of the database designer. We have compared our system with widely used Microsoft SQL -Server. The experimental result shows that the proposed system requires 10 to 20 times less space. As the system is column oriented, schema evolution is easy. We have also defined a number of query operators on compressed database. We have implemented natural join, selection with range predicate, set operations and all aggregation functions. These complex queries have not been explored in existing compression based systems. Other than selection queries our system outperforms Microsoft SQL Server with respect to query time. The performance of selection queries could be improved by introducing indices. Also the system is appropriate for parallel computation by distributing the compressed columns to separate processors. | en_US |
| dc.language.iso | en | en_US |
| dc.publisher | Department of Computer Science and Engineering, BUET | en_US |
| dc.subject | Data base design | en_US |
| dc.title | High performance queries on multiple tables for compressed form of relational database | en_US |
| dc.type | Thesis-MSc | en_US |
| dc.contributor.id | 040405004 F | en_US |
| dc.identifier.accessionNumber | 104365 | |
| dc.contributor.callno | 005.74/MAS/2007 | en_US |