| dc.contributor.advisor | Latiful Hoque, Dr. Abu Sayed Md. | |
| dc.contributor.author | Abdur Rouf, Mohammad | |
| dc.date.accessioned | 2015-11-28T05:53:14Z | |
| dc.date.available | 2015-11-28T05:53:14Z | |
| dc.date.issued | 2006-04 | |
| dc.identifier.uri | http://lib.buet.ac.bd:8080/xmlui/handle/123456789/1404 | |
| dc.description.abstract | The emergence of e-application has been creating extremely high volume of data that reaches to terabyte threshold. Many'organizations are producing data that are doubling every year. The conventional data management system is costlier in terms of storage space and processing speed, and sometimes it is unable to handle such huge amount of data. New algorithms and techniques need to develop to store and manipulate these data. The database compression can be used for scalable storage and faster data access. We propose compression based data management system architecture that can be used to handle terabyte level of relational data. The existing compression schemes e.g. HIBASE or Three Layer Database Compression Architecture work well for memory resident data and provide good performance. These are low cost solution for highperformance data management system but are not scalable to manage terabyte level of data. We have developed a disk based columnar multi-block vector structure (CMBVS) that can be used to store relational data in a compressed representation with direct addressability. Parallel data access can be achieved by distributing the vector structure into multiple servers to improve the scalability. The lowest layer of the model is the block structure to store the compressed representation of data. The next higher level is the vector-structure that relates the block structure to an attribute of the relational data model. The structures are capable of carrying out query directly on the compressed form of data. This reduces query time drastically. We have compared our system with conventional relational DBMS. The experimental results show that our system is about twenty five times efficient in storage cost and twenty-seven to seventy-seven times faster in retrieval time performance than that of the conventional systems. | en_US |
| dc.language.iso | en | en_US |
| dc.publisher | Department of Computer Science and Engineering, BUET | en_US |
| dc.subject | Data management - Terabyte | en_US |
| dc.title | Scalable strorage in compressed representation for terabyte data management | en_US |
| dc.type | Thesis-MSc | en_US |
| dc.contributor.id | 100105021 P | en_US |
| dc.identifier.accessionNumber | 102870 | |
| dc.contributor.callno | 005.74/ABD/2006 | en_US |