DSpace Repository

Development of a Bangla news classification system

Show simple item record

dc.contributor.advisor Islam, Dr. Md. Saiful
dc.contributor.author Salayhin, Md Sirajus
dc.date.accessioned 2019-11-12T04:27:01Z
dc.date.available 2019-11-12T04:27:01Z
dc.date.issued 2019-03-31
dc.identifier.uri http://lib.buet.ac.bd:8080/xmlui/handle/123456789/5371
dc.description.abstract On-line newspapers and digital editions of print newspapers has become more and more popular as technology continues to grow. As the number of popular news articles grows and people also have different interests they want to categorize news to read only their interested topics. Classification of on-line news in the past, has often been done manually. Text classification is a well-studied problem. Several methods have been proposed and many of them can be directly applied to news classification as long as there exists a good set of training documents for each predefined category. To develop a news classifier we can use either Collaborative filtering, Content-based filtering, Subscription-based personalization approach. Among the above three approaches, we have chosen to adopt Content filtering approach to support personalized news classification in Categorizer system. The main difficulty in using this approach is that the ve -Gram Based Text - Bangla text classification. We want to Compare different Classifier algorithm to know better classification result and finally build a web application where users can search and read category based Bangla news articles. In this project an application has been developed to compare different classification algorithms for Bangla news classifier and a web application for categories news. Data collection, analysis and model building part has been developed with Python and Python based machine learning library (Scikit Learn). For development the application I have used Python based web framework. After analysis different algorithms we found Naive Bayes has more accuracy then other machine learning algorithms. en_US
dc.language.iso en en_US
dc.publisher Institute of Information and Communication Technology en_US
dc.subject Web search engines en_US
dc.title Development of a Bangla news classification system en_US
dc.type Thesis - Post Graduate Diploma en_US
dc.contributor.id 1014311039 en_US
dc.identifier.accessionNumber 117240
dc.contributor.callno 025.04/SIR/2019 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search BUET IR


Advanced Search

Browse

My Account