Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/9628
Title: A Comparative Study on Document Categorization Using Apriori Algorithm and Naive Bayse Algorithm
Authors: Maharjan, Sudan
Keywords: Supervised machine learning;Apriori algorithm;Naive bayse algorithm;Automatic document classification
Issue Date: 2018
Publisher: Department of Computer Science and Information Technology
Institute Name: Central Department of Computer Science and Information Technology
Level: Masters
Abstract: Automatic document classification is the process for classifying electronic text document into specific category based on its contents. This dissertation work is about document classification and this dissertation will help in arranging electronic documents automatically. Document classification has many applications in computer science, information science, newspaper classification, library science etc. Document classification can be used in spam filtering, news article classification, pornography classification, indexing of documents, routing of emails etc. The problem of automated document classification can be solved in supervised, unsupervised or semi-supervised machine learning technique. This dissertation work is based on both unsupervised and supervised machine learning technique where Apriori Algorithm is related to unsupervised machine learning and the Navie Bayes Classifier itself is supervised machine learning. The overall work of training and testing is based on three different classes of documents: Graphics, Guns and Sports. The system performance is measured on the basis of accuracy and F1 measure where Apriori Algorithm performed better than Naïve Bayes.
URI: https://elibrary.tucl.edu.np/handle/123456789/9628
Appears in Collections:Computer Science & Information Technology

Files in This Item:
File Description SizeFormat 
final thesis Sudan.1-14.pdf1.25 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.