A Comparative Study on Document Categorization Using Apriori Algorithm and Naive Bayse Algorithm

dc.contributor.authorMaharjan, Sudan
dc.date.accessioned2022-03-31T10:24:25Z
dc.date.available2022-03-31T10:24:25Z
dc.date.issued2018
dc.description.abstractAutomatic document classification is the process for classifying electronic text document into specific category based on its contents. This dissertation work is about document classification and this dissertation will help in arranging electronic documents automatically. Document classification has many applications in computer science, information science, newspaper classification, library science etc. Document classification can be used in spam filtering, news article classification, pornography classification, indexing of documents, routing of emails etc. The problem of automated document classification can be solved in supervised, unsupervised or semi-supervised machine learning technique. This dissertation work is based on both unsupervised and supervised machine learning technique where Apriori Algorithm is related to unsupervised machine learning and the Navie Bayes Classifier itself is supervised machine learning. The overall work of training and testing is based on three different classes of documents: Graphics, Guns and Sports. The system performance is measured on the basis of accuracy and F1 measure where Apriori Algorithm performed better than Naïve Bayes.en_US
dc.identifier.urihttps://hdl.handle.net/20.500.14540/9628
dc.language.isoen_USen_US
dc.publisherDepartment of Computer Science and Information Technologyen_US
dc.subjectSupervised machine learningen_US
dc.subjectApriori algorithmen_US
dc.subjectNaive bayse algorithmen_US
dc.subjectAutomatic document classificationen_US
dc.titleA Comparative Study on Document Categorization Using Apriori Algorithm and Naive Bayse Algorithmen_US
dc.typeThesisen_US
local.academic.levelMastersen_US
local.institute.titleCentral Department of Computer Science and Information Technologyen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
final thesis Sudan.1-14.pdf
Size:
1.22 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: