Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/14846
Title: A Comparative Study of Naive Bayesian Spam Filtering Using Word Distribution and Trigrams
Authors: Dangol, Pabitra
Keywords: Naive Bayesian Spam; Word Distribution
Issue Date: 2011
Publisher: Department of Computer Science and I.T.
Institute Name: Central Department of Computer Science and Information Technology
Level: Masters
Abstract: This comparative study examines how tokenization affects a Naive Bayesian spam filter. It compares the reliability and accuracy of a filter that uses word-based tokenization with one that uses trigram-based tokenization. Both filters are implemented using the same classifier and trainer. The results show that word-based spam filtering performs better when the number of pre-categorized emails available for training is limited and when the resources available for the classification process are also limited. When sufficient resources and emails are available, the results suggest that trigram-based spam filtering is better because of its higher reliability and accuracy.
URI: https://elibrary.tucl.edu.np/handle/123456789/14846
Appears in Collections: Computer Science & Information Technology
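
As an illustration of the comparison described in the abstract, the following is a minimal sketch in which two tokenizers share one trainer and one classifier. It is not the thesis implementation. It assumes that "trigram" means a sliding window of three characters, and it assumes Laplace (add-one) smoothing. The function names word_tokens, trigram_tokens, train, and classify are hypothetical.

```python
import math
import re
from collections import Counter

LABELS = ("spam", "ham")


def word_tokens(text):
    """Word-based tokenization: lowercase runs of letters, digits, apostrophes."""
    return re.findall(r"[a-z0-9']+", text.lower())


def trigram_tokens(text):
    """Trigram-based tokenization (assumed here to be character trigrams)."""
    s = " ".join(text.lower().split())  # normalize whitespace
    return [s[i:i + 3] for i in range(len(s) - 2)]


def train(messages, tokenize):
    """Count tokens and messages per label; messages are (text, label) pairs."""
    token_counts = {label: Counter() for label in LABELS}
    doc_counts = Counter()
    for text, label in messages:
        token_counts[label].update(tokenize(text))
        doc_counts[label] += 1
    return token_counts, doc_counts


def classify(text, model, tokenize):
    """Return the label with the higher Naive Bayes log-score."""
    token_counts, doc_counts = model
    vocab_size = len(set(token_counts["spam"]) | set(token_counts["ham"]))
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in LABELS:
        total_tokens = sum(token_counts[label].values())
        # Smoothed prior, so a label with no training messages does not yield log(0).
        score = math.log((doc_counts[label] + 1) / (total_docs + len(LABELS)))
        for tok in tokenize(text):
            # Add-one smoothing; the extra +1 reserves mass for unseen tokens.
            score += math.log(
                (token_counts[label][tok] + 1) / (total_tokens + vocab_size + 1)
            )
        scores[label] = score
    return max(scores, key=scores.get)
```

Only the tokenize argument changes between the two filters, which mirrors the abstract's statement that both use the same classifier and trainer. A trigram vocabulary is typically much larger than a word vocabulary, which is one plausible reason the trigram filter needs more training emails and resources.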

Files in This Item:
File              Size        Format
Cover page.pdf    45.96 kB    Adobe PDF
Chapter.pdf       235.66 kB   Adobe PDF

