Please use this identifier to cite or link to this item:
https://elibrary.tucl.edu.np/handle/123456789/14846
Title: | A Comparative Study of Naive Bayesian Spam Filtering Using Word Distribution and Trigrams |
Authors: | Dangol, Pabitra |
Keywords: | Naive Bayesian Spam;Word Distribution |
Issue Date: | 2011 |
Publisher: | Department of Computer Science and I.T. |
Institute Name: | Central Department of Computer Science and Information Technology |
Level: | Masters |
Abstract: | A comparative study of Naive Bayesian spam filter is done on the basis of tokenization. The study is focused on the reliability and accuracy of the spam filter between word-based tokenization and trigram-based tokenization. Both of the filters are implemented using the same classifier and trainer. The results of the study is that word-based spam filtering is better when the amount of pre-categorized emails available for training are limited and when the resources available for the classification process were limited as well. For sufficient amount of resources and emails, the results suggest that trigram-based spam filtering is better due to its higher reliability and accuracy. |
URI: | https://elibrary.tucl.edu.np/handle/123456789/14846 |
Appears in Collections: | Computer Science & Information Technology |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Cover page.pdf | 45.96 kB | Adobe PDF | View/Open | |
Chapter.pdf | 235.66 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.