Automatic Text Summarization System for Nepali Language Based on Sentence Extraction

Lamichhane, Rajendra

Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/14818

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lamichhane, Rajendra	-
dc.date.accessioned	2023-02-03T06:30:34Z	-
dc.date.available	2023-02-03T06:30:34Z	-
dc.date.issued	2013	-
dc.identifier.uri	https://elibrary.tucl.edu.np/handle/123456789/14818	-
dc.description.abstract	Automated text summarization is a general problem in the Natural Language Processing (NLP) community. It has attracted considerable attention recently as the amount of information, online and offline, grows throughout the world. As the volume and availability of data increase, information becomes redundant and scattered, so there is a need for an effective and powerful tool to summarize text documents automatically. Much research has been done for English and other European languages, with high performance. The Nepali language, however, has received little attention and research in this field. This dissertation proposes a method for summarizing Nepali text documents automatically using sentence extraction techniques. The approach involves four stages: text preprocessing, feature extraction, sentence scoring and ranking, and summary generation. The proposed system is tested on various datasets collected from different sources such as books, newspapers, articles, and reports. Automated evaluation techniques are used to validate the system's output against manual summaries. Overall, the proposed system achieves 79.18% precision, 71.77% recall, and a 75.02% F-score. The cosine similarity measure gives an overall similarity of 91.16% between the manual and system summaries.	en_US
dc.language.iso	en_US	en_US
dc.publisher	Department of Computer Science and I.T.	en_US
dc.subject	Automatic Text Summarization	en_US
dc.subject	Sentence Extraction	en_US
dc.subject	Nepali language	en_US
dc.subject	Preprocessing	en_US
dc.title	Automatic Text Summarization System for Nepali Language Based on Sentence Extraction	en_US
dc.type	Thesis	en_US
local.institute.title	Central Department of Computer Science and Information Technology	en_US
local.academic.level	Masters	en_US
Appears in Collections:	Computer Science & Information Technology
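
The abstract reports precision, recall, F-score, and cosine similarity between system and manual summaries. The following is a minimal sketch of how such measures are commonly computed for extractive summaries, not the thesis's actual implementation. The function names, the sentence-level set overlap, the whitespace tokenization, and the splitting of sentences on the danda ("।") are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    """Split text into sentences on the danda, '?' or '!' (assumed delimiters)."""
    parts = re.split(r"[।?!]", text)
    return [p.strip() for p in parts if p.strip()]


def precision_recall_f(system_sentences, manual_sentences):
    """Sentence-level overlap between a system summary and a manual summary."""
    sys_set, man_set = set(system_sentences), set(manual_sentences)
    overlap = len(sys_set & man_set)
    precision = overlap / len(sys_set) if sys_set else 0.0
    recall = overlap / len(man_set) if man_set else 0.0
    total = precision + recall
    f_score = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_score


def cosine_similarity(text_a, text_b):
    """Cosine similarity of raw term-frequency vectors (whitespace tokens)."""
    va, vb = Counter(text_a.split()), Counter(text_b.split())
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0
```

In this sketch a system sentence counts as correct only if it matches a manual-summary sentence exactly. An evaluation that averages scores over several documents can report an overall F-score that differs from the harmonic mean of the overall precision and recall.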

Files in This Item:

File	Description	Size	Format
All thesis.pdf		1.64 MB	Adobe PDF
Appendix B.pdf		519.17 kB	Adobe PDF

