Automatic Text Summarization System for Nepali Language Based on Sentence Extraction

dc.contributor.authorLamichhane, Rajendra
dc.date.accessioned2023-02-03T06:30:34Z
dc.date.available2023-02-03T06:30:34Z
dc.date.issued2013
dc.description.abstractAutomated text summarization is a generic problem in the Natural Language Processing (NLP) community. It has grabbed great attention recently as the amount of information increases throughout the world, online and offline. As the volume and availability of data increases, it causes redundancy and scatterness over the world. So, there is the need of effective and powerful tool to summarize text documents automatically. So far, many researches have been done for English and other European languages with high performance. However, Nepali language still suffers from the little attentions and researches in this field. In this dissertation, a method has been proposed, which lets us to summarize Nepali text documents automatically based on sentence extraction techniques. The various stages involved in this approach which are: text preprocessing, feature extraction, sentence scoring and ranking, and summary generation. The proposed system is tested with various datasets collected from different sources such as books, newspapers, article, reports, etc. Automated evaluation techniques are used to validate the proposed system against the manual summaries. The overall accuracy of the proposed system is achieved as 79:18% precision, 71:77% recall and 75:02% F-Score. Cosine similarity measure gives overall similarity of 91:16% between manual summary and system summary.en_US
dc.identifier.urihttps://elibrary.tucl.edu.np/handle/20.500.14540/14818
dc.language.isoen_USen_US
dc.publisherDepartment of Computer Science and I.T.en_US
dc.subjectAutomatic Text Summarizationen_US
dc.subjectSentence Extractionen_US
dc.subjectNepali languageen_US
dc.subjectPreprocessingen_US
dc.titleAutomatic Text Summarization System for Nepali Language Based on Sentence Extractionen_US
dc.typeThesisen_US
local.academic.levelMastersen_US
local.institute.titleCentral Department of Computer Science and Information Technologyen_US
Files
Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
All thesis.pdf
Size:
1.6 MB
Format:
Adobe Portable Document Format
Description:
No Thumbnail Available
Name:
Appendix B.pdf
Size:
519.17 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: