Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/8967
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBharati, Nitu-
dc.date.accessioned2022-03-11T06:59:00Z-
dc.date.available2022-03-11T06:59:00Z-
dc.date.issued2011-11-
dc.identifier.citationMasters of Science in Information and Communication Engineering,en_US
dc.identifier.urihttps://elibrary.tucl.edu.np/handle/123456789/8967-
dc.descriptionMulti-class Text Classification is the task of classifying a given text into one or more than one classes taken form a set of predefined classes.en_US
dc.description.abstractMulti-class Text Classification is the task of classifying a given text into one or more than one classes taken form a set of predefined classes. A class can be a topic of a text, for example, a class of any text about a movie can be ``entertainment’’. In this research I investigate unsupervised learning to accurately identify the topic of a given text. The cost involved in labeling a large amount of data and availability of huge amount of unlabeled data makes unsupervised learning an ideal choice. The probabilistic algorithm used for text classification can be termed as topic modeling and is capable to extract multiple topics within a single text of a document. LDA model used in this report exploits co-occurrence patterns of words in documents to extract semantically meaningful probabilistic clusters of words called topics .Each of those clusters is labeled using the significant terms selected in each cluster. Semantic distance between the significant terms from the clusters and Wikipedia documents is measured to identify labels for each cluster.en_US
dc.language.isoenen_US
dc.publisherPulchowk Campusen_US
dc.subjectText Classificationen_US
dc.subjectIdentifyen_US
dc.titleUnsupervised Text Classification as Topic Annotationen_US
dc.typeThesisen_US
local.institute.titleInstitute of Engineeringen_US
local.academic.levelMastersen_US
local.affiliatedinstitute.titlePulchowk Campusen_US
Appears in Collections:Electronics and Computer Engineering

Files in This Item:
File Description SizeFormat 
066MSI612pdf.pdf920.16 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.