Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/17237
Title: Automatic construction of dictionary on English-Nepali parallel corpus
Authors: Saud, Lokendra Bahadur
Keywords: Automatic construction;Dictionary;english Nepali parallel
Issue Date: 2010
Publisher: Department of Computer Science and Information Technology
Institute Name: Central Department of Computer Science and Information Technology
Level: Masters
Abstract: This dissertation describes an approach based on word alignment on parallel corpora, which aims at facilitating the lexicographic work of dictionary building. The proposed model of dictionary construction first perform the tokenization of input text and then TnT tagger is used for tagging the tokenized text, after this the word alignment is done to find out the word pair form source (English)language and target (Nepali) language. Finally our dictionary generation algorithm generate the sample dictionary formed from the word that made the given input text. Our model does rely on the information from tagging as well. Hence the model accuracy not only depends on the alignment algorithms and the training corpus but also depends on the accuracy of tagger. This corpus-driven technique, in particular the exploitation of parallel corpora, proved to be helpful in the creation of bilingual dictionaries for several reasons. Most importantly, a parallel corpus of appropriate size guarantees that the most relevant translations are included in the dictionary.
URI: https://elibrary.tucl.edu.np/handle/123456789/17237
Appears in Collections:Computer Science & Information Technology

Files in This Item:
File Description SizeFormat 
Cover(1).pdf43.02 kBAdobe PDFView/Open
Chapter(2).pdf386.84 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.