Please use this identifier to cite or link to this item:
https://elibrary.tucl.edu.np/handle/123456789/14958
Title: | Handling Unknown Words in English to Nepali Statistical Machine Translation Using Analogical Learning Approach |
Authors: | Ghimire, Pravakar |
Keywords: | Statistical machine;Machine translation |
Issue Date: | 2011 |
Publisher: | Department of Computer Science and Information Technology |
Institute Name: | Central Department of Computer Science and Information Technology |
Level: | Masters |
Abstract: | Unknown words are one of the great difficulties in the field of machine translation. In the process of translation, a system is most likely to encounter words that were not present in the available training data. While this is in part due to the segmentation issues, it is also often simply due to the lack of training data. In statistical machine translation to translate a sentence from one language to another we make use of a parallel corpus but it is not possible to a corpus to contain all the words from a whole language domain, hence the unknown word problem is obvious. In this thesis work an effective approach is used to translate those unknown words in English to Nepali statistical machine translation using word analogy. In this method the meaning of the unknown word is identified on the basis of other words presented in the corpus and the analogy between the prefixes and suffixes of those words with the unknown word. |
URI: | https://elibrary.tucl.edu.np/handle/123456789/14958 |
Appears in Collections: | Computer Science & Information Technology |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
chapter page.pdf | 548.06 kB | Adobe PDF | View/Open | |
cover page.pdf | 318.84 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.