A Computational Analysis of Nepali Morphology: A Model for Natural Language Processing

dc.contributor.authorPrasain, Balaram
dc.date.accessioned2023-02-05T06:55:44Z
dc.date.available2023-02-05T06:55:44Z
dc.date.issued2011
dc.description.abstractThe main goal of this study is to present a computational analysis of morphology in Nepali for developing a model for natural language processing by applying the finite state approach. The morphological categories have been analyzed according to the principle of Two-level morphology (Koskeniemmi 1983), and these categories have been implemented using Xerox finite state tool (Beesley and Kartumnen 2003) to create the morphological analyzer. A version of finite state automaton called finite state transducer is used in this study which handles relation between two languages, namely upper language and lower language. Upper language is equivalent to lexical level and lower language is equivalent to surface level. The finite state transducer is bidirectional, i.e., moving from surface level to lexical level is analysis and from lexical level to surface level is generation. This study is organized into eight chapters. Chapter 1 presents the general morphological concepts, the objectives, methodology, the significance and limitations of the study. Chapter 2 presents the theoretical framework that is adopted for the study. Chapter 3 analyzes nouns, pronouns, adjectives, numerals and classifiers in Nepali. Chapter 4 analyzes the verbs in Nepali from computational approach in the first part and verbal inflections in the second part. Chapter 5 deals with indeclinable words in Nepali. Chapter 6 analyzes the derivational process. Chapter 7 implements the outcome of analysis in previous chapters into a finite state transducer using Xerox Finite State Tool. Chapter 8 summarizes the findings of the study. This study has identified fourteen groups of nouns, eight groups of pronouns, four groups of adjectives, one group of cardinal numerals, two groups of ordinal numerals, three groups of classifiers, ten groups of verbs, seven groups of adverbs, two groups of conjunctions, three groups of postpositions, one group of particles and fifteen groups of derivations in Nepali. The phonological rules for each group have also been identified. The finite state transducer for each group with corresponding morphological tags and phonological rules have been created; and all of them have been put together into a single transducer which can be used as a morphological analyzer for Nepali.en_US
dc.identifier.urihttps://hdl.handle.net/20.500.14540/14866
dc.language.isoen_USen_US
dc.publisherFaculty of Linguisticsen_US
dc.subjectComputational analysisen_US
dc.subjectLanguage processingen_US
dc.subjectNepali morphologyen_US
dc.titleA Computational Analysis of Nepali Morphology: A Model for Natural Language Processingen_US
dc.typeThesisen_US
local.academic.levelPh.D.en_US
local.institute.titleCentral Department of Linguisticsen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Full Thesis.pdf
Size:
3.79 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections