Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/7492
Title: A MapReduce Based Parallel Algorithm for Finding Longest Common Subsequence in Biosequences
Authors: Bohara, Jnaneshwar
Keywords: Bioinformatics, Longest Common Subsequence;MapReduce, Hadoop
Issue Date: Nov-2014
Publisher: Pulchowk Campus
Institute Name: Institute of Engineering
Level: Masters
Citation: MASTER OF SCIENCE IN COMPUTER SYSTEM AND KNOWLEDGE ENGINEERING
Abstract: The Longest Common Subsequence(LCS) identification of biological sequences has significant aplications in bioinformatics. Due to the emerging growth in bioinformatics applications, new biological sequences with longer length have been used for processing, making it great challenge for sequenctial LCS algorithms. Few parallel LCS algorithms have been proposed but their efficiency and effectiveness are not satisfactory with increasing complexity and size of biological data. To overcome limitations of existing LCS algorithms and considering MapReduce programming model as promising technology for cost effective high performace parallel computing, MapReduce based parallel algorithm for LCS has been developed. This algorithm adopts the concepts of successor tables, identical character pairs, successor tree and traversal of successor tree to find Longest Common Subsequence. The hadoop framework is used for the realization of MapReduce model.
Description: The Longest Common Subsequence(LCS) identification of biological sequences has significant aplications in bioinformatics.
URI: https://elibrary.tucl.edu.np/handle/123456789/7492
Appears in Collections:Electronics and Computer Engineering

Files in This Item:
File Description SizeFormat 
Thesis_069MSCS670.pdf883.76 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.