Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/10152
Title: Text Recognition in Image Using Deep Convolutional Neural Network
Authors: Tandukar, Roshan
Keywords: Text recognition;Optical character recognition;Deep convolutional neural network;Character sequence model
Issue Date: 2017
Publisher: Department of Computer Science and Information Technology
Institute Name: Central Department of Computer Science and Information Technology
Level: Masters
Abstract: Text recognition in an image is one of the challenging tasks in computer vision and pattern recognition which involves automatically reading the text from the images. The non-uniformity in text styles, font size and colors, the complex background and the orientation makes it different from Optical character recognition (OCR). In this research work, text recognition with deep convolutional neural network architecture is described. Convolutional neural network is based on deep learning. The multi-layer architecture of the deep convolutional neural network allows using the deep features with less image preprocessing tasks and sharing of weights making it a faster neural network model. The deep architecture of the convolutional neural network is investigated with its two models character sequence model and dictionary encoding model to recognize the scene text. The strength and weakness of the models are analyzed based on the experiments done with the two publicly available datasets which includes Synth90k dataset and ICDAR 2003 datasets and obtained the accuracy of 93.88% in Synth90k and 70.33% in ICDAR 2003 dataset with the dictionary encoding model. Keywords: Text recognition, computer vision, Pattern recognition, Optical character recognition (OCR), Deep convolutional neural network, Character sequence model, Dictionary encoding model
URI: https://elibrary.tucl.edu.np/handle/123456789/10152
Appears in Collections:Computer Science & Information Technology

Files in This Item:
File Description SizeFormat 
Full Thesis.pdf1.25 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.