Please use this identifier to cite or link to this item:
https://elibrary.tucl.edu.np/handle/123456789/18835
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | KANU, AAYUSH SHAH | - |
dc.contributor.author | POKHREL, ADITHYA | - |
dc.contributor.author | BASHYAL, BISHAL | - |
dc.contributor.author | SHARMA, JANAK | - |
dc.date.accessioned | 2023-07-31T05:32:37Z | - |
dc.date.available | 2023-07-31T05:32:37Z | - |
dc.date.issued | 2023-04-30 | - |
dc.identifier.uri | https://elibrary.tucl.edu.np/handle/123456789/18835 | - |
dc.description | This project proposes the use of the LayoutLMv2 model, a deep learning model, for information extraction from form-like documents. Specifically, the IRS 990 tax form was used as the dataset for testing and optimization. | en_US |
dc.description.abstract | This project proposes the use of the LayoutLMv2 model, a deep learning model, for information extraction from form-like documents. Specifically, the IRS 990 tax form was used as the dataset for testing and optimization. The information extraction process from form-like documents can be challenging due to the complex layout analysis and text recognition required to identify fields and corresponding values. The proposed model, LayoutLMv2, has demonstrated its effectiveness in these tasks, making it a promising solution for information extraction from form-like documents. The project resulted in the development of a web application and annotation tools that provide users with a user-friendly interface to upload documents and extract relevant information accurately and efficiently. The annotation tool enables users to label data and train custom models, while the web application streamlines document processing for businesses and organizations. | en_US |
dc.language.iso | en | en_US |
dc.publisher | I.O.E. Pulchowk Campus | en_US |
dc.subject | Transformers, | en_US |
dc.subject | OCR engine, | en_US |
dc.subject | Information Extraction | en_US |
dc.title | INFORMATION EXTRACTION FROM STRUCTURED DOCUMENT | en_US |
dc.type | Report | en_US |
local.institute.title | Institute of Engineering | en_US |
local.academic.level | Bachelor | en_US |
local.affiliatedinstitute.title | Pulchowk Campus | en_US |
Appears in Collections: | Computer Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Aayush shah kanu et al. be report computer apr 2023.pdf | 3.89 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.