Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/18796
Title: LIVE CAMERA FEED SCENE DESCRIPTOR FOR VISUALLY IMPAIRED
Authors: SUBEDI, AADITYA MANI et al
Keywords: Video Captioning; Generative Image-To-Text Transformer; processor; Vatex dataset
Issue Date: 30-Apr-2023
Publisher: I.O.E. Pulchowk Campus
Institute Name: Institute of Engineering
Level: Bachelor
Abstract: Visually impaired people want to interact with nature, their surroundings, and other people. They want to experience the events happening around them but are deprived of that experience. Our product aims to assist visually impaired individuals in navigating Pulchowk Campus and to describe the activities taking place on the campus. The product uses a live camera feed and visual transformer techniques to generate a descriptive caption and a corresponding audio output, giving the user an accurate and timely description of their surroundings. It is designed to work with selected landmarks of the campus and a wide range of activities. We propose a model, obtained by fine-tuning the pre-trained Git-base-Vatex model on our campus video dataset, to describe the surrounding scene.
Description: Visually impaired people want to interact with nature, their surroundings, and other people. They want to experience the events happening around them but are deprived of that experience.
URI: https://elibrary.tucl.edu.np/handle/123456789/18796
Appears in Collections: Computer Engineering

Files in This Item:
File: Aaditya mani subedi et al be project report electronics and computer apr 2023.pdf
Size: 4.04 MB
Format: Adobe PDF
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.