Please use this identifier to cite or link to this item:
https://elibrary.tucl.edu.np/handle/123456789/18796
Title: | LIVE CAMERA FEED SCENE DESCRIPTOR FOR VISUALLY IMPAIRED |
Authors: | SUBEDI, AADITYA MANI et al |
Keywords: | Video Captioning,;Generative Image-To-Text Transformer,;processor,;Vatex dataset, |
Issue Date: | 30-Apr-2023 |
Publisher: | I.O.E. Pulchowk Campus |
Institute Name: | Institute of Engineering |
Level: | Bachelor |
Abstract: | Every visually impaired people wants to interact with their nature, surrounding and people. They want to feel the event happening on the nature but are naturally deprived. Our product aims at assisting visually impaired individuals in navigating their way around Pulchowk Campus and describing the actions happening inside the campus. The product utilizes a live camera feed and visual transformer techniques to generate a descriptive caption and its audio output, providing the user with a proper and timely description of their surroundings. The product is designed to work on some landmark of the campus and wide range of activities. We suggest a model that is fine tuned on the pre-trained Git-base-Vatex model in our campus video datasets to describe the surrounding scene. |
Description: | Every visually impaired people wants to interact with their nature, surrounding and people. They want to feel the event happening on the nature but are naturally deprived. |
URI: | https://elibrary.tucl.edu.np/handle/123456789/18796 |
Appears in Collections: | Computer Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Aaditya mani subedi et al be project report electronics and computer apr 2023.pdf | 4.04 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.