DSpace Collection:
https://elibrary.tucl.edu.np/handle/123456789/17054
2024-03-28T19:30:18ZGESTURE SYNTHESIS USING MULTIMODAL SUPERVISED LEARNING
https://elibrary.tucl.edu.np/handle/123456789/18866
Title: GESTURE SYNTHESIS USING MULTIMODAL SUPERVISED LEARNING
Authors: BHANDARI, PRASUN; ROUNIYAR, RAHUL; SHRESTHA, RONAB; GURAU, SUNIL
Abstract: One of the long-standing ambitions of the modern science and engineering has been to create
a non-human entity that manifests human-like intelligence and behavior. One step to
achieving the goal is executing a communication just like the humans do. Human speech
is often accompanied by a variety of gestures which add rich non-verbal information to the
message the speaker is trying to convey. Gestures add clarity to the intention and emotions
of the speaker and enhance the speech by adding visual cues alongside audio signal. Our
project aims to synthesize co-speech gestures by learning from individual speaker’s style. We
follow a data-driven approach instead of rule-based approach as the audio-gesture relation
is poorly captured by a rule-based system due to issues like asynchrony and multi-modality.
As is the current trend, we train the modal from in-the-wild videos embedded with audio
instead of relying on the motion capture of subjects in lab for video annotation. For establishing
the ground truth for the data set of video frames, we rely on an automatic pose
detection system. Although the ground truth signal tends to be not as accurate as manually
annotated frames, the approach relieves us of time and labor expense. We perform the crossmodal
translation from monologue speech of a single speaker to their hand and arm motion
based on the learning of temporal correlation between the sequence of pose and audio sample.
Description: One of the long-standing ambitions of the modern science and engineering has been to create
a non-human entity that manifests human-like intelligence and behavior. One step to
achieving the goal is executing a communication just like the humans do.2023-05-01T00:00:00ZTRAFFIC VIOLATION DETECTION WITH COMPUTER VISION
https://elibrary.tucl.edu.np/handle/123456789/18865
Title: TRAFFIC VIOLATION DETECTION WITH COMPUTER VISION
Authors: DAHAL, NEHA; G.C., SIMON; MAINALI, SUBASH; SUBEDI, UDAYA RAJ
Abstract: In this project we used YOLOv5s that was trained on custom dataset collected by us which
consisted of 2193 images of 6 classes which was augmented to extend our dataset to 5259
images and was split in the ratio of 70:20:10 for train, validation, and test respectively. For
tracking the detected objects in the video, we used DeepSORT which tracks and outputs
the bounding box for the object with respective track IDs. Then if the detected and tracked
object have violated traffic lights the corresponding license plate of the object in question
is sent as input for segmentation program. The image of the license plate undergoes HSV
color space conversion, color masking and perspective transformed in that order before it
is preprocessed for profiling the different types of license plate in the dataset. The image
undergoes horizontal projection profiling and vertical projection profiling which is then validated
to separate the characters of the license plate.
Description: In this project we used YOLOv5s that was trained on custom dataset collected by us which
consisted of 2193 images of 6 classes which was augmented to extend our dataset to 5259
images and was split in the ratio of 70:20:10 for train, validation, and test respectively.2023-04-01T00:00:00ZSMART TRIAL SYSTEM
https://elibrary.tucl.edu.np/handle/123456789/18864
Title: SMART TRIAL SYSTEM
Authors: SHAKYA, AYUSH; ATAL, DEWASHISH; KARN, PRASHANNA; PANDIT, PRASHANNA RAJ
Abstract: Driving is a common and important mode of transportation in the daily lives of people. It
allows people to move freely and independently from one place to another. To drive legally
in Nepal, as in most countries, a person requires a driving license, for which he/she has to
undergo a series of tests in the government office. In Nepal, a candidate applying for a license
has to drive over a fixed path in front of the authorities. The candidate has to complete
the test in accordance with specific rules and if he fails to do so he/she will be disqualified.
Manually monitoring the errors of the candidates is done by these authorities. Also, the
issue of bribery for obtaining a driver’s license is a significant problem in Nepal. To address
this issue, we have proposed an AI-based solution named CarSight. CarSight is a Computer
Vision model designed to analyze visual data from cameras that verifies whether a driver
has passed or failed the driving test. It uses YOLO-v5 to detect and track the motion of the
trial vehicles in a set of real-time videos obtained from cameras placed around the model
track. The system gives an instant pass or fails result depending on the driver’s performance
once they finish the trial exam. The accuracy of detecting and evaluating the driver’s performance
was successfully demonstrated by our experimental results for our system. Our
proposed system can help to bring fairness in the process of obtaining a driver’s license.
Description: Driving is a common and important mode of transportation in the daily lives of people. It
allows people to move freely and independently from one place to another. To drive legally
in Nepal, as in most countries, a person requires a driving license, for which he/she has to
undergo a series of tests in the government office.2023-03-01T00:00:00ZGUIDANCE, NAVIGATION AND CONTROL OF A VTOL VEHICLE TO MAKE IT FOLLOW A PREDETERMINED TRAJECTORY
https://elibrary.tucl.edu.np/handle/123456789/18848
Title: GUIDANCE, NAVIGATION AND CONTROL OF A VTOL VEHICLE TO MAKE IT FOLLOW A PREDETERMINED TRAJECTORY
Authors: PATHAK, SAKAR; ACHARYA, SAMUNDRA; SILWAL, SHREEJAN SINGH; DHAKAL, SWAYAM
Abstract: In interplanetary missions, the landing of space vehicles is typically accomplished using
parachutes. However, this simple method is not without its challenges, as these vehicles
are prone to parachute drifts that are difficult to predict, especially on planets with dense
atmospheres like Earth. As a result, significant attention has recently been given to the
development of active control systems for space vehicles, allowing for precise guidance, navigation,
and control over predetermined trajectories and enabling soft and accurate landings
on planetary surfaces.
The ability to follow a predetermined path and land softly and precisely using real-time
onboard control algorithms would greatly enhance the capabilities of vehicles for interplanetary
travel, while also increasing the re-usability of space vehicles. This not only benefits
interplanetary travel but also improves space payload delivery systems by reducing costs and
increasing efficiency.
To this end, this project aims to implement control algorithms on an Electric Ducted
Fan (EDF) powered model of a Vertical Take Off and Landing (VTOL) vehicle, enabling
it to follow a fixed trajectory. A small CanSat payload will be attached to the vehicle and
deployed at a specific altitude, simulating the tasks required of a full-scale vehicle.
By utilizing these advanced control systems, space vehicles can navigate more accurately
and efficiently, reducing the risks and costs associated with interplanetary travel. With a
focus on trajectory control and precision landing, this project aims to contribute to the ongoing
efforts to enhance space exploration and technology development.
Description: In interplanetary missions, the landing of space vehicles is typically accomplished using
parachutes. However, this simple method is not without its challenges, as these vehicles
are prone to parachute drifts that are difficult to predict, especially on planets with dense
atmospheres like Earth.2023-04-30T00:00:00Z