Comparing Vision Transformers and CNNs for Accurate Retinal Disease Classification

dc.contributor.advisorAsst. Prof. Sarbin Sayami
dc.contributor.authorPaudyal, Binod
dc.date.accessioned2025-05-08T07:45:17Z
dc.date.available2025-05-08T07:45:17Z
dc.date.issued2025
dc.description.abstractRetinal diseases, such as Age-Related Macular Degeneration (AMD) and Diabetic Macular Edema (DME) significanty contribute to vision impairment in global scale. An early diagnosis and timely treatment can save a lot of people form blindness. This research focuses on leveraging Optical Coherence Tomography (OCT) images for the classification of retinal diseases using advanced deep learning models. Specifically, we explore the capabilities of Vision Transformers (ViTs), Convolutional Neural Networks (CNNs), and a proposed Hybrid CNN-Transformer model (HybridCNNViT). The HybridCNNViT model was developed by combining the local feature extraction strengths of CNNs with the global context modeling capabilities of Transformers. Comparative evaluations of accuracy, precision, and computational efficiency revealed that HybridCNNViT outperforms standalone ViTs and CNNs for retinal disease classification. As it offers a promising approach to improve healthcare outcomes in ophthalmology, can be further improved and used in applications of automated retinal disease detection and clinical diagnostics.
dc.identifier.urihttps://hdl.handle.net/20.500.14540/24984
dc.language.isoen_US
dc.subjectVision Transformers
dc.subjectConvolutional Neural Networks
dc.subjectHybridCNNViT
dc.subjectRetinal Disease Classification
dc.subjectOptical Coherence Tomography
dc.subjectDeep Learning in Healthcare
dc.subjectMedical Image Analysis
dc.titleComparing Vision Transformers and CNNs for Accurate Retinal Disease Classification
dc.typeThesis
local.academic.levelMasters
local.institute.titleInstitute of Science & Technology

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
11-25 Binod_Paudyal_Thesisv1.pdf
Size:
1.87 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: