Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/10243
Title: Compaative Analysis of Random Forest And Logistic Regression For Diagnosis Of Diabetes Mellitus
Authors: Pandey, Madhu
Keywords: Data Mining;Logistic Regression;Random Forest
Issue Date: Jun-2019
Publisher: Department of Computer Science and Information Technology
Institute Name: Central Department of Computer Science and Information Technology
Level: Masters
Abstract: In our daily life there is lots of data in different field. Whenever there is data we can have lots of information, patterns, meaning etc. and the process of Extracting or “mining” knowledge from large amount of data is called Data mining and is also known as “Knowledge discovery from data (KDD)”. Data mining applications has got rich focus due to its significance of classification algorithms. Diabetes Mellitus (DM) is a result of bad metabolism. DM, if not controlled, causes several complications and even affects other parts of the body. This study aims to survey on the two different classifiers with data set of patients regarding Diabetes Mellitus and to implement as well as assist by comparing Random Forest and Logistic Regression classification techniques to standardize the diagnosis and treatment of Diabetes Mellitus. From the context analysis it was seen that Logistic Regression was able to classify 81.17% of the data correctly which was better than Random Forest in comparison to results of evaluation metrics (Accuracy, Precision, Recall and F-Measure). In a nut shell, the experiment result showed that Logistic Regression had got 2% better accuracy than Random Forest for the diagnosis of diabetes mellitus.
URI: https://elibrary.tucl.edu.np/handle/123456789/10243
Appears in Collections:Computer Science & Information Technology

Files in This Item:
File Description SizeFormat 
All Thesis.pdf1.54 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.