EFFECT OF FEATURE SELECTION AND DATASET SIZE ON THE ACCURACY OF NAÏVE BAYESIAN CLASSIFIER AND LOGISTICS REGRESSION

dc.contributor.authorANAKOBE, Muhammad Bashir
dc.date.accessioned2019-06-18T13:13:16Z
dc.date.available2019-06-18T13:13:16Z
dc.date.issued2018-06
dc.descriptionA DISSERTATION SUBMITTED TO THE SCHOOL OF POSTGRADUATE STUDIES, AHMADU BELLO UNIVERSITY, ZARIA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE AWARD OF MASTER DEGREE Department of Statistics, Faculty of Physical Sciences, Ahmadu Bello University, Zariaen_US
dc.description.abstractBinary Logistics Regression and Naïve Bayesian classifier are two of the common classification modelling techniques that allow one to predict the category that a new observation belongs, on the basis of a training set of data containing observations (or instances) whose category membership is known. We studied the classification performances of the two linear classification under different feature (variable) selection criteria and dataset size conditions on a medical domain area were studied based on the datasets (breast cancer and heart diseases) obtained from the University of California, Irvine, online respiratory. The result indicated that logistics Regression for classification on relatively large datasets without the application of PCA (for variable selection) has the great accuracy (91.4%), while Naïve Bayesian classifier with PCA (for variable/ feature selection) tops the smaller dataset classification with an accuracy of 90.2%. These two accuracies are close enough and high enough, which is an indication of high relevance of their selections in solving classification problems on datasets from this kind of domain.en_US
dc.identifier.urihttp://hdl.handle.net/123456789/11761
dc.language.isoenen_US
dc.subjectEFFECT,en_US
dc.subjectFEATURE SELECTION,en_US
dc.subjectDATASET SIZE,en_US
dc.subjectACCURACY,en_US
dc.subjectNAÏVE BAYESIAN CLASSIFIER,en_US
dc.subjectLOGISTICS REGRESSIONen_US
dc.titleEFFECT OF FEATURE SELECTION AND DATASET SIZE ON THE ACCURACY OF NAÏVE BAYESIAN CLASSIFIER AND LOGISTICS REGRESSIONen_US
dc.typeThesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
EFFECT OF FEATURE SELECTION AND DATASET SIZE ON THE ACCURACY OF NAÏVE .pdf
Size:
1.42 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections