Show simple item record

Classification of Breast Cancer Using Logistic Regression

dc.contributor.authorUde, Anthony Anene
dc.date.accessioned2019-08-08T13:40:58Z
dc.date.available2019-08-08T13:40:58Z
dc.date.issued2019-06-16
dc.identifier.urihttp://repository.aust.edu.ng/xmlui/handle/123456789/4907
dc.description.abstractBreast cancer is a prevalent disease that affects mostly women, an early diagnosis will expedite the treatment of this ailment. In recent times, Machine Learning (ML) techniques have been employed in biomedical and informatics to help fight breast cancer. This research work proposed an ML model for the classification of breast cancer. To achieve this we employed logistic regression (LR) and also compared our model’s performance with other extant ML models namely, Support Vector Machine (SVM), Naïve Bayes (NB), and Multilayer Perceptron (MLP). The original Wisconsin Diagnostic Breast Cancer dataset (WDBC) was used. Our performance evaluation was done for two phases, i.e. Phase 1: when the WBCD is scaled (feature scaling) and Phase 2: when the dataset is not scaled. All models excluding MLP performed well when there is no feature scaling of dataset with f1-scores of (LR=97%, SVM = 97%, NB = 95%, MLP= 52%). However, when feature scaling is applied on dataset, the four models have f1-scores above 90% (SVM = 98%, LR = 97%, NB = 97%, MLP = 97%). Notably, the f1-score for LR in both cases did not change, hence to the best of our knowledge, we concluded that LR, given its simplicity and low time complexity is a good model to employ for binomial classification .en_US
dc.description.sponsorshipAUST and AfDB.en_US
dc.language.isoenen_US
dc.subjectUde Anthony Aneneen_US
dc.subject2019 Computer Science and Engineering Thesesen_US
dc.subjectDr. Rajesh Prasaden_US
dc.subjectLogistic regressionen_US
dc.subjectmachine learningen_US
dc.subjectsupervised learningen_US
dc.subjectfeatures scalingen_US
dc.subjectprediction modelsen_US
dc.subjectperformance metricsen_US
dc.titleClassification of Breast Cancer Using Logistic Regressionen_US
dc.typeThesisen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

  • Computer Science105

    This collection contains Computer Science Student's Theses from 2009-2022

Show simple item record