According to international agency for research on cancer, female breast cancer was the leading type of cancer worldwide in terms of the number of new cases (approximately 2.1 million) diagnosed in 2018.
Predicting outcome of a disease is a challenging task. Data mining techniques tends to simplify the prediction segment. Automated tools have made it possible to collect large volumes of medical data, which are made available to the medical research groups. This study aimed to apply machine learning algorithms using decision three classifier and descriptive statistics to evaluate the performance of the model in predicting the probability of cancer metastasis in patients that present late.
Materials and method: The breast cancer disease dataset has been taken from the department of Radiotherapy and Oncology of Usmanu Danfodiyo University Teaching Hospital, Sokoto state, Nigerian. Dataset has 259 instances and 10 attributes. The experimental results of this study used, decision three classifier in IMB SPSS (version 23) software environment. In the experiment, two classes were used and therefore a 2 × 2 confusion matrix was applied. Class 0=Not Metastasized, Class 1=Metastasized. We applied supervised machine learning approach in which dataset were divided into two classes that is training and testing using 10 fold cross validation.
Results: Shows that 259 instance of breast cancer, 218(84.2%) cases were not metastasized while 41(15.8%) cases were metastasized to the other region of the body. The overall accuracy of the model was found to be 87%, with the sensitivity of 88%, specificity 75% and the precision of 98%
Conclusion: Based on these findings, the machine learning algorism using decision three classifiers predicted that 87% of the tumor presented at stage IV, indicating that the tumour can spread to the other region of the body.
Published Date: 2020-03-05; Received Date: 2020-02-13