Skip to main navigation Skip to search Skip to main content

Breast cancer prediction using data mining method

  • State University of New York Binghamton University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

40 Scopus citations

Abstract

This paper presents a study about breast cancer prediction based on data mining methods to discover an effective way to predict breast cancer. The objective of this paper is to compare and identify an accurate model to predict the incidence of breast cancer based on various patients' clinical records. Four data mining models are applied in this paper, i.e., support vector machine (SVM), artificial neural network (ANN), Naive Bayes classifier, AdaBoost tree. Furthermore, feature space is highly discussed in this paper due to its high influence on the efficiency and effectiveness of the learning process. To test the influence of feature space reduction, a hybrid between principal component analysis (PCA) and related data mining models is proposed, which applies a principle component analysis method to reduce the feature space. To evaluate the performance of these models, two widely used test data sets are used, Wisconsin Breast Cancer Database (1991) and Wisconsin Diagnostic Breast Cancer (1995). 10-fold cross-validation method is implemented to estimate the test error of each model. The results performed by this analysis demonstrate a comprehensive trade-off between these strategies and also provides a detailed evaluation on the models. It is expected that in real application, physicians and patients can benefit from the feature recognition outcome to prevent breast cancer.

Original languageEnglish
Title of host publicationIIE Annual Conference and Expo 2015
PublisherInstitute of Industrial Engineers
Pages818-828
Number of pages11
ISBN (Electronic)9780983762447
StatePublished - 2015
EventIIE Annual Conference and Expo 2015 - Nashville, United States
Duration: May 30 2015Jun 2 2015

Publication series

NameIIE Annual Conference and Expo 2015

Conference

ConferenceIIE Annual Conference and Expo 2015
Country/TerritoryUnited States
CityNashville
Period05/30/1506/2/15

Keywords

  • Breast cancer prediction
  • Data mining
  • Mold cross-validation

Fingerprint

Dive into the research topics of 'Breast cancer prediction using data mining method'. Together they form a unique fingerprint.

Cite this