The dataset has following nine features and takes integer value from 1 to 10. clump thickness, uniformity of cell size, uniformity of cell shape, marginal adhesion, single epit helical cell size, bare nucleoi, bland chromatin, normal nuclei, mitoses
Feature selection using F-score, then use logistic regression, Neural network and K-NN for prediction
Data cleaning, visualize data statistics Used logistic regression,and K-NN classifiers