This paper is published in Volume-6, Issue-5, 2020
Area
Computer Science
Author
Sanjay Kumar A., Sarojini Sharon R. K.
Org/Univ
St. Joseph's Institute of Technology, Chennai, Tamil Nadu, India
Keywords
Mammographic Mass, Hyperparameter, Machine Learning, Supervised Machine Learning
Citations
IEEE
Sanjay Kumar A., Sarojini Sharon R. K.. Prediction of a mammogram mass as benign or malignant, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.
APA
Sanjay Kumar A., Sarojini Sharon R. K. (2020). Prediction of a mammogram mass as benign or malignant. International Journal of Advance Research, Ideas and Innovations in Technology, 6(5) www.IJARIIT.com.
MLA
Sanjay Kumar A., Sarojini Sharon R. K.. "Prediction of a mammogram mass as benign or malignant." International Journal of Advance Research, Ideas and Innovations in Technology 6.5 (2020). www.IJARIIT.com.
Sanjay Kumar A., Sarojini Sharon R. K.. Prediction of a mammogram mass as benign or malignant, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.
APA
Sanjay Kumar A., Sarojini Sharon R. K. (2020). Prediction of a mammogram mass as benign or malignant. International Journal of Advance Research, Ideas and Innovations in Technology, 6(5) www.IJARIIT.com.
MLA
Sanjay Kumar A., Sarojini Sharon R. K.. "Prediction of a mammogram mass as benign or malignant." International Journal of Advance Research, Ideas and Innovations in Technology 6.5 (2020). www.IJARIIT.com.
Abstract
This paper is about comparing several different supervised machine learning techniques to the “mammographic masses” public data set from the UCI repository, and see which one yields the highest accuracy as measured with K-Fold cross validation (K=10 A lot of needless pain and surgery occurs from the subsequent false positives of mammogram performance. If we can create a better way of understanding them through supervised machine learning, it will make many lives easier .We have applied Decision tree, Random forest, KNN, Naive Bayes, SVM, Logistic Regression and also neural network using Keras. The data was cleaned as many rows were contain missing data, and as there were erroneous data identifiable as outliers as well. Some techniques such as SVM that we have employed requires the input data to be normalized first. Many techniques also had "hyperparameters" that had to be tuned. So here we have approached a better way to make it even better by tuning its hyperparameters.