MammogramDiagnosis

Used the "mammographic masses" public dataset from the UCI repository (source: https://archive.ics.uci.edu/ml/datasets/Mammographic+Mass)

This data contains 961 instances of masses detected in mammograms, and contains the following attributes:

BI-RADS assessment: 1 to 5 (ordinal)
Age: patient's age in years (integer)
Shape: mass shape: round=1 oval=2 lobular=3 irregular=4 (nominal)
Margin: mass margin: circumscribed=1 microlobulated=2 obscured=3 ill-defined=4 spiculated=5 (nominal)
Density: mass density high=1 iso=2 low=3 fat-containing=4 (ordinal)
Severity: benign=0 or malignant=1 (binominal)

Applied several different supervised machine learning techniques to this data set, and see which one yields the highest accuracy as measured with K-Fold cross validation (K=10).

Decision tree
Random forest
KNN
Naive Bayes
SVM
Logistic Regression

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
mammo_masses_project.ipynb		mammo_masses_project.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MammogramDiagnosis

About

Releases

Packages

Languages

shanthoshp/MammogramDiagnosis

Folders and files

Latest commit

History

Repository files navigation

MammogramDiagnosis

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages