The dataset used in this project is a modified version of this Kaggle dataset.
Given the existing players, we might want to estimate what overall rating a new player would have without knowing the empirical formula behind it. I used the K Nearest Neighbours algorithm for this, and it turned out to be pretty accurate (about 0.95 accuracy).
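To make the idea concrete, here is a minimal sketch of K Nearest Neighbours using only the Python standard library. It is not the project's actual code: the function name `knn_predict`, the three feature columns, and the toy numbers are all hypothetical, and it assumes the rating is treated as a number and predicted as the mean of the k closest players (a classification variant would take a majority vote instead).

```python
import math

def knn_predict(train_features, train_ratings, query, k=3):
    """Predict a rating as the mean rating of the k nearest training players."""
    if not 1 <= k <= len(train_features):
        raise ValueError("k must be between 1 and the number of training rows")
    # Euclidean distance from the query to every training row
    distances = [
        (math.dist(row, query), rating)
        for row, rating in zip(train_features, train_ratings)
    ]
    # Sort by distance and keep the k closest neighbours
    distances.sort(key=lambda pair: pair[0])
    nearest = distances[:k]
    return sum(rating for _, rating in nearest) / k

# Hypothetical toy data, not taken from the real dataset:
# each row is (pace, shooting, passing), paired with an overall rating.
players = [(90, 85, 80), (60, 55, 58), (75, 70, 72), (88, 90, 84), (62, 60, 61)]
ratings = [88, 60, 74, 90, 63]

new_player = (87, 86, 82)
predicted = knn_predict(players, ratings, new_player, k=2)
print(predicted)
```

In practice the features would usually be scaled to a common range first, since Euclidean distance otherwise lets the attribute with the largest numeric spread dominate the choice of neighbours.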