Music Festival Hits Prediction

This study explores the Music Features to identify likely hit songs which will hit the top charts, and takes the analysis further to predict songs more probable to be performed at Music Festivals. We use the Spotify data as well as Billboard data and predict hits using a unique approach of genre similarity using euclidean distance. The problem of predicting the popularity values is addressed as a classification problem and various models are tested to obtain the best results. The training has been carried out on a corpus of 27K+ songs. To enhance these results, the Music Festival data has been collected and filtered to include data from just a single music festival named ’Glastenbury’. Nearly 18 genres were listed each year for this festival and Time Series analysis was performed. The RMSE value obtained for the ARIMA model built was close to 0.13 after normalisation of the data which was followed by a genre prediction to obtain an F1 score of 0.9627 with a Decision Tree Classifier. The files used in this study have been collected and organized in this repository.

Data

The data files experimented with and the links to all the data dets used in this study is present under this directory.

EDA & Pre-Processing

Exploratory Data Anlaysis performed on the data as well as the pre-processing techniques carried out have been presented here.

Models

Further analysis of the data along with the performance of all the models experimented with are present in this directory.

Execution Steps

The cells of the Jupyter Notebook file needs to be run in order to produce results as shown in the output of each file. The path to to the data folder also needs to be specified/changed depending on where the data sets are placed. There are certain dataset dependencies due to generation of new data sets from the existing ones in some of the notebook files. However, the generated datasets have also been placed with their respective notebook files.

Name		Name	Last commit message	Last commit date
Latest commit History 139 Commits
Data		Data
EDA & Pre-Processing		EDA & Pre-Processing
Models		Models
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Music Festival Hits Prediction

Data

EDA & Pre-Processing

Models

Execution Steps

About

Releases

Packages

Contributors 3

Languages

manahshetty/DataAnalytics2020

Folders and files

Latest commit

History

Repository files navigation

Music Festival Hits Prediction

Data

EDA & Pre-Processing

Models

Execution Steps

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages