Spotify-like Music Streaming and Recommendation Service

This repository implements a music streaming and recommendation service similar to Spotify. It combines audio feature extraction, a recommendation model trained with Spark, and Kafka-based streaming behind a web application.

Project Overview:

The project leverages:

Free Music Archive (FMA) for a diverse music dataset.
MongoDB for scalable data storage.
Apache Spark for efficient large-scale data processing.
Apache Kafka for real-time music recommendation.

Repository Structure:

├── analysis_for_PCA.py # Script for finding the optimal number of PCA components for normalization.
├── feature_extraction.py # Script for extracting audio features such as MFCCs and loading them into MongoDB.
├── preprocessing.py # Script for cleaning up track metadata for the website.
├── model.py # Script for training the music recommendation model with Spark using MinHashLSH and approximate nearest neighbours.
├── app.py # Flask/Django app for the music streaming service.
└── producer.py # Script for streaming the dataset using Kafka.
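
The README does not say how `analysis_for_PCA.py` decides on the number of components. One common criterion, shown here only as a sketch, is to keep the smallest number of components whose cumulative explained variance reaches a chosen threshold. The function name, the threshold, and the sample ratios are all hypothetical; in practice the ratios could come from something like scikit-learn's `PCA().fit(X).explained_variance_ratio_`.

```python
from itertools import accumulate


def components_for_variance(explained_variance_ratio, threshold=0.95):
    """Return the smallest number of leading components whose cumulative
    explained variance reaches `threshold`."""
    if not 0.0 < threshold <= 1.0:
        raise ValueError("threshold must be in (0, 1]")
    running_totals = accumulate(explained_variance_ratio)
    for count, cumulative in enumerate(running_totals, start=1):
        if cumulative >= threshold:
            return count
    # The threshold was never reached, so keep every component.
    return len(explained_variance_ratio)


# Hypothetical ratios, in decreasing order as PCA reports them.
ratios = [0.5, 0.25, 0.125, 0.0625, 0.0625]
print(components_for_variance(ratios, threshold=0.9))
```

With these sample ratios the running totals are 0.5, 0.75, 0.875, and 0.9375, so four components are enough to cover 90% of the variance.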

Setup Instructions

1. Data Acquisition

Download and extract the Free Music Archive (FMA) dataset from here: https://github.com/mdeff/fma
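
Once the archive is extracted, the tracks have to be enumerated before any processing. The sketch below assumes the layout used by the `fma_small` subset, where each MP3 sits in a numbered subfolder and is named after its zero-padded track ID (for example `fma_small/000/000002.mp3`). The helper name `iter_tracks` is hypothetical and not part of this repository.

```python
from pathlib import Path


def iter_tracks(dataset_root):
    """Yield (track_id, path) for every MP3 file under `dataset_root`."""
    for path in sorted(Path(dataset_root).rglob("*.mp3")):
        # Assumed naming: the file stem is the zero-padded numeric track ID.
        if path.stem.isdigit():
            yield int(path.stem), path
```

Calling `list(iter_tracks("fma_small"))` would then give the track IDs and file paths to feed into the feature extraction step.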

2. Feature Extraction and Storage

Run `feature_extraction.py` to extract the necessary audio features (such as MFCCs) and load them into MongoDB.
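
As a rough illustration of this step (not the actual contents of `feature_extraction.py`), the sketch below computes MFCCs with librosa, reduces them to per-coefficient means and standard deviations, and upserts the result into a MongoDB collection through pymongo. The document fields, the choice of 20 coefficients, and all function names are assumptions. The third-party import is deferred so the summarising helper can be used on its own.

```python
from statistics import fmean, pstdev


def build_feature_document(track_id, mfcc_rows):
    """Summarise an MFCC matrix (one row per coefficient, one column per
    frame) as a flat document suitable for MongoDB."""
    return {
        "track_id": track_id,
        "mfcc_mean": [fmean(row) for row in mfcc_rows],
        "mfcc_std": [pstdev(row) for row in mfcc_rows],
    }


def extract_mfcc(path, n_mfcc=20):
    """Load an audio file and return its MFCC matrix as nested lists."""
    import librosa  # third-party dependency, imported only when needed

    samples, sample_rate = librosa.load(path, sr=None)
    mfcc = librosa.feature.mfcc(y=samples, sr=sample_rate, n_mfcc=n_mfcc)
    return mfcc.tolist()


def store_features(collection, track_id, path):
    """Extract features for one track and upsert them into `collection`,
    which is expected to be a pymongo Collection."""
    document = build_feature_document(track_id, extract_mfcc(path))
    collection.replace_one({"track_id": track_id}, document, upsert=True)
    return document
```

The collection could be obtained with, for example, `pymongo.MongoClient("mongodb://localhost:27017")["music"]["features"]`; the connection string, database name, and collection name here are placeholders.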

3. Model Training and Recommendation System

Develop and train the recommendation model:
- Use `model.py` to apply the machine learning algorithms via Apache Spark.
- Adjust parameters and algorithms as needed for optimal recommendations.
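
To make the idea behind MinHash concrete, here is a small pure-Python sketch. It is not the Spark code in `model.py`: Spark's `MinHashLSH` additionally buckets signatures so that `approxNearestNeighbors` avoids comparing every pair, whereas this example compares signatures by brute force. Tracks are assumed to be represented as sets of integer feature IDs, and all names and parameters are hypothetical.

```python
import random

PRIME = 2_147_483_647  # 2**31 - 1; feature IDs must be smaller than this


def make_hash_params(num_hashes, seed=0):
    """Create (a, b) pairs for hash functions h(x) = (a * x + b) % PRIME."""
    rng = random.Random(seed)
    return [(rng.randrange(1, PRIME), rng.randrange(0, PRIME))
            for _ in range(num_hashes)]


def minhash_signature(feature_ids, params):
    """Return the MinHash signature of a non-empty set of integer IDs."""
    if not feature_ids:
        raise ValueError("feature_ids must not be empty")
    return [min((a * x + b) % PRIME for x in feature_ids) for a, b in params]


def estimated_jaccard(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching positions."""
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


def nearest_neighbours(query_id, signatures, k=5):
    """Rank all other tracks by estimated similarity to `query_id`."""
    query = signatures[query_id]
    scored = [(estimated_jaccard(query, sig), track_id)
              for track_id, sig in signatures.items() if track_id != query_id]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(track_id, score) for score, track_id in scored[:k]]
```

More hash functions give a tighter similarity estimate at the cost of longer signatures, which is one of the parameters worth tuning when adjusting the model.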

4. Web Application and Real-Time Recommendations

Deploy the web application and set up real-time recommendations:
- Use `app.py` to launch the music streaming interface.
- Run `producer.py` to handle live music streaming based on user activity.
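
The producer side of this step might look roughly like the sketch below, which uses the kafka-python client. The topic name, broker address, and event fields are assumptions and are not taken from `producer.py`. The Kafka import is deferred so the publishing logic can be exercised with any object that offers `send` and `flush`.

```python
import json


def serialize_event(event):
    """Encode an event dictionary as UTF-8 JSON bytes."""
    return json.dumps(event, sort_keys=True).encode("utf-8")


def make_producer(bootstrap_servers="localhost:9092"):
    """Create a Kafka producer that serialises values as JSON."""
    from kafka import KafkaProducer  # kafka-python, imported only when needed

    return KafkaProducer(
        bootstrap_servers=bootstrap_servers,
        value_serializer=serialize_event,
    )


def publish_events(producer, topic, events):
    """Send each event to `topic`, flush, and return how many were sent."""
    count = 0
    for event in events:
        producer.send(topic, event)
        count += 1
    # Block until all buffered messages have been handed to the broker.
    producer.flush()
    return count
```

A call such as `publish_events(make_producer(), "user-activity", [{"user_id": 1, "track_id": 2}])` would then push one listening event to a hypothetical `user-activity` topic for a consumer to turn into recommendations.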

Technologies and Challenges:

Used Technologies:

MongoDB: For efficient management of large datasets.
Apache Spark: For scalable data processing and machine learning.
Apache Kafka: For real-time data streaming that drives dynamic music recommendations.
Python: Primary language for backend and data processing scripts.
Flask/Django: Frameworks for web application development.

Implementation Challenges:

Data Handling and Processing: Managing large volumes of audio data efficiently.
Real-Time Data Streaming: Implementing a robust system with Apache Kafka for live recommendations.
User Interface Development: Creating an engaging and responsive web interface.

Team:

Manal Aamir: GitHub
Mohammad Malik: GitHub
Aqsa Fayaz: GitHub

