Emotion-to-MBTI Prediction: Fusion Model and App

A multimodal approach to MBTI prediction using audio and text features to create a fusion neural network model and frontend web app.

This repository contains two main components:

  1. Model Training: Scripts for training the fusion model (Audio CNN + FNN).
  2. Web Application: A Next.js frontend and Flask backend to interact with the trained model.

Getting Started

Prerequisites

We use Python 3.10. On macOS it can be installed with Homebrew: brew install python@3.10

Python Environments

You need two separate Python virtual environments:

  • Training Environment: For training models.
  • Backend Environment: For running the Flask backend (create it from inside the app/backend directory).

Create the environments as follows:

Training Environment:
python3.10 -m venv training_env
source training_env/bin/activate  # On Windows: training_env\Scripts\activate
pip install -r requirements.txt
deactivate
Backend Environment:
cd app/backend
python3.10 -m venv backend_env
source backend_env/bin/activate  # On Windows: backend_env\Scripts\activate
pip install -r requirements.txt
deactivate
Frontend Setup:
cd app/frontend
npm install

Part 1: Fusion Model Creation & Training

Step 1: Audio Convolutional Neural Network (CNN)

Feature Extraction: Extract audio features from the RAVDESS emotion audio dataset (takes ~10-15 minutes to complete)

source training_env/bin/activate  # Activate the training environment
python cnn/ravdess_feat_extraction.py
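
The exact features and output format are defined in cnn/ravdess_feat_extraction.py. As a rough illustration of this kind of preprocessing, here is a minimal sketch that computes MFCCs over a directory of WAV files with librosa; the data/RAVDESS path, the 40-coefficient choice, and the .npy outputs are assumptions, not the script's actual settings:

import glob
import os
import numpy as np
import librosa

# Minimal sketch of MFCC-based feature extraction; paths, feature type,
# and output format are illustrative only.
features, labels = [], []
for path in glob.glob("data/RAVDESS/**/*.wav", recursive=True):
    y, sr = librosa.load(path, sr=22050)                  # load waveform at a fixed sample rate
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=40)    # 40 MFCC coefficients per frame
    features.append(mfcc.mean(axis=1))                    # average over time -> fixed-length vector
    labels.append(os.path.basename(path).split("-")[2])   # RAVDESS encodes the emotion in the filename

np.save("ravdess_features.npy", np.array(features))
np.save("ravdess_labels.npy", np.array(labels))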

Step 2: Fusion Feed-Forward Neural Network (FNN)

Feature Extraction: Extract features from the CREMA-D dataset (10-15 minutes to complete)

python fnn/01-cremad_feat_extraction.py

Train the FNN (~10-20 minutes to complete):

python fnn/02-cremad_FNN.py
deactivate
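
The actual fusion architecture and training settings live in fnn/02-cremad_FNN.py. As a sketch of what a feed-forward network over pre-extracted features can look like, here is a minimal PyTorch version that concatenates an audio feature vector with a text feature vector; the framework, layer sizes, feature dimensions, and 16-way MBTI output are all assumptions:

# Rough sketch of a fusion feed-forward network over concatenated audio and
# text feature vectors. PyTorch, the dimensions, and the 16-way MBTI output
# are assumptions; the real model is defined in fnn/02-cremad_FNN.py.
import torch
import torch.nn as nn

class FusionFNN(nn.Module):
    def __init__(self, audio_dim: int, text_dim: int, num_classes: int = 16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(audio_dim + text_dim, 128), nn.ReLU(), nn.Dropout(0.3),
            nn.Linear(128, 64), nn.ReLU(),
            nn.Linear(64, num_classes),
        )

    def forward(self, audio_feats, text_feats):
        return self.net(torch.cat([audio_feats, text_feats], dim=1))

# Minimal training loop on placeholder data (256 samples).
model = FusionFNN(audio_dim=40, text_dim=32)
audio = torch.randn(256, 40)
text = torch.randn(256, 32)
labels = torch.randint(0, 16, (256,))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()
for epoch in range(20):
    loss = loss_fn(model(audio, text), labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()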

Part 2: Web Application

Backend Setup (Flask)

cd app/backend
source backend_env/bin/activate  # Activate the backend environment
flask run --port=5000  # Start the Flask backend server
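
The real routes and model-loading code are in app/backend. For orientation, a hypothetical Flask endpoint that accepts an uploaded recording and returns a prediction might look like this (the /predict route, the "audio" field name, and the extract_features/predict_mbti helpers are placeholders, not the actual API):

# Hypothetical sketch of a Flask prediction endpoint; the real routes,
# field names, and model code live in app/backend.
from flask import Flask, request, jsonify

app = Flask(__name__)

@app.route("/predict", methods=["POST"])
def predict():
    audio = request.files.get("audio")      # uploaded recording from the frontend
    if audio is None:
        return jsonify({"error": "no audio file provided"}), 400
    features = extract_features(audio)      # hypothetical: same pipeline as training
    mbti_type = predict_mbti(features)      # hypothetical: run the trained fusion model
    return jsonify({"mbti": mbti_type})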

Frontend Setup (Next.js)

cd ../frontend
npm run dev  # Start the Next.js frontend server

Access the Application

Open your browser and go to:

http://localhost:3000/

Usage Instructions

  1. You will be automatically redirected to http://localhost:3000/AudioRecorder within 20 seconds.
  2. Press the "Record" button and talk about your day for 20 seconds.
  3. After recording, you will be redirected to the MBTI Prediction Page in approximately 30 seconds.

High-Level File Structure

.
├── app/
│   ├── frontend/             # Next.js frontend
│   ├── backend/              # Flask backend
│   └── requirements.txt      # Python dependencies for backend
├── cnn/                      # CNN-related scripts
│   ├── ravdess_CNN.py
│   └── ravdess_feat_extraction.py
├── fnn/                      # FNN-related scripts
│   ├── 01-cremad_feat_extraction.py
│   └── 02-cremad_FNN.py
└── requirements.txt          # Python dependencies for training

The cnn/ravdess_CNN.py file defines the CNN architecture, which is trained on the audio features extracted from the RAVDESS dataset.
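
For readers who want a feel for that architecture without opening the file, the following is an illustrative 1-D CNN over per-frame audio features in PyTorch; the channel counts, kernel sizes, and 8-way emotion output are assumptions rather than the repository's actual configuration:

# Illustrative 1-D CNN over per-frame audio features (e.g. MFCCs).
# Channel counts, kernel sizes, and the 8 emotion classes are assumptions;
# the real architecture is in cnn/ravdess_CNN.py.
import torch
import torch.nn as nn

class AudioCNN(nn.Module):
    def __init__(self, n_features: int = 40, num_emotions: int = 8):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(n_features, 64, kernel_size=5, padding=2), nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(64, 128, kernel_size=5, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),            # collapse the time axis
        )
        self.classifier = nn.Linear(128, num_emotions)

    def forward(self, x):                       # x: (batch, n_features, time)
        return self.classifier(self.conv(x).squeeze(-1))

# Example forward pass on a dummy batch of 4 clips, 100 frames each.
logits = AudioCNN()(torch.randn(4, 40, 100))    # -> shape (4, 8)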

Notes

  • Ensure you activate the correct virtual environment before running scripts or servers.
  • The training environment is for running CNN and FNN scripts.
  • The backend environment is for running the Flask backend.
  • Both the Flask backend and Next.js frontend must run simultaneously for the application to work.
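
With both servers running, a quick way to confirm the backend is reachable is to post a short clip from Python. This follows the hypothetical /predict route and "audio" field from the backend sketch above, which may differ from the real API:

import requests

# Smoke test: send a local WAV file to the (hypothetical) prediction route.
with open("sample.wav", "rb") as f:
    resp = requests.post("http://localhost:5000/predict", files={"audio": f})
print(resp.status_code, resp.json())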
