STAT3007_Project

Teaching a robot to feel.

Instructions for runing training and testing files

Before running the notebooks with _training_testing.ipynb or _tuning.ipynb in the name, you will first have to create a shortcut to the google drive containing the noisy data. The link to this google drive is: https://drive.google.com/drive/folders/1n9xwoN4oa4teVaBLyc5bvzuJZ70zhhQk?usp=sharing

file description

model_training/_Train_Test.ipynb - a pipeline for training models
model_tuning/_Tuning.ipynb - hyperparameter for models

Audio sample names

Filename layout:emotion-intensity-statement-repetition-actor.wav

Fearful (06) Emotion (01 = neutral, 02 = calm, 03 = happy, 04 = sad, 05 = angry, 06 = fearful, 07 = disgust, 08 = surprised)
Normal intensity (01) Emotional intensity (01 = normal, 02 = strong). NOTE: There is no strong intensity for the 'neutral' emotion.
Statement "dogs" (02) Statement (01 = "Kids are talking by the door", 02 = "Dogs are sitting by the door").
1st Repetition (01 )Repetition (01 = 1st repetition, 02 = 2nd repetition).
12th Actor (12) Actor (01 to 24. Odd numbered actors are male, even numbered actors are female).

Filename example: 06-01-02-01-12.mp4

Pre-process steps:

load audio with downsampled sampling rate 16000Hz
truncate radio slience before and after each audio clip with a fixed, hand-crafted threshold
normalise amplitude waveform with 0 mean and unit variance
pick a certain initial duration of the audio sample (if shorter than the sampling duration, pre-pad the sample with zeros)
compute mel-spectrogram (amplitude -> power spectrum -> log-spectrogram -> mel-scaling)

Train/Test split:

CNN models:

split amongst authors (16/8)
induce noises

Autoencoder Training:

Strafified sampling (see report)

Example noisy data subset:

79 noises for the following:
5 emotions- strong intensity -dog statement - 1 rep - 1 actor
The 5 emotions: calm (02), happy(03), sad(04), angry(05), suprised(08)

Architecture included in our report:

CNN
CNN + LSTM
CNN + RGB
RGB + CNN + LSTM
Autoencoder + CNN

Integrating Colab with Github

The following link shows all the available .ipynb files from our repo that can be opened by colab: https://colab.research.google.com/github/JordanFoss/STAT3007_Project

More details can be found in colab-github-demo.ipynb

Presentation Slides

https://docs.google.com/presentation/d/1QSJ8ocBKJbPoVOcoXiKNEMI355Wn7ljikF_xhB7_KaM/edit#slide=id.p

Name		Name	Last commit message	Last commit date
Latest commit History 217 Commits
.ipynb_checkpoints		.ipynb_checkpoints
Audio_Speech_Actors_01-24		Audio_Speech_Actors_01-24
Noisy_generation_files		Noisy_generation_files
__pycache__		__pycache__
codes_to_hand_in		codes_to_hand_in
misc		misc
model_training		model_training
model_tuning		model_tuning
noisy_generation_code		noisy_generation_code
research_papers		research_papers
sample-noisy-speech-actor-11		sample-noisy-speech-actor-11
truncated_samples		truncated_samples
Final_Report-EmotionClassificationOnAudio.pdf		Final_Report-EmotionClassificationOnAudio.pdf
Model_Functions.py		Model_Functions.py
Models.py		Models.py
README.md		README.md
codes_to_hand_in.zip		codes_to_hand_in.zip
data_loading.py		data_loading.py
jet.mat		jet.mat
pre_process.py		pre_process.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

STAT3007_Project

Instructions for runing training and testing files

file description

Audio sample names

Pre-process steps:

Train/Test split:

CNN models:

Autoencoder Training:

Example noisy data subset:

Architecture included in our report:

Integrating Colab with Github

Presentation Slides

About

Releases

Packages

Contributors 3

Languages

JordanFoss/STAT3007_Project

Folders and files

Latest commit

History

Repository files navigation

STAT3007_Project

Instructions for runing training and testing files

file description

Audio sample names

Pre-process steps:

Train/Test split:

CNN models:

Autoencoder Training:

Example noisy data subset:

Architecture included in our report:

Integrating Colab with Github

Presentation Slides

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages