Speaker Diarization using OpenAI Whisper and Pyannote

Table of content

Introduction
Prerequisties
Docker Setup
Usage
Screenshots

Introduction

Speaker Diarization pipeline based on OpenAI Whispe and Pyannote.

Prerequisties

Docker==20.10.7
Nvidia-Docker

Docker Setup

I prefer using the Docker because of simplicity. You just need to run the gpu-enabled docker container and everything is setup for you

git clone https://github.com/leviethung2103/whisper_speaker_diarization
cd whisper_speaker_diarization
docker run --gpus all -d -it -p 8848:8888 -v $(pwd):/home/jovyan/work -e GRANT_SUDO=yes -e JUPYTER_ENABLE_LAB=yes --user root cschranz/gpu-jupyter:v1.4_cuda-11.6_ubuntu-20.04

Usage

Access the jupyter lab via http://localhost:8848
Start with jupyter notebook 01_Speaker_Diarizateion.ipynb.
Default password is gpu-jupyter

Screenshots

Here is the output after running modules.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
.gitignore		.gitignore
01_Speaker_Diarization.ipynb		01_Speaker_Diarization.ipynb
README.md		README.md
audio.mp3		audio.mp3
audio.wav		audio.wav
main.py		main.py
requirements.txt		requirements.txt
sample1.wav		sample1.wav
sample2.wav		sample2.wav

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speaker Diarization using OpenAI Whisper and Pyannote

Table of content

Introduction

Prerequisties

Docker Setup

Usage

Screenshots

About

Releases

Packages

Languages

leviethung2103/whisper_speaker_diarization

Folders and files

Latest commit

History

Repository files navigation

Speaker Diarization using OpenAI Whisper and Pyannote

Table of content

Introduction

Prerequisties

Docker Setup

Usage

Screenshots

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages