Skip to content

hon9kon9ize/speech-dataset-pipeline

Repository files navigation

Speech Dataset Pipeline - WIP

  • Step 0: Download audio files from RTHK
  • Step 1: Split audio files into smaller chunks
  • Step 2: Source separation
  • Step 4: Language detection
  • Step 3: Voice enhancement

Prerequisites

pip install -r requirements.txt

Usage

# Download audio file and convert to 16kHz, at this stage, it would create a folder `audios` for original audio files and `audios_16k` for 16kHz audio files
python step-0.py

# Split audio files into smaller chunks and speaker diarization
python step-1.py --audio_root_path audios_16k

# Source separation, remove background music
python step-1.py --audio_root_path chunks

TODO...

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published