Kaldi-based Korean ASR (한국어 음성인식) open-source project
-
Updated
Aug 21, 2023 - Shell
Kaldi-based Korean ASR (한국어 음성인식) open-source project
A list of publically available audio data that anyone can download for ASR or other speech activities
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf
☕🇧🇷 Scripts para o Kaldi em Português Brasileiro
Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC
scripts to align a given wave to its transcription using trained models by Kaldi
This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"
Scripts for training Kaldi for German speech recognition (ASR).
Long audio alignment using Kaldi
Automatic Speech Recognition (ASR) - German
BurrMill core
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
This is the repository for my version of Kaldi for Dummies example.
A quick & dirty script to generate and view subtitles and transcriptions for your multimedia files using ggerganov/whisper.cpp
End-to-End Arabic ASR using DeepSpeech engine
HHM-based Arabic ASR using Kaldi engine
EC499: Major Project
Kaldi-based audio-visual speech recognition
Add a description, image, and links to the asr topic page so that developers can more easily learn about it.
To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."