Stars
Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2
Metadata and versioning details for the Common Voice dataset
Web-based environment for live coding algorithmic patterns, incorporating a faithful port of TidalCycles to JavaScript
Geometric loss functions between point clouds, images and volumes
An official reimplementation of the method described in the INTERSPEECH 2021 paper "Speech Resynthesis from Discrete Disentangled Self-Supervised Representations".
Keyword spotting and forced alignment in any language
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Open source speech to text models for Indic Languages
Zerospeech Challenge 2021: validation and evaluation software
A PyTorch Implementation of End-to-End Models for Speech-to-Text
A list of speech tasks and datasets that can provide training data for generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.
Speech Recognition using DeepSpeech2.
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Perform analyses on results files obtained with the ABXpy library in a phone discrimination task BY (i.e. conditioned on) speaker and preceding and following contexts. (PHOne BY Speaker CONtext -> …
A non-native English corpus for pronunciation scoring task
A pure Python module for reading and writing Kaldi ark files
Unsupervised acoustic word embeddings evaluated on Buckeye English and NCHLT Xitsonga data in Python 3.
A Python toolbox for speech features extraction
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure
Correspondence and autoencoder neural network training for speech using Pylearn2.
Robust Speech Recognition via Large-Scale Weak Supervision
A Python Library for Self Organizing Map (SOM)