

Organizations

@LeAP-laboratory


Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2

Jupyter Notebook 82 28 Updated Mar 14, 2024

Metadata and versioning details for the Common Voice dataset

JavaScript 145 15 Updated Dec 17, 2024

Web-based environment for live coding algorithmic patterns, incorporating a faithful port of TidalCycles to JavaScript

JavaScript 751 133 Updated Feb 11, 2025

Geometric loss functions between point clouds, images and volumes

Python 614 62 Updated Jan 19, 2024

POT : Python Optimal Transport

Python 2,499 511 Updated Jan 27, 2025

An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Python 400 57 Updated Aug 29, 2023

Charsiu: A neural phonetic aligner.

Jupyter Notebook 292 36 Updated Sep 19, 2022

Keyword spotting and forced alignment in any language

Python 51 3 Updated Jun 29, 2024
Python 1 Updated Nov 2, 2021

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,247 170 Updated Feb 14, 2025

Open source speech to text models for Indic Languages

294 49 Updated Sep 16, 2022

Zerospeech Challenge 2021: validation and evaluation software

Python 12 4 Updated Jun 13, 2022

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Python 757 177 Updated Jul 6, 2023
Python 5 3 Updated Nov 29, 2022

This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.

75 7 Updated Jun 7, 2024

Speech Recognition using DeepSpeech2.

Python 2,113 620 Updated Dec 13, 2022

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,332 491 Updated Feb 12, 2025

A Python wrapper for Kaldi

Python 1,007 246 Updated Jan 23, 2025

Perform analyses on results files obtained with the ABXpy library in a phone discrimination task BY (i.e. conditioned on) speaker and preceding and following contexts. (PHOne BY Speaker CONtext -> …

Python 1 1 Updated Dec 8, 2021

A non-native English corpus for pronunciation scoring task

123 20 Updated Jul 16, 2024

Binding generator to wrap C++ for Python using LLVM.

C++ 980 124 Updated Sep 6, 2024

A pure Python module for reading and writing Kaldi ark files

Python 252 36 Updated Sep 10, 2023

Unsupervised acoustic word embeddings evaluated on Buckeye English and NCHLT Xitsonga data in Python 3.

Jupyter Notebook 8 5 Updated May 3, 2022

A Python toolbox for speech features extraction

Python 161 24 Updated Feb 8, 2023

Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure

C++ 89 24 Updated Feb 23, 2018

Correspondence and autoencoder neural network training for speech using Pylearn2.

Python 13 7 Updated Dec 9, 2015

Robust Speech Recognition via Large-Scale Weak Supervision

Python 76,646 9,166 Updated Jan 4, 2025

A Python Library for Self Organizing Map (SOM)

Jupyter Notebook 542 244 Updated Apr 7, 2023