
add parakeet finetuning tutorials #201

Merged
merged 5 commits into nvidia-riva:main
Nov 14, 2024

Conversation

jmayank1511
Contributor

No description provided.

@rmittal-github rmittal-github force-pushed the parakeet_finetuning_demo branch from 82996de to 76018fd Compare October 11, 2024 05:23
@rmittal-github
Collaborator

Rebased and added a minor fix found in another thread. @nv-uvaidya, could you review the new tutorial as well?

@myungjongk

I think it is better to rename the finetuning tutorial file from `asr_finetune_parakeet_nemo.ipynb` to `asr-finetune-parakeet-nemo.ipynb`.

"# How to Fine-Tune a Riva ASR Acoustic Model with NVIDIA NeMo\n",
"This tutorial walks you through how to fine-tune an NVIDIA Riva ASR acoustic model with NVIDIA NeMo.\n",
"\n",
"**Important**: If you plan to fine-tune an ASR acoustic model using the same tokenizer with which the model was trained, skip this tutorial and refer to the \"Sub-word Encoding CTC Model\" section (starting with the \"Load pre-trained model\" subsection) of the [NeMo ASR Language Finetuning tutorial](https://github.com/NVIDIA/NeMo/blob/main/tutorials/asr/ASR_CTC_Language_Finetuning.ipynb)."


We can remove this line, as we already handle finetuning with the same tokenizer in this tutorial. Or we can just refer to NeMo's tutorial as an additional finetuning resource.


"\n",
"Hybrid RNNT-CTC models are a group of models with both RNNT and CTC decoders. Training a hybrid model speeds up convergence for the CTC model and lets you use a single model that works as both a CTC and an RNNT model. This category can be used with any of the ASR models. Hybrid models use two decoders, CTC and RNNT, on top of the encoder.\n",
"\n",
"NeMo uses `.yml` files to configure the training parameters. You may update them directly by editing the configuration file or override them from the command-line interface. For example, if the number of epochs needs to be modified along with a change in the learning rate, you can add `trainer.max_epochs=100` and `optim.lr=0.02` to the command and train the model.\n",
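As a side note for reviewers: NeMo routes command-line overrides like these through Hydra/OmegaConf. A minimal pure-Python sketch (illustrative only, not NeMo's actual implementation) of how dotted-path overrides such as `trainer.max_epochs=100` behave:

```python
def apply_overrides(cfg, overrides):
    """Apply Hydra-style dotted overrides like 'trainer.max_epochs=100' to a nested dict."""
    for item in overrides:
        path, _, raw = item.partition("=")
        keys = path.split(".")
        node = cfg
        for key in keys[:-1]:
            node = node.setdefault(key, {})
        # naive value coercion: try int, then float, else keep the raw string
        try:
            value = int(raw)
        except ValueError:
            try:
                value = float(raw)
            except ValueError:
                value = raw
        node[keys[-1]] = value
    return cfg

config = {"trainer": {"max_epochs": 50}, "optim": {"lr": 0.01}}
config = apply_overrides(config, ["trainer.max_epochs=100", "optim.lr=0.02"])
print(config)  # {'trainer': {'max_epochs': 100}, 'optim': {'lr': 0.02}}
```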


`.yml` can be replaced with `.yaml`.


"\n",
"NeMo uses `.yml` files to configure the training parameters. You may update them directly by editing the configuration file or override them from the command-line interface. For example, if the number of epochs needs to be modified along with a change in the learning rate, you can add `trainer.max_epochs=100` and `optim.lr=0.02` to the command and train the model.\n",
"\n",
"The following sample command uses the `speech_to_text_hybrid_rnnt_ctc_bpe.py` script in the `examples` folder to train/fine-tune a Parakeet-Hybrid ASR model for 1 epoch. For other ASR models, such as Citrinet or Conformer, you can find the appropriate config files in the NeMo GitHub repo under [examples/asr/conf/](https://github.com/NVIDIA/NeMo/tree/main/examples/asr/conf).\n"


`speech_to_text_hybrid_rnnt_ctc_bpe.py` needs to be replaced with `speech_to_text_finetune.py`.
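If this suggestion is adopted, the cell might look something like the following sketch. The config path/name and manifest placeholders are assumptions, not taken from the tutorial:

```shell
# Sketch only: config path/name and manifest placeholders are assumptions.
python examples/asr/speech_to_text_finetune.py \
    --config-path=conf/asr_finetune \
    --config-name=speech_to_text_finetune \
    model.train_ds.manifest_filepath=<path/to/train_manifest.json> \
    model.validation_ds.manifest_filepath=<path/to/val_manifest.json> \
    trainer.max_epochs=1
```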


"source": [
"#### Convert to Riva\n",
"\n",
"Convert the downloaded model to the `.riva` format. We will set the encryption key with `--key=nemotoriva`. Choose a different encryption key value when generating `.riva` models for production.\n",


I think we can mention that `--onnx-opset 18` is needed for Riva version 2.15.0 and above.


Contributor Author


Not needed for RNNT models.

"outputs": [],
"source": [
"riva_file_path = ctc_model_path[:-5]+\".riva\"\n",
"!nemo2riva --key=nemotoriva --out $riva_file_path $ctc_model_path"


`--onnx-opset 18` can be added.
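Putting the suggestions in this thread together, the export cell might become something like this sketch. The variables and key value are the tutorial's own; only the flag placement is assumed:

```shell
# Sketch: adds the suggested --onnx-opset 18 (needed for Riva 2.15.0+ per the
# review; the author notes it is not needed for RNNT models).
nemo2riva --key=nemotoriva --onnx-opset 18 --out $riva_file_path $ctc_model_path
```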



@myungjongk myungjongk left a comment


Looks good!

@rmittal-github rmittal-github merged commit 915f9e7 into nvidia-riva:main Nov 14, 2024