add speaker diarization tutorial #144

virajkarandikar · 2023-04-19T13:21:58Z

The sample WAV file is taken from https://freesound.org/people/SamKolber/sounds/203020/

The first block prints the raw results:

ASR Transcript with Speaker Diarization:
 results {
  alternatives {
    transcript: "Well, I\'m Bill Bill Turner. I\'m from "
    confidence: -1.78428745
    words {
      start_time: 2800
      end_time: 2960
      word: "Well,"
      confidence: -1.66857708
      speaker_tag: 1
    }
    words {
      start_time: 3000
      end_time: 3160
      word: "I\'m"
      confidence: -0.574845552
      speaker_tag: 1
    }
    words {
      start_time: 3240
      end_time: 3480
      word: "Bill"
      confidence: -3.25688601
      speaker_tag: 1
    }
...
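As a rough illustration of how the per-word fields in the raw output above could be consumed, the sketch below merges consecutive words that share a `speaker_tag` into speaker turns. The `Word` class and the `speaker_turns` helper are hypothetical stand-ins written for this example; they are not part of the notebook or of the Riva client, and the times are assumed to be in the same unit as the raw output (they appear to be milliseconds).

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass
class Word:
    # Hypothetical stand-in mirroring the fields of one `words` entry above.
    word: str
    start_time: int
    end_time: int
    speaker_tag: int


def speaker_turns(words):
    """Merge consecutive words with the same speaker_tag into
    (speaker_tag, start_time, end_time, text) tuples."""
    turns = []
    for tag, group in groupby(words, key=lambda w: w.speaker_tag):
        group = list(group)
        text = " ".join(w.word for w in group)
        turns.append((tag, group[0].start_time, group[-1].end_time, text))
    return turns


# The three word entries shown in the raw output above.
words = [
    Word("Well,", 2800, 2960, 1),
    Word("I'm", 3000, 3160, 1),
    Word("Bill", 3240, 3480, 1),
]
for tag, start, end, text in speaker_turns(words):
    print(f"Speaker {tag} [{start}-{end}]: {text}")
```

Because `groupby` only merges adjacent items, a speaker who talks, is interrupted, and talks again yields two separate turns, which is usually what a diarized transcript should show.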

The second block prints the text colored according to each word's speaker tag.
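One possible way to color text by speaker tag is sketched below with ANSI escape codes. This is not the notebook's code: the `colorize` helper, the color palette, and the tag-to-color mapping are all assumptions made for illustration.

```python
# ANSI foreground colors; the palette and its order are arbitrary choices.
ANSI_COLORS = ["\033[31m", "\033[32m", "\033[33m", "\033[34m", "\033[35m", "\033[36m"]
ANSI_RESET = "\033[0m"


def colorize(words):
    """Return the text with each word wrapped in the color of its speaker tag.

    `words` is an iterable of (word, speaker_tag) pairs.
    """
    pieces = []
    for word, tag in words:
        # Wrap around the palette so any integer tag maps to some color.
        color = ANSI_COLORS[tag % len(ANSI_COLORS)]
        pieces.append(f"{color}{word}{ANSI_RESET}")
    return " ".join(pieces)


# The three words from the raw output, all tagged as speaker 1.
print(colorize([("Well,", 1), ("I'm", 1), ("Bill", 1)]))
```

Resetting the color after every word keeps the output safe to concatenate with uncolored text, at the cost of a few extra escape sequences.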

Bug 4046294

LynseyFabel

I've completed my review and left a few comments. Thank you.

LynseyFabel · 2023-04-20T20:38:31Z

asr-speaker-diarization.ipynb

+    "\n",
+    "### Sample applications\n",
+    "\n",
+    "Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to ([SpeechSquad)](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information.  \n",


Suggested change

-    "Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to ([SpeechSquad)](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information. \n",
+    "Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to [SpeechSquad](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information. \n",

These links are broken: https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html and https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html
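A small sketch of how the links in a notebook markdown string could be collected for checking is shown below. The `extract_links` helper and its regular expression are assumptions for illustration, not part of this PR; the actual HTTP check is left out, so each returned URL would still need to be opened or requested separately.

```python
import re

# Matches inline markdown links of the form [text](url) with an http(s) URL.
LINK_RE = re.compile(r"\[([^\]]+)\]\((https?://[^)\s]+)\)")


def extract_links(markdown):
    """Return a list of (link_text, url) pairs found in a markdown string."""
    return LINK_RE.findall(markdown)


cell = (
    "such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/"
    "user-guide/docs/samples/weather.html) for more information."
)
for text, url in extract_links(cell):
    print(f"{text}: {url}")
```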

LynseyFabel · 2023-04-20T20:40:22Z

asr-speaker-diarization.ipynb

+    "\n",
+    "### Additional resources\n",
+    "\n",
+    "For more information about each of the APIs and their functionalities, refer to the [documentation](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/protobuf-api/protobuf-api-root.html)."


This link is broken.

Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>

LynseyFabel

I've completed my review and left a few comments. Thank you.

LynseyFabel · 2023-04-25T15:38:15Z

asr-basics.ipynb

    "\n",
-    "Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to ([SpeechSquad)](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information.  \n",
-    "Refer to the *Sample Application* section in the [Riva developer documentation](https://developer.nvidia.com/) for more information.\n",
+    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information.  \n",


Suggested change

-    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
+    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sample Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",

LynseyFabel · 2023-04-25T15:39:20Z

asr-customize-vocabulary-and-lexicon.ipynb

    "\n",
-    "Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to ([SpeechSquad)](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information.  \n",
-    "Refer to the *Sample Application* section in the [Riva developer documentation](https://developer.nvidia.com/) for more information.\n",
+    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information.  \n",


Suggested change

-    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
+    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sample Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",

LynseyFabel · 2023-04-25T15:40:02Z

asr-wordboosting.ipynb

    "\n",
-    "Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to ([SpeechSquad)](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information.  \n",
-    "Refer to the *Sample Application* section in the [Riva developer documentation](https://developer.nvidia.com/) for more information.\n",
+    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information.  \n",


Suggested change

-    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
+    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sample Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",

LynseyFabel · 2023-04-25T15:40:20Z

asr-speaker-diarization.ipynb

+    "\n",
+    "### Sample Applications\n",
+    "\n",
+    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information.  \n",


Suggested change

-    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
+    "Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sample Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",

* add speaker diarization tutorial (#144)
* add speaker diarization tutorial
* fix broken links
Bug 4046294
Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>
* [NMT] Add Megatron multilingual and S2S/S2T services (#145)
* Add Megatron multilingual and S2S/S2T services
* apply suggestions for changes in markdown
Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>
---------
Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>
* Fix EKS deployment tutorial (#146)
There is no separate client image now. Use built-in clients in riva-speech image.
* Updating Notebook Table with Megatron NMT models (#143)
* Adding megatron models to the notebook + tested
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* Updated models table
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* Update review suggestion
Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>
---------
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: rmittal-github <61574997+rmittal-github@users.noreply.github.com>
Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>
* fix s2s config typo (#150)
---------
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: Viraj Karandikar <16838694+virajkarandikar@users.noreply.github.com>
Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>
Co-authored-by: Ashish Sardana <ashishsardana21@gmail.com>
Co-authored-by: David <amosalla@asu.edu>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>

add speaker diarization tutorial

836c450

Bug 4046294

virajkarandikar force-pushed the vkarandikar_add_diarization_tutorial branch from 13779b4 to 836c450 Compare April 19, 2023 13:22

virajkarandikar requested review from messiaen, LynseyFabel and rmittal-github April 19, 2023 13:23

LynseyFabel reviewed Apr 20, 2023

virajkarandikar and others added 2 commits April 21, 2023 10:39

Apply suggestions from code review

25cf35e

Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>

fix broken links

4a9e9ae

virajkarandikar requested a review from LynseyFabel April 24, 2023 03:00

LynseyFabel approved these changes Apr 25, 2023

virajkarandikar merged commit aa79927 into nvidia-riva:release/2.11.0 Apr 26, 2023

virajkarandikar deleted the vkarandikar_add_diarization_tutorial branch April 26, 2023 03:21
