Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add speaker diarization tutorial #144

Conversation

virajkarandikar
Copy link
Contributor

@virajkarandikar virajkarandikar commented Apr 19, 2023

Sample wav file taken from https://freesound.org/people/SamKolber/sounds/203020/

First block prints raw results.

ASR Transcript with Speaker Diarization:
 results {
  alternatives {
    transcript: "Well, I\'m Bill Bill Turner. I\'m from "
    confidence: -1.78428745
    words {
      start_time: 2800
      end_time: 2960
      word: "Well,"
      confidence: -1.66857708
      speaker_tag: 1
    }
    words {
      start_time: 3000
      end_time: 3160
      word: "I\'m"
      confidence: -0.574845552
      speaker_tag: 1
    }
    words {
      start_time: 3240
      end_time: 3480
      word: "Bill"
      confidence: -3.25688601
      speaker_tag: 1
    }
...

Second block prints colored text based on speaker tag.

image

@virajkarandikar virajkarandikar force-pushed the vkarandikar_add_diarization_tutorial branch from 13779b4 to 836c450 Compare April 19, 2023 13:22
Copy link
Contributor

@LynseyFabel LynseyFabel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've completed my review and left a few comments. Thank you.

asr-speaker-diarization.ipynb Outdated Show resolved Hide resolved
asr-speaker-diarization.ipynb Outdated Show resolved Hide resolved
asr-speaker-diarization.ipynb Outdated Show resolved Hide resolved
asr-speaker-diarization.ipynb Outdated Show resolved Hide resolved
asr-speaker-diarization.ipynb Outdated Show resolved Hide resolved
asr-speaker-diarization.ipynb Outdated Show resolved Hide resolved
asr-speaker-diarization.ipynb Outdated Show resolved Hide resolved
"\n",
"### Sample applications\n",
"\n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to ([SpeechSquad)](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information. \n",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
"Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to ([SpeechSquad)](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information. \n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to [SpeechSquad](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information. \n",

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed

asr-speaker-diarization.ipynb Outdated Show resolved Hide resolved
"\n",
"### Additional resources\n",
"\n",
"For more information about each of the APIs and their functionalities, refer to the [documentation](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/protobuf-api/protobuf-api-root.html)."
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this link is broken

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed

virajkarandikar and others added 2 commits April 21, 2023 10:39
Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>
Copy link
Contributor

@LynseyFabel LynseyFabel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've completed my review and left a few comments. Thank you.

"\n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to ([SpeechSquad)](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information. \n",
"Refer to the *Sample Application* section in the [Riva developer documentation](https://developer.nvidia.com/) for more information.\n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sample Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",

"\n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to ([SpeechSquad)](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information. \n",
"Refer to the *Sample Application* section in the [Riva developer documentation](https://developer.nvidia.com/) for more information.\n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sample Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",

"\n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build applications such as a [chatbot](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/weather.html), a domain specific speech recognition, [keyword (entity) recognition system](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/callcenter.html), or simply how Riva allows scaling out for handling massive amounts of requests at the same time. Refer to ([SpeechSquad)](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/speechsquad.html) for more information. \n",
"Refer to the *Sample Application* section in the [Riva developer documentation](https://developer.nvidia.com/) for more information.\n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sample Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",

"\n",
"### Sample Applications\n",
"\n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sampple Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",
"Riva comes with various sample applications. They demonstrate how to use the APIs to build various applications. Refer to [Riva Sample Apps](https://docs.nvidia.com/deeplearning/riva/user-guide/docs/samples/index.html) for more information. \n",

@virajkarandikar virajkarandikar merged commit aa79927 into nvidia-riva:release/2.11.0 Apr 26, 2023
@virajkarandikar virajkarandikar deleted the vkarandikar_add_diarization_tutorial branch April 26, 2023 03:21
rmittal-github added a commit that referenced this pull request May 4, 2023
* add speaker diarization tutorial (#144)

* add speaker diarization tutorial
* fix broken links

Bug 4046294

Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>

* [NMT] Add Megatron multilingual and S2S/S2T services (#145)

* Add Megatron multilingual and S2S/S2T services

* apply suggestions for changes in markdown

Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>

---------

Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>

* Fix EKS deployment tutorial (#146)

There is no separate client image now. Use built-in clients in
riva-speech image.

* Updating Notebook Table with Megatron NMT models (#143)

* Adding megatron models to the notebook + tested

Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>

* Updated models table

Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>

* Update review suggestion

Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>

---------

Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: rmittal-github <61574997+rmittal-github@users.noreply.github.com>
Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>

* fix s2s config typo (#150)

---------

Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: Viraj Karandikar <16838694+virajkarandikar@users.noreply.github.com>
Co-authored-by: LynseyFabel <46456803+LynseyFabel@users.noreply.github.com>
Co-authored-by: Ashish Sardana <ashishsardana21@gmail.com>
Co-authored-by: David <amosalla@asu.edu>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants