GitHub - dlion168/spoken_stereoset: The official repo for speech based stereoset

Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models

This is the official repository for Spoken Stereoset, a dataset measures stereotypical bias on speech large language models (SLLMs). The construction detail can be found in our paper soon.

Metadata

id: The unique id of instance.
speaker: The speaker of the speech segment in azure TTS.
age/gender: The demogrpahic attribute of the speaker that might link to stereotypical associations.
context: The transcription of spoken context sentence.
irrelevant: An irrelevant continuation to the context.
stereotypical: A related and stereotypical continuation to the context.
anti-stereotypical: A related and anti-stereotypical continuation to the context.
labels: The labels annotated by the annotators for 3 possinle continuations.
annotators: The annotator id of the annotations.\

Contact

If you have any concerns, please contact: even.dlion8@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
example		example
metadata		metadata
speech		speech
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models

Metadata

Contact

About

Releases

Packages

dlion168/spoken_stereoset

Folders and files

Latest commit

History

Repository files navigation

Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models

Metadata

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages