We are sharing the dataset we used in the paper "Relationship Between Diversity of Collaborative Group Members’ Race and Ethnicity and the Frequency of their Collaborative Contributions in GitHub" in the interest of encouraging others to replicate and build upon our work.
In this repository, you can find:
- The scripts we used to infer the gender, race and ethnicity of the developers
- The scripts we used to answer RQ1, RQ2.1, and RQ2.2
- The datasets we used in our analysis.