Skip to content

Commit

Permalink
Merge pull request #81 from greenelab/trang1618-patch-1
Browse files Browse the repository at this point in the history
Address #57
  • Loading branch information
trangdata authored Mar 14, 2020
2 parents b134895 + 3efbbab commit f976cd7
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion content/10.methods.md
Original file line number Diff line number Diff line change
Expand Up @@ -172,7 +172,7 @@ We refer to this dataset as Wiki2019 (available online in [`annotated_names.tsv`
Table: **Predicting name-origin groups of names trained on Wikipedia's living people.**
The table lists the 8 groups and the number of living people for each region that the LSTM was trained on.
Example names shows actual author names that received a high prediction for each region.
Full information about which countries comprised each region can be found in the online dataset [`country_to_region.tsv`](https://github.com/greenelab/wiki-nationality-estimate/blob/master/data/country_to_region.tsv).
Full information about which countries comprised each region can be found in the online dataset [`country_to_region.tsv`](https://github.com/greenelab/iscb-diversity/blob/make-letters/data/countries/2020-01-31_groupings.tsv).
{#tbl:example_names}

### Affiliation Analysis
Expand Down
2 changes: 1 addition & 1 deletion content/20.results.md
Original file line number Diff line number Diff line change
Expand Up @@ -91,7 +91,7 @@ When we directly compared honoree composition with PubMed, we observed discrepan
Outside of the primary range of our analyses, the two names of 2020 PSB keynote speakers were predicted to be of Group A (65% probability) and Group H (99% probability), respectively.


![Compared to the name collection of Pubmed authors, Group A honorees are overrepresented while Group C honorees are underrepresented. Category O represents all other groups. Estimated composition of name origin prediction over the years of
![Compared to the name collection of Pubmed authors, Group A honorees are overrepresented while Group C honorees are underrepresented. Category O represents all other groups (D, E, F, G and H, see Table @tbl:example_names). Estimated composition of name origin prediction over the years of
(A, left) all Pubmed computational biology and bioinformatics journal authors,
and (A, right) all ISCB Fellows and keynote speakers
was computed as the average of prediction probabilities of Pubmed articles or ISCB honorees each year.
Expand Down

0 comments on commit f976cd7

Please sign in to comment.