Skip to content

Commit

Permalink
Merge branch 'pluralitybook:main' into main
Browse files Browse the repository at this point in the history
  • Loading branch information
audreyt authored Mar 23, 2024
2 parents 833bd99 + cdb9a44 commit 27b7444
Show file tree
Hide file tree
Showing 15 changed files with 2,203 additions and 1,414 deletions.
50 changes: 32 additions & 18 deletions contents/english/6-0-from-⿻-to-reality.md

Large diffs are not rendered by default.

66 changes: 42 additions & 24 deletions contents/english/6-1-workplace.md

Large diffs are not rendered by default.

37 changes: 19 additions & 18 deletions contents/english/6-2-health.md

Large diffs are not rendered by default.

24 changes: 16 additions & 8 deletions contents/english/6-3-media.md

Large diffs are not rendered by default.

46 changes: 26 additions & 20 deletions contents/english/6-4-environment.md

Large diffs are not rendered by default.

345 changes: 314 additions & 31 deletions scripts/index/Plurality Book Indexing Exercise - Main.csv

Large diffs are not rendered by default.

8 changes: 7 additions & 1 deletion scripts/index/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,12 +2,15 @@

- `Plurality Book Indexing Exercise - Main.csv`: raw file exported from [Spreadsheet](https://docs.google.com/spreadsheets/d/1gmyjFbErt_CW8-qLKChSpciLlCDGUhLriYFov0HO3qA/edit#gid=0)
- `main.py`: output POC count, occurence of each keywords in each sections, and the count of occurences
- `ignore.txt`: keywords which should avoid mechine search

## output
- `contributors.tsv`: number of contribution on the spreadsheet
- `keyword_occurrence.tsv`: occurrence of each keywords in each sections
- `section_occurrence.tsv`: number of occurrences in each sections of any keywords. It is to find less-covered sections.
- `no_occurence.txt`: Keywords which does not occur in the contents.
- `too_many_occurrence.tsv`: Keywords which occur in more than 5 sections.
- `similar_keywords.txt`: Output if there are keywords like `Neural network` and `Neural Network`.


## memo
Expand All @@ -17,4 +20,7 @@
- no_occurence: Some looks mistake (e.g. `W. Mitchell Waldrop` does not occur but `M. Mitchell Waldrop` occurs), some may because of acronym in palens (e.g. `Distributed Ledger Technology (DLT)`)
- Changed `Universal Record Locator` to `Uniform Resource Locator`, and fixed `W. Mitchell Waldrop`.
- Fix some upper/lower diversity (e.g. `Virtual Reality` and `Virtual reality`)
- Keywords with acronym such as `Artificial Intelligence (AI)`: If it does not occerred, remove after palens and search again.
- Keywords with acronym such as `Artificial Intelligence (AI)`: If it does not occerred, remove after palens and search again.
- Keywords with quotes such as `Diversity of "groups"`: remove quotes
- `keyword_occurrence.tsv`: Output "by human" keywords and "by script" keywords on the different columns
- Fix bug: I ignored `X` derived from `X (formerly Twitter)` but the comparison was done after lower().
6 changes: 3 additions & 3 deletions scripts/index/contributors.tsv
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
cFQ 1123
cFQ 1378
glenweyl 1
gnomevan 27
nishio 156
gnomevan 56
nishio 155
tsuzumik 44
11 changes: 11 additions & 0 deletions scripts/index/ignore.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
X
Her
UN
Author
Writing
Translation
Signal
Participation
Outcome
Output
Administration
Loading

0 comments on commit 27b7444

Please sign in to comment.