Automating Quantifying the Commons (Google Summer of Code 2024) #123

Merged
merged 133 commits into from
Aug 23, 2024
bae2355
Initialize data directory
naishasinha Jun 7, 2024
19b460d
Delete data/fetched directory
naishasinha Jun 7, 2024
d8eb6e1
Create google_custom_search_fetched.csv
naishasinha Jun 7, 2024
6620c63
Delete data/fetched directory
naishasinha Jun 7, 2024
ef37d47
Create google_custom_search_fetched.csv
naishasinha Jun 7, 2024
ed8e523
Delete data/google_custom_search_fetched.csv
naishasinha Jun 7, 2024
e0d7bd8
Create google_custom_search_fetched.csv
naishasinha Jun 7, 2024
f7c1446
Create google_custom_search_processed.csv
naishasinha Jun 7, 2024
162a841
Create google_custom_search_final.csv
naishasinha Jun 7, 2024
427f6eb
Rename google_custom_search_fetched.csv to gcs_fetched.placeholder
naishasinha Jun 7, 2024
16010b3
Rename google_custom_search_processed.csv to gcs_processed.placeholder
naishasinha Jun 7, 2024
28a2330
Rename google_custom_search_final.csv to gcs_final.placeholder
naishasinha Jun 7, 2024
278e913
rename files to be generic
TimidRobot Jun 7, 2024
08facaf
Merge pull request #116 from creativecommons/google-custom-search
TimidRobot Jun 7, 2024
dfb0bda
Add files for scripts
naishasinha Jun 7, 2024
0c3677b
fixed end of files pre-commit issue
naishasinha Jun 7, 2024
9849c3b
Developed initial gcs_query code
naishasinha Jun 9, 2024
2d5b13f
rename placeholder files and shared library
naishasinha Jun 10, 2024
5065b5a
updated gcs_query.py and changed folder names
naishasinha Jun 11, 2024
f8f2745
added states, shared library initialization, and added gcs_query and …
naishasinha Jun 13, 2024
5f31035
made script executable with `chmod u+x scripts/1-fetched/gcs_query.py`
TimidRobot Jun 13, 2024
88e1510
resolved execution errors in gcs_query.py
naishasinha Jun 13, 2024
5005858
(incorrect code) with updated lang and country
naishasinha Jun 17, 2024
a391d94
Updated gcs_query.py with working code and API Simulation through log…
naishasinha Jun 18, 2024
b121acf
disable cache discovery per https://github.com/googleapis/google-api-…
TimidRobot Jun 18, 2024
6bef5a7
Merge pull request #117 from creativecommons/disable-google-cache-dis…
naishasinha Jun 18, 2024
c715c9c
Update gcs_query with stripped link_site
naishasinha Jun 18, 2024
24ef617
Updated working code for a limited license amount
naishasinha Jun 19, 2024
311b1e6
refactor paths
TimidRobot Jun 19, 2024
43555ee
fix repo path after moving shared module
TimidRobot Jun 19, 2024
d6aa639
move data file
TimidRobot Jun 19, 2024
f484b34
rename gcs env variables to match use and add additional comment docu…
TimidRobot Jun 20, 2024
20e036a
update source documentation for GCS (Google Custom Search)
TimidRobot Jun 20, 2024
9da9564
Merge pull request #119 from creativecommons/rename-gsc-env-variables
naishasinha Jun 20, 2024
7831f6c
Merge pull request #118 from creativecommons/data-location
naishasinha Jun 20, 2024
c6b8ef2
Updated code with initial state management
naishasinha Jun 21, 2024
335e0ed
created boilerplate code for phases 2 and 3
naishasinha Jun 25, 2024
9416a77
standardize names
naishasinha Jun 25, 2024
599b6f2
implement exponential backoff
naishasinha Jun 26, 2024
dce795b
add reason to 429 error
TimidRobot Jun 26, 2024
faf80d3
correct gsc query limit
TimidRobot Jun 26, 2024
1ac4a54
new GCS data
naishasinha Jun 28, 2024
1baecce
add more queries to gcs_fetched
naishasinha Jun 29, 2024
5657052
complete base codebase for gcs_reports.py
naishasinha Jul 2, 2024
056b230
add psuedocode for gcs phase 2
naishasinha Jul 2, 2024
deb086d
update reports to generate README.md
naishasinha Jul 9, 2024
4baec68
update reports to generate README.md
naishasinha Jul 9, 2024
ee92f8f
update reports to generate README.md
naishasinha Jul 9, 2024
254c70a
standardize phases
naishasinha Jul 9, 2024
2662be0
update report README generation
naishasinha Jul 11, 2024
b4fd8d9
add flickr phases 1, 2, 3 (in progress)
naishasinha Jul 11, 2024
42ac7fe
commit initial automation workflow
naishasinha Jul 15, 2024
99e67a8
update automation workflow
naishasinha Jul 16, 2024
cd27ed6
set up YAML file for each phase
naishasinha Jul 20, 2024
24b6bc5
add "on push" to YAML files
naishasinha Jul 20, 2024
bf1192e
update fetch.yml typo
naishasinha Jul 20, 2024
505ccb1
update git env
naishasinha Jul 21, 2024
6d01a4f
temporarily add on push
naishasinha Jul 21, 2024
4f44ee8
update env.example
naishasinha Jul 21, 2024
b64d3e5
trigger on push
naishasinha Jul 21, 2024
251c65d
add logging statement
naishasinha Jul 21, 2024
bd0e17a
fetch data
naishasinha Jul 21, 2024
216a018
test workflow
naishasinha Jul 21, 2024
11c47e0
add repeated runs till API limit
naishasinha Jul 21, 2024
d480c74
on push - all branches
naishasinha Jul 21, 2024
6e7dbc1
update shared.py
naishasinha Jul 22, 2024
ac1865c
update fetch YAML
naishasinha Jul 22, 2024
55e1635
Automated data fetch and commit
Jul 22, 2024
a2c9ee4
update process and report workflows
naishasinha Jul 23, 2024
3f195f3
Automated data fetch and commit
Jul 23, 2024
e95aa70
add fetch and merge
naishasinha Jul 23, 2024
e9ad68e
update shared.py
naishasinha Jul 23, 2024
10e3a06
Automated data fetch and commit
Jul 23, 2024
ff44eec
test phases 1 and 2 only
naishasinha Jul 23, 2024
80759fb
Automated data fetch and commit
Jul 23, 2024
56b660c
update fetch and merge
naishasinha Jul 23, 2024
eb702e0
Automated data fetch and commit
Jul 23, 2024
fb89e48
phase 3 YAML
naishasinha Jul 23, 2024
f3b1dc5
Automated data fetch and commit
Jul 23, 2024
14e95b6
fix report YAML
naishasinha Jul 23, 2024
6e0e59e
Automated data report and commit
Jul 23, 2024
b338379
Automated data fetch and commit
Jul 23, 2024
9632900
Merge pull request #120 from creativecommons/automation-workflow
naishasinha Jul 24, 2024
b19da55
Merge pull request #121 from creativecommons/fetch-automation
naishasinha Jul 24, 2024
c25d060
Automated data fetch and commit
Jul 24, 2024
08b887a
Automated data fetch and commit
Jul 25, 2024
3d6cfe4
update Pipfile and combine cron statements
naishasinha Jul 29, 2024
5309c66
fix requirements deletion
naishasinha Jul 29, 2024
204866d
Automated data fetch and commit
Jul 29, 2024
cba0219
initial shared module call switch / error state update
naishasinha Jul 31, 2024
6aec4fb
Fetched and updated new data
Jul 31, 2024
2d62b3c
update reports script with shared module integration and error state …
naishasinha Jul 31, 2024
e6da04c
Fetched and updated new data
Jul 31, 2024
6b5ce0e
run reports workflow individually
naishasinha Jul 31, 2024
9dabbe7
Added and committed new reports
Jul 31, 2024
1f76897
Added and committed new reports
naishasinha Aug 2, 2024
9b13f94
Added and committed new reports
Aug 2, 2024
b2dab03
Update gcs_reports.py
naishasinha Aug 2, 2024
ec829d4
Update gcs_reports.py
naishasinha Aug 2, 2024
8e06233
Update README.md
naishasinha Aug 2, 2024
f878763
fix flake8/black formatting clash
naishasinha Aug 9, 2024
575abf7
update scripts with all feedback (syntax, exceptions, etc.)
naishasinha Aug 9, 2024
dd8e815
Added and committed new reports
Aug 9, 2024
43bb972
Added and committed new reports
TimidRobot Aug 9, 2024
468ae99
Added and committed new reports
Aug 9, 2024
b065cbd
add --skip-commit, --skip-push, and --show-plots options
TimidRobot Aug 9, 2024
a08a957
reduce duplication
TimidRobot Aug 9, 2024
2416146
Added and committed new reports
Aug 9, 2024
bbaef6e
add all phase 2 with error handling system
naishasinha Aug 12, 2024
852aa0d
Added and committed new reports
Aug 12, 2024
d70e7f2
Merge pull request #122 from creativecommons/refine-automation
naishasinha Aug 12, 2024
4b208ce
fix QuantifyingException handling issue
naishasinha Aug 17, 2024
c9b13c7
add fetched for all data sources
naishasinha Aug 17, 2024
25079de
change local branch name for testing purposes
naishasinha Aug 17, 2024
ad2d10f
Added and committed new reports
Aug 17, 2024
0399f64
test error handling system with phase 2 - process
naishasinha Aug 17, 2024
6e07014
fix typo in error handling system
naishasinha Aug 17, 2024
634ef17
fix exit code issue in quantifying error handling
naishasinha Aug 17, 2024
64407a1
Added and committed new GitHub reports
naishasinha Aug 18, 2024
18d9633
Added and committed new GitHub reports
naishasinha Aug 18, 2024
656591f
add successful report generation of all data sources (including READM…
naishasinha Aug 18, 2024
353b2b3
add all data sources to automated scripts
naishasinha Aug 18, 2024
e21c4ea
implement consistency across fetched and remove fabricated data
naishasinha Aug 20, 2024
030c2d1
test branch switch for fetch_and_merge
naishasinha Aug 21, 2024
fdbd74c
final commits and addition of directory tree
naishasinha Aug 21, 2024
e67e825
update directory tree
naishasinha Aug 21, 2024
6f3b869
update directory tree
naishasinha Aug 21, 2024
5f45a08
update history.md
naishasinha Aug 21, 2024
84d6f1f
update history.md
naishasinha Aug 21, 2024
82c9ca0
fix history.md
naishasinha Aug 21, 2024
82857f6
update history.md
naishasinha Aug 22, 2024
f17e27e
update history.md
naishasinha Aug 22, 2024
199d562
improve consistency of whitespace
TimidRobot Aug 22, 2024
46 changes: 46 additions & 0 deletions .github/workflows/fetch.yml.disabled
@@ -0,0 +1,46 @@
name: Fetch Data

on:
  schedule:
    # 1:15am onwards, days 1-20, first month of each quarter
    - cron: '15 1,5,9,13,17,21,23 1-20 1,4,7,10 *'
  workflow_dispatch:

jobs:
  fetch:
    runs-on: ubuntu-latest

    env:
      GCS_DEVELOPER_KEY: ${{ secrets.GCS_DEVELOPER_KEY }}
      GCS_CX: ${{ secrets.GCS_CX }}

    steps:
      - name: Checkout repository
        uses: actions/checkout@v4

      - name: Set up Python
        uses: actions/setup-python@v5
        with:
          python-version: '3.11'

      - name: Install dependencies
        run: |
          pip install --upgrade pip
          pip install pipenv
          pipenv sync --system

      - name: Set PYTHONPATH
        run: echo "PYTHONPATH=./scripts" >> $GITHUB_ENV

      - name: Run fetch script
        run:
          # python scripts/1-fetch/deviantart_fetched.py
          # python scripts/1-fetch/flickr_fetched.py
          # python scripts/1-fetch/github_fetched.py
          # python scripts/1-fetch/internetarchive_fetched.py
          # python scripts/1-fetch/metmuseum_fetched.py
          # python scripts/1-fetch/vimeo_fetched.py
          # python scripts/1-fetch/wikicommons_fetched.py
          # python scripts/1-fetch/wikipedia_fetched.py
          # python scripts/1-fetch/youtube_fetched.py
          python scripts/1-fetch/gcs_fetched.py
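
The `cron` line in this workflow uses the usual five fields (minute, hour, day of month, month, day of week), and GitHub Actions evaluates schedules in UTC. As a rough illustration of which moments the expression selects, here is a minimal Python sketch. The helper name `matches_fetch_schedule` is hypothetical and not part of the repository; the field values are copied from the workflow.

```python
from datetime import datetime

# Fields copied from the cron line: '15 1,5,9,13,17,21,23 1-20 1,4,7,10 *'
MINUTE = 15
HOURS = {1, 5, 9, 13, 17, 21, 23}
DAYS = set(range(1, 21))  # days 1-20 inclusive
MONTHS = {1, 4, 7, 10}    # first month of each quarter


def matches_fetch_schedule(moment: datetime) -> bool:
    """Return True if `moment` (UTC) falls on a scheduled fetch run.

    Hypothetical helper for illustration only.
    """
    return (
        moment.minute == MINUTE
        and moment.hour in HOURS
        and moment.day in DAYS
        and moment.month in MONTHS
    )


print(matches_fetch_schedule(datetime(2024, 7, 1, 1, 15)))   # True
print(matches_fetch_schedule(datetime(2024, 7, 21, 1, 15)))  # False (day 21)
```

With seven hour values and twenty days, the expression allows up to 140 scheduled runs per quarter.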
42 changes: 42 additions & 0 deletions .github/workflows/process.yml.disabled
@@ -0,0 +1,42 @@
name: Process Data

on:
  schedule:
    # 1:15am onwards, days 1-20, second month of each quarter
    - cron: '15 1,5,9,13,17,21,23 1-20 2,5,8,11 *'
  workflow_dispatch:

jobs:
  process:
    runs-on: ubuntu-latest

    steps:
      - name: Checkout repository
        uses: actions/checkout@v4

      - name: Set up Python
        uses: actions/setup-python@v5
        with:
          python-version: '3.11'

      - name: Install dependencies
        run: |
          pip install --upgrade pip
          pip install pipenv
          pipenv sync --system

      - name: Set PYTHONPATH
        run: echo "PYTHONPATH=./scripts" >> $GITHUB_ENV

      - name: Run process script
        run:
          # python scripts/2-process/deviantart_processed.py
          # python scripts/2-process/flickr_processed.py
          # python scripts/2-process/github_processed.py
          # python scripts/2-process/internetarchive_processed.py
          # python scripts/2-process/metmuseum_processed.py
          # python scripts/2-process/vimeo_processed.py
          # python scripts/2-process/wikicommons_processed.py
          # python scripts/2-process/wikipedia_processed.py
          # python scripts/2-process/youtube_processed.py
          python scripts/2-process/gcs_processed.py
43 changes: 43 additions & 0 deletions .github/workflows/report.yml.disabled
@@ -0,0 +1,43 @@
name: Generate Report

on:
  schedule:
    # 1:15am onwards, days 1-20, third month of each quarter
    - cron: '15 1,5,9,13,17,21,23 1-20 3,6,9,12 *'
  workflow_dispatch:


jobs:
  generate-report:
    runs-on: ubuntu-latest

    steps:
      - name: Checkout repository
        uses: actions/checkout@v4

      - name: Set up Python
        uses: actions/setup-python@v5
        with:
          python-version: '3.11'

      - name: Install dependencies
        run: |
          pip install --upgrade pip
          pip install pipenv
          pipenv sync --system

      - name: Set PYTHONPATH
        run: echo "PYTHONPATH=./scripts" >> $GITHUB_ENV

      - name: Run report script
        run:
          # python scripts/3-report/deviantart_reports.py
          # python scripts/3-report/flickr_reports.py
          # python scripts/3-report/github_reports.py
          # python scripts/3-report/internetarchive_reports.py
          # python scripts/3-report/metmuseum_reports.py
          # python scripts/3-report/vimeo_reports.py
          # python scripts/3-report/wikicommons_reports.py
          # python scripts/3-report/wikipedia_reports.py
          # python scripts/3-report/youtube_reports.py
          python scripts/3-report/gcs_reports.py
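
Taken together, the three workflow schedules assign each month of a quarter to one phase: months 1, 4, 7, 10 fetch; months 2, 5, 8, 11 process; months 3, 6, 9, 12 report. A minimal Python sketch of that mapping follows. The function name `phase_for_month` is an assumption made for illustration and does not appear in the repository.

```python
PHASES = ("fetch", "process", "report")


def phase_for_month(month: int) -> str:
    """Map a calendar month (1-12) to the phase scheduled for it.

    Hypothetical helper: month 1 of a quarter fetches, month 2 processes,
    month 3 reports, matching the month fields of the three cron lines.
    """
    if not 1 <= month <= 12:
        raise ValueError(f"month must be in 1..12, got {month}")
    return PHASES[(month - 1) % 3]


for month in (7, 8, 9):
    print(month, phase_for_month(month))
```

Keeping the phases in separate months means each phase can, in principle, finish before the next one starts, although the workflows themselves do not enforce that ordering.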
8 changes: 8 additions & 0 deletions Pipfile
@@ -5,6 +5,8 @@ name = "pypi"

[packages]
flickrapi = "*"
GitPython = "*"
google-api-python-client = "*"
internetarchive = "*"
jupyterlab = ">=3.6.7"
matplotlib = "*"
@@ -27,3 +29,9 @@ pre-commit = "*"

[requires]
python_version = "3.11"

[scripts]
gcs_fetched = "./scripts/1-fetch/gcs_fetched.py"
flickr_fetched = "./scripts/1-fetch/flickr_fetched.py"
gcs_processed = "./scripts/2-process/gcs_processed.py"
gcs_reports = "./scripts/3-report/gcs_reports.py"
1,454 changes: 805 additions & 649 deletions Pipfile.lock

Large diffs are not rendered by default.

43 changes: 43 additions & 0 deletions README.md
@@ -39,6 +39,49 @@ See [`CONTRIBUTING.md`][org-contrib].
[org-contrib]: https://github.com/creativecommons/.github/blob/main/CONTRIBUTING.md


### Project structure

In the directory tree below, `fetch`, `process`, and `report` refer to the
three phases of the project: data gathering, processing, and report
generation.

```
Quantifying/
├── .github/
│   ├── workflows/
│   │   ├── fetch.yml
│   │   ├── process.yml
│   │   ├── report.yml
│   │   └── static_analysis.yml
├── data/                       # Data generated by script runs
│   ├── 20XXQX/
│   │   ├── 1-fetch/
│   │   ├── 2-process/
│   │   ├── 3-report/
│   │   │   └── README.md       # All generated reports are displayed in the README
│   └── ...
├── dev/
├── pre-automation/             # All Quantifying work prior to adding automation system
├── scripts/                    # Run scripts for all phases
│   ├── 1-fetch/
│   ├── 2-process/
│   ├── 3-report/
│   └── shared.py
├── .cc-metadata.yml
├── .flake8                     # Python tool configuration
├── .gitignore
├── .pre-commit-config.yaml     # Static analysis configuration
├── LICENSE
├── Pipfile                     # Specifies the project's dependencies and Python version
├── Pipfile.lock
├── README.md
├── env.example
├── history.md
├── pyproject.toml              # Python tools configuration
└── sources.md
```
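
The `20XXQX` placeholder in the tree stands for a year and quarter, such as the `2024Q2` and `2024Q3` directories added in this pull request. As a sketch of how such a path could be derived from a date, the following Python uses two hypothetical helpers, `quarter_label` and `phase_dir`; how the repository's `shared.py` actually builds these paths is not shown in this diff.

```python
from datetime import date
from pathlib import PurePosixPath

# Directory names taken from the tree above
PHASE_DIRS = {"fetch": "1-fetch", "process": "2-process", "report": "3-report"}


def quarter_label(day: date) -> str:
    """Format a date as YYYYQn, for example 2024Q3 (hypothetical helper)."""
    quarter = (day.month - 1) // 3 + 1
    return f"{day.year}Q{quarter}"


def phase_dir(day: date, phase: str) -> PurePosixPath:
    """Return the data directory for a phase in the quarter containing `day`."""
    return PurePosixPath("data") / quarter_label(day) / PHASE_DIRS[phase]


print(phase_dir(date(2024, 8, 23), "fetch"))  # data/2024Q3/1-fetch
```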


## Development


22 changes: 22 additions & 0 deletions data/2024Q2/1-fetch/gcs_fetched.csv
@@ -0,0 +1,22 @@
LICENSE TYPE, No Priori, United States, Canada, India, United Kingdom, Australia, Japan, English, Spanish, French, Arabic, Chinese (Simplified), Indonesian
https://creativecommons.org/licenses/nc/2.0,57000000,43700000,39400,11500,157000,31100,29300,54600000,1160000,2720000,2960,68100000,101000
https://creativecommons.org/licenses/sa/1.0,108000000,91000000,34400,19100,158000,43100,25100,104000000,268000,332000,28500,106000000,223000
https://creativecommons.org/licenses/nd/1.0,222000000,259000000,76700,63400,519000,165000,79600,215000000,492000,383000,69500,222000000,2480000
https://creativecommons.org/licenses/by-sa/3.0,108000000,88800000,64700,47600,254000,151000,145000,93500000,5010000,3980000,90500,114000000,112000
https://creativecommons.org/licenses/sampling/1.0,170000000,151000000,26500,23600,263000,56800,34300,170000000,87500,21500,1630,171000000,209000
https://creativecommons.org/licenses/nc-sa/1.0,24700000,17600000,12600,3800,75300,11600,13400,22600000,67700,272000,1200,24700000,42400
https://creativecommons.org/licenses/nd-nc/2.0,56900000,43700000,43500,11100,156000,36500,28300,54200000,237000,549000,2660,67600000,11500
https://creativecommons.org/licenses/by-nd-nc/1.0,16000000,10400000,13500,2900,64700,9990,21300,15500000,63400,28800,1920,16000000,27600
https://creativecommons.org/licenses/sa/1.0,108000000,91500000,23000,17200,158000,38200,21700,104000000,270000,337000,28600,108000000,224000
https://creativecommons.org/licenses/nd-nc/1.0,57200000,45500000,23100,8400,123000,21400,30000,56100000,135000,249000,3340,57200000,10200
https://creativecommons.org/publicdomain/zero/1.0,32700000,30000000,20200,18200,48600,12100,32200,30900000,131000,93400,16100,32700000,10500
https://creativecommons.org/licenses/nc-sa/2.0,25000000,17700000,19700,4640,80700,14700,11900,23400000,368000,1340000,1490,25000000,36100
https://creativecommons.org/licenses/sa/1.0,108000000,91400000,23000,17100,159000,38200,21600,104000000,270000,337000,28600,108000000,224000
https://creativecommons.org/licenses/by-sa/1.0,102000000,85900000,21900,15800,150000,36100,21200,97400000,267000,330000,28300,102000000,219000
https://creativecommons.org/licenses/by-nd/2.5,49500000,35500000,45100,8640,133000,24600,23600,47800000,367000,37900,2200,49500000,11200
https://creativecommons.org/licenses/by-nd-nc/1.0,15800000,10300000,15600,2940,66800,10200,21400,15300000,62400,29000,1980,15800000,27500
https://creativecommons.org/licenses/nd/1.0,220000000,194000000,89900,56500,424000,147000,88400,213000000,490000,297000,69500,220000000,2570000
https://creativecommons.org/licenses/by-nc-sa/4.0,62900000,44700000,49900,32900,163000,32900,79400,58900000,1020000,3610000,11500,62900000,102000
https://creativecommons.org/licenses/sampling+/1.0,169000000,151000000,27300,22900,274000,57300,34700,168000000,123000,62800,1310,169000000,200000
https://creativecommons.org/licenses/by-nc-sa/2.5,31600000,22800000,20100,5400,83100,15200,10400,29900000,258000,250000,1570,31600000,7290
https://creativecommons.org/licenses/nc-sa/1.0,24100000,17500000,10400,3820,58800,11800,10600,23300000,67500,280000,1200,24100000,42400
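
The header row of this CSV has a space after every comma while the data rows do not, so a plain parse would produce column names like `" United States"`. A minimal sketch of reading the file with the standard `csv` module and `skipinitialspace=True` is given below. It uses two rows copied from the data above, and the variable names are illustrative.

```python
import csv
import io

# Header and two rows copied from data/2024Q2/1-fetch/gcs_fetched.csv
SAMPLE = """\
LICENSE TYPE, No Priori, United States, Canada, India, United Kingdom, Australia, Japan, English, Spanish, French, Arabic, Chinese (Simplified), Indonesian
https://creativecommons.org/licenses/nc/2.0,57000000,43700000,39400,11500,157000,31100,29300,54600000,1160000,2720000,2960,68100000,101000
https://creativecommons.org/publicdomain/zero/1.0,32700000,30000000,20200,18200,48600,12100,32200,30900000,131000,93400,16100,32700000,10500
"""

# skipinitialspace drops the blank after each delimiter, so the key is
# "United States" rather than " United States".
reader = csv.DictReader(io.StringIO(SAMPLE), skipinitialspace=True)
rows = list(reader)

for row in rows:
    print(row["LICENSE TYPE"], int(row["United States"]))
```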
Binary file added data/2024Q2/3-report/gcs_country_report.png
Binary file added data/2024Q2/3-report/gcs_language_report.png
Binary file added data/2024Q2/3-report/gcs_licensetype_report.png
20 changes: 20 additions & 0 deletions data/2024Q2/README.md
@@ -0,0 +1,20 @@
# 2024Q2 Quantifying the Commons
<!-- GCS Start -->
## Data Source: Google Custom Search

<!-- Country Report Start -->
### Country Report
![Number of Google Webpages Licensed by Country](3-report/gcs_country_report.png)
Number of Google Webpages Licensed by Country
<!-- Country Report End -->
<!-- License Type Report Start -->
### License Type Report
![Number of Webpages Licensed by License Type](3-report/gcs_licensetype_report.png)
Number of Webpages Licensed by License Type
<!-- License Type Report End -->
<!-- Language Report Start -->
### Language Report
![Number of Google Webpages Licensed by Language](3-report/gcs_language_report.png)
Number of Google Webpages Licensed by Language
<!-- Language Report End -->
<!-- GCS End -->
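
The paired `Start` and `End` HTML comments in this README delimit each report section, which makes it possible to regenerate one section without touching the rest. The sketch below shows one way such a marker-based update could work. The function `replace_section` is hypothetical, and the diff does not show how `gcs_reports.py` actually writes the README.

```python
import re


def replace_section(text: str, name: str, body: str) -> str:
    """Replace the block between '<!-- NAME Start -->' and '<!-- NAME End -->'.

    Hypothetical helper: if the markers are absent, the section is appended.
    """
    start = f"<!-- {name} Start -->"
    end = f"<!-- {name} End -->"
    replacement = f"{start}\n{body}\n{end}"
    pattern = re.compile(re.escape(start) + r".*?" + re.escape(end), re.DOTALL)
    if not pattern.search(text):
        return text.rstrip("\n") + "\n" + replacement + "\n"
    # A function replacement avoids backslash handling in the body text
    return pattern.sub(lambda _match: replacement, text, count=1)


readme = (
    "# 2024Q2 Quantifying the Commons\n"
    "<!-- Country Report Start -->\n"
    "old content\n"
    "<!-- Country Report End -->\n"
)
updated = replace_section(readme, "Country Report", "### Country Report")
print(updated)
```

Because the markers survive each rewrite, running the update twice with the same body leaves the file unchanged.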
1 change: 1 addition & 0 deletions data/2024Q2/state.yaml
@@ -0,0 +1 @@
total_records_retrieved: 273
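
The value 273 equals 21 × 13, which is the number of data rows times the number of count columns in `data/2024Q2/1-fetch/gcs_fetched.csv`. Whether the fetch script counts records this way is an assumption, since the diff does not show how the counter is updated. The sketch below reads the one-line state file with a small hand-written parser (the hypothetical `read_state`, which handles only `key: integer` lines rather than full YAML) and checks the arithmetic.

```python
def read_state(text: str) -> dict[str, int]:
    """Parse simple 'key: integer' lines, a tiny subset of YAML.

    Hypothetical helper for illustration; a real script would more likely
    use a YAML library.
    """
    state = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition(":")
        state[key.strip()] = int(value.strip())
    return state


state = read_state("total_records_retrieved: 273\n")

# 21 data rows and 13 count columns in the 2024Q2 CSV (assumed relationship)
rows, columns = 21, 13
print(state["total_records_retrieved"] == rows * columns)  # True
```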
Empty file.
Empty file.
1 change: 1 addition & 0 deletions data/2024Q3/1-fetch/flickr_fetched/rec.txt
@@ -0,0 +1 @@
1 1 1
18 changes: 18 additions & 0 deletions data/2024Q3/1-fetch/gcs_fetched.csv
@@ -0,0 +1,18 @@
LICENSE TYPE, No Priori, United States, Canada, India, United Kingdom, Australia, Japan, English, Spanish, French, Arabic, Chinese (Simplified), Indonesian
https://creativecommons.org/licenses/by/2.5,235000000,208000000,112000,82400,406000,701000,76800,225000000,3940000,835000,88300,235000000,115000
https://creativecommons.org/licenses/by/4.0,412000000,334000000,702000,360000,7250000,770000,675000,381000000,16100000,5260000,255000,412000000,11900000
https://creativecommons.org/licenses/by-nc-sa/2.5,31400000,22700000,21200,5400,83000,15000,10300,29800000,254000,248000,1580,31400000,7480
https://creativecommons.org/licenses/nc/1.0,54600000,43400000,25100,8390,128000,22200,30500,53600000,136000,479000,3760,54500000,81500
https://creativecommons.org/licenses/by-nc-sa/1.0,24000000,17400000,11300,3810,59100,11900,10600,23300000,66900,276000,1220,24000000,42900
https://creativecommons.org/licenses/by/2.1,219000000,189000000,132000,56200,477000,139000,269000,211000000,494000,232000,19800,219000000,119000
https://creativecommons.org/licenses/nc-sampling+/1.0,41200000,32000000,12900,4590,82300,14600,18800,41000000,52800,37800,381,41200000,31100
https://creativecommons.org/licenses/by-sa/4.0,175000000,140000000,126000,79300,341000,103000,123000,158000000,6800000,3860000,81300,175000000,626000
https://creativecommons.org/licenses/by/2.1,217000000,189000000,120000,55500,456000,134000,278000,209000000,219000,241000,20300,217000000,116000
https://creativecommons.org/licenses/by-nc-nd/2.1,66000000,51100000,21000,9200,144000,25100,30400,64300000,167000,581000,3690,66100000,19300
https://creativecommons.org/licenses/by-nc/3.0,88200000,77800000,32000,14000,289000,91900,43000,85000000,7630000,6890000,13700,88300000,40200
https://creativecommons.org/licenses/by/2.1,269000000,235000000,122000,54500,477000,124000,277000,261000000,455000,1900000,20100,269000000,113000
https://creativecommons.org/licenses/by-nd/2.0,32500000,23600000,17000,6910,57500,19500,20100,31200000,121000,47200,5290,32500000,7660
https://creativecommons.org/licenses/by-nc-sa/3.0,35800000,29800000,18000,5760,149000,69400,12200,33700000,623000,4440000,7890,35800000,16000
https://creativecommons.org/licenses/sa/1.0,120000000,101000000,23500,16600,167000,56500,24400,116000000,247000,314000,29100,120000000,213000
https://creativecommons.org/licenses/sa/1.0,120000000,101000000,23500,16600,167000,56500,24400,116000000,247000,314000,29100,120000000,213000
https://creativecommons.org/licenses/by-nc-sa/2.0,20400000,13800000,21400,3730,65000,21700,9270,19100000,92600,773000,4470,20400000,5740
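
Some license URLs appear more than once in this file, sometimes with different counts. The sketch below only detects the repeats; how they should be resolved (keep the latest, average, or drop) is not specified in the diff. The list holds the first column of each row in order, with the common prefix `https://creativecommons.org/licenses/` omitted for brevity.

```python
from collections import Counter

# First column of data/2024Q3/1-fetch/gcs_fetched.csv, in row order,
# with the shared URL prefix stripped
licenses = [
    "by/2.5", "by/4.0", "by-nc-sa/2.5", "nc/1.0", "by-nc-sa/1.0",
    "by/2.1", "nc-sampling+/1.0", "by-sa/4.0", "by/2.1", "by-nc-nd/2.1",
    "by-nc/3.0", "by/2.1", "by-nd/2.0", "by-nc-sa/3.0", "sa/1.0",
    "sa/1.0", "by-nc-sa/2.0",
]

counts = Counter(licenses)
repeated = {path: n for path, n in counts.items() if n > 1}
print(repeated)  # {'by/2.1': 3, 'sa/1.0': 2}
```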
Binary file added data/2024Q3/3-report/gcs_country_report.png
Binary file added data/2024Q3/3-report/gcs_language_report.png
Binary file added data/2024Q3/3-report/gcs_licensetype_report.png
19 changes: 19 additions & 0 deletions data/2024Q3/README.md
@@ -0,0 +1,19 @@
# 2024Q3 Quantifying the Commons
## Data Source: Google Custom Search
<!-- Google Custom Search Start -->
<!-- Country Report Start -->
### Country Report
![Number of Google Webpages Licensed by Country](3-report/gcs_country_report.png)
Number of Google Webpages Licensed by Country
<!-- Country Report End -->
<!-- License Type Report Start -->
### License Type Report
![Number of Webpages Licensed by License Type](3-report/gcs_licensetype_report.png)
Number of Webpages Licensed by License Type
<!-- License Type Report End -->
<!-- Language Report Start -->
### Language Report
![Number of Google Webpages Licensed by Language](3-report/gcs_language_report.png)
Number of Google Webpages Licensed by Language
<!-- Language Report End -->
<!-- Google Custom Search End -->
1 change: 1 addition & 0 deletions data/2024Q3/state.yaml
@@ -0,0 +1 @@
total_records_retrieved: 390