https://files.tmdb.org/p/exports/production_company_ids_MM_DD_YYYY.json.gz
Compares the company titles using Levenshtein distance
go run cmd/001_titlecompare/main.go production_company_ids_MM_DD_YYYY.json.gz wikidata-companies.csv title_compare.csv
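For readers unfamiliar with the metric, here is a minimal sketch of Levenshtein distance in Go. It is not the code in `cmd/001_titlecompare`. The `similarity` helper, its case folding, and its normalization to a 0..1 score are assumptions made for illustration only.

```go
package main

import (
	"fmt"
	"strings"
)

// min3 returns the smallest of three ints.
func min3(a, b, c int) int {
	if b < a {
		a = b
	}
	if c < a {
		a = c
	}
	return a
}

// levenshtein returns the edit distance between a and b, counted in runes
// (insertions, deletions and substitutions each cost 1).
func levenshtein(a, b string) int {
	ra, rb := []rune(a), []rune(b)
	prev := make([]int, len(rb)+1)
	curr := make([]int, len(rb)+1)
	for j := range prev {
		prev[j] = j
	}
	for i := 1; i <= len(ra); i++ {
		curr[0] = i
		for j := 1; j <= len(rb); j++ {
			cost := 1
			if ra[i-1] == rb[j-1] {
				cost = 0
			}
			curr[j] = min3(prev[j]+1, curr[j-1]+1, prev[j-1]+cost)
		}
		prev, curr = curr, prev
	}
	return prev[len(rb)]
}

// similarity is a hypothetical helper: it lowercases both titles and maps
// the distance to a score between 0 (nothing in common) and 1 (identical).
func similarity(a, b string) float64 {
	a, b = strings.ToLower(a), strings.ToLower(b)
	maxLen := len([]rune(a))
	if n := len([]rune(b)); n > maxLen {
		maxLen = n
	}
	if maxLen == 0 {
		return 1
	}
	return 1 - float64(levenshtein(a, b))/float64(maxLen)
}

func main() {
	fmt.Println(levenshtein("kitten", "sitting")) // prints 3
	fmt.Println(similarity("Pixar", "pixar"))     // prints 1
}
```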
Downloads the media IDs for each company ID
export TMDB_API_KEY="<your key>"
go run cmd/002_download_tmdbcompanymedia/main.go title_compare.csv tmdb_media_mapping.csv
go run cmd/003_download_wikidatacompanymedia/main.go title_compare.csv wikidata_media_mapping.csv
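As a rough sketch of how a download step might use `TMDB_API_KEY`, the snippet below reads the key from the environment and builds a request URL for TMDB's `discover/movie` endpoint, filtered by company. Which endpoint the actual tool calls is not stated here, so the endpoint choice, the `discoverURL` helper and the example company ID are all assumptions.

```go
package main

import (
	"fmt"
	"net/url"
	"os"
	"strconv"
	"strings"
)

// discoverURL is a hypothetical helper that builds a TMDB v3 discover/movie
// URL listing movies for one company. The real tool may use another endpoint.
func discoverURL(apiKey string, companyID, page int) string {
	q := url.Values{}
	q.Set("api_key", apiKey)
	q.Set("with_companies", strconv.Itoa(companyID))
	q.Set("page", strconv.Itoa(page))
	u := url.URL{
		Scheme:   "https",
		Host:     "api.themoviedb.org",
		Path:     "/3/discover/movie",
		RawQuery: q.Encode(), // Encode sorts parameters by key
	}
	return u.String()
}

func main() {
	key := os.Getenv("TMDB_API_KEY")
	if key == "" {
		fmt.Fprintln(os.Stderr, "TMDB_API_KEY is not set")
		os.Exit(1)
	}
	// 3 is an arbitrary example company ID; the key is hidden when printing.
	u := discoverURL(key, 3, 1)
	fmt.Println("would request:", strings.Replace(u, key, "<redacted>", 1))
}
```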
Compares the media ID sets of each company in TMDB and Wikidata and finds the best match
titleCompareCSVPath := os.Args[1]
tmdbMediaCSVPath := os.Args[2]
wikidataMediaCSVPath := os.Args[3]
outputMatchCSVPath := os.Args[4]
go run cmd/004_mediaidscompare/main.go title_compare.csv tmdb_media_mapping.csv wikidata_media_mapping.csv result.csv
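The core of comparing two media ID sets is counting the IDs they share. The following is a minimal sketch of that idea, not the code in `cmd/004_mediaidscompare`. The `commonMedia` function name and the string ID values are made up for illustration.

```go
package main

import "fmt"

// commonMedia counts the distinct media IDs that appear in both slices.
// It is a hypothetical helper; IDs are treated as opaque strings.
func commonMedia(tmdb, wikidata []string) int {
	seen := make(map[string]struct{}, len(tmdb))
	for _, id := range tmdb {
		seen[id] = struct{}{}
	}
	n := 0
	for _, id := range wikidata {
		if _, ok := seen[id]; ok {
			n++
			delete(seen, id) // count each shared ID only once
		}
	}
	return n
}

func main() {
	tmdb := []string{"603", "604", "605"}
	wikidata := []string{"604", "605", "605", "999"}
	fmt.Println(commonMedia(tmdb, wikidata)) // prints 2
}
```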
- PROBABLY - name similar, at least one common media
- MAYBE - name not similar, at least one common media OR name very similar, no common media
- UNLIKELY - name similar, no common media
- NOPE - name not similar, no common media
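The four labels above can be read as a small decision table over two inputs: a name similarity score and the number of shared media. The sketch below encodes that table. The 0..1 score scale and the `similar` and `verySimilar` cut-offs are assumptions, since the list does not give thresholds.

```go
package main

import "fmt"

// classify maps a name similarity score (assumed to be in 0..1) and a count
// of common media to one of the four labels. Both thresholds are assumed
// values chosen for illustration.
func classify(sim float64, common int) string {
	const (
		similar     = 0.80 // assumed cut-off for "name similar"
		verySimilar = 0.95 // assumed cut-off for "name very similar"
	)
	switch {
	case common > 0 && sim >= similar:
		return "PROBABLY"
	case common > 0:
		return "MAYBE" // name not similar, but media overlap
	case sim >= verySimilar:
		return "MAYBE" // no overlap, but the names nearly match
	case sim >= similar:
		return "UNLIKELY"
	default:
		return "NOPE"
	}
}

func main() {
	fmt.Println(classify(0.90, 2)) // prints PROBABLY
	fmt.Println(classify(0.85, 0)) // prints UNLIKELY
}
```

The order of the cases matters: the two checks that require common media come first, so the name-only cases below them apply only when the sets do not overlap.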
https://www.wikidata.org/wiki/Wikidata:Property_proposal/TMDB_company_ID