Skip to content

Where did the CSV of all danbooru tags come from? #194

Answered by DominikDoom
PaperOrb asked this question in Q&A
Discussion options

You must be logged in to vote

For a long time danbooru had a public daily dump under https://danbooru.donmai.us/cache/tags.json, and a similar one for aliases.
However, this was removed some time at the end of 2022, probably to discourage data scraping. But that is where my copy came from, I think my version is a snapshot from November. It was all tags on danbooru at that point as far as I know, including deprecated ones.

I then converted the json file to an SQLite database using https://github.com/dcmoura/spyql so I could query and export it easier and simply created a view with the top 100k tags sorted by post count and joined with their aliases, then exported that to csv.

You are correct that there are a few tags n…

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@PaperOrb
Comment options

@DominikDoom
Comment options

Answer selected by PaperOrb
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants