Downloading and extracting data from DATASUS

First gather the prefix for each link by going to https://datasus.saude.gov.br/transferencia-de-arquivos/ and choosing your source. For example, to download "Equipes de saúde em MG de 2007 a 2021" we get an arbitrary link to extract the prefix, I got ftp://ftp.datasus.gov.br/dissemin/publicos/CNES/200508_/Dados/EP/EPMG2101.dbc which I then remove the date and suffix and end up with ftp://ftp.datasus.gov.br/dissemin/publicos/CNES/200508_/Dados/EP/EPMG. Then I

Put the prefix obtainded in get_links.py's variable named link_base and run it. It should output a links.json file.
Run python download_data.py in this folder to download every .dbc file into the folder bases_raw.
Now that you've downloaded the data, run python uncompress_data.py to decompress every file from .dbc to .csv into the directory bases_descomprimidas.

And you're done!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
README.md		README.md
dbc2csv.R		dbc2csv.R
download_data.py		download_data.py
get_links.py		get_links.py
uncompress_data.py		uncompress_data.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Downloading and extracting data from DATASUS

About

Releases

Packages

Languages

henrypickler/grab_datasus

Folders and files

Latest commit

History

Repository files navigation

Downloading and extracting data from DATASUS

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages