DataDuck ETL is probably the quickest extract-transform-load framework system to set up. If you want to set up a data warehouse, give DataDuck ETL a try.
DataDuck ETL is currently focused on loading to Amazon Redshift (through Amazon S3).
Create a new, empty directory. Inside this directory, create a file named Gemfile, and add the following to it:
source 'https://rubygems.org'
gem 'dataduck'
Then execute:
$ bundle install
Finally, run the quickstart command:
$ dataduck quickstart
The quickstart wizard will ask you for credentials to your database, then create the basic setup for your project. After the quickstart, you'll still need to edit the .env and config/base.yml files, as well as customize your tables. Finally, your project's ETL can be run by running the dataduck etl all
command.
If you'd like to run this regularly, such as every night, it's recommended to use the whenever gem to manage a cron job to regularly run the ETL.
Visit the docs page to read the documentation. The docs page is autogenerated from the files in this project's docs directory.
There are many third party integrations, including Optimizely, SEMrush, Sentry, and Airbrake. Visit the docs page to learn more about integrating.
To contribute, get in touch at http://DataDuckETL.com/ so that we can share the Contributor License Agreement (CLA) with you, then create a pull request.
You can buy a license directly from Supported Source at https://supportedsource.org/projects/dataduck-etl
You're also welcome to get in touch at http://dataducketl.com/ for invoicing or anything else.