This is an Ansible playbook that installs Hadoop Pseudo-distributed Mode based on CDH 5.x with ZooKeeper, HBase, Hive, Spark, etc. Vagrantfile (in vagrant) that creates an Hadoop environment provisioned by the playbook is available.
Ansible playbook (ansible)
The Ansible playbook in ansible consists of following roles:
cdh_common
: sets up Cloudera's repository and keycommon
: sets up OS settings and install common packageshadoop
: installs Hadoophbase
: installs HBasehive
: installs Hive (it requires Metastore DB is installed)mariadb
: installs MariaDB for Hive Metastoremysql_connector_java
: installs MySQL Connector/J for Hive Metastoreoracle_jdk
: installs Oracle JDKspark
: installs Sparkzookeeper
: installs ZooKeeper Server
- Ansible 2.0.0 or later
This playbook is designed for following OS:
- CentOS 6.8
- CentOS 7.3
- Ubuntu 12.04 (precise)
- Ubuntu 14.04 (trusty)
Vagrant (vagrant)
You can use the Vagrantfile in vagrant to create an Hadoop environment provisioned by the playbook.
- VirtualBox 5.0.0 or later
- Vagrant 1.7.0 or later