qilab/cdh5-hadoop-pseudo-env

cdh5-hadoop-pseudo-env

This is an Ansible playbook that installs Hadoop in pseudo-distributed mode, based on CDH 5.x, together with ZooKeeper, HBase, Hive, Spark, etc. A Vagrantfile (in vagrant) that creates a Hadoop environment provisioned by the playbook is also available.

Ansible playbook (ansible)

The Ansible playbook in ansible consists of the following roles:

  • cdh_common: sets up Cloudera's repository and key
  • common: sets up OS settings and installs common packages
  • hadoop: installs Hadoop
  • hbase: installs HBase
  • hive: installs Hive (requires a Metastore DB to be installed)
  • mariadb: installs MariaDB for Hive Metastore
  • mysql_connector_java: installs MySQL Connector/J for Hive Metastore
  • oracle_jdk: installs Oracle JDK
  • spark: installs Spark
  • zookeeper: installs ZooKeeper Server
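
As a rough illustration of how these roles could be combined, here is a minimal playbook sketch. The file name `site.yml`, the `hosts` pattern, the use of `become`, and the role order are all assumptions made for this example; the playbook shipped in ansible may be organized differently.

```yaml
# site.yml (hypothetical file name, not taken from the repository)
- hosts: all          # assumption: the inventory contains a single pseudo-distributed node
  become: true        # assumption: package installation needs root privileges
  roles:
    # Assumed order: repository and OS setup first, then the JDK,
    # then the services, with the Metastore DB placed before Hive.
    - common
    - cdh_common
    - oracle_jdk
    - zookeeper
    - hadoop
    - hbase
    - mariadb
    - mysql_connector_java
    - hive
    - spark
```

Placing mariadb and mysql_connector_java ahead of hive reflects the note above that Hive requires a Metastore DB; the rest of the ordering is only a plausible guess.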

Requirements

This playbook is designed for the following operating systems:

  • CentOS 6.8
  • CentOS 7.3
  • Ubuntu 12.04 (precise)
  • Ubuntu 14.04 (trusty)
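
If you want a run to fail early on an unlisted OS, a guard task along the following lines is one option. This is a sketch and not part of the original playbook: it relies on the standard Ansible facts `ansible_distribution`, `ansible_distribution_major_version`, and `ansible_distribution_release`, and it only checks the major CentOS version and the Ubuntu codename, not the exact point releases listed above.

```yaml
# Hypothetical guard task, e.g. placed under pre_tasks in the play.
- name: Check that the target OS is one the playbook was designed for
  assert:
    that:
      - >-
        (ansible_distribution == "CentOS"
         and ansible_distribution_major_version in ["6", "7"])
        or
        (ansible_distribution == "Ubuntu"
         and ansible_distribution_release in ["precise", "trusty"])
    msg: "Unsupported OS: expected CentOS 6/7 or Ubuntu 12.04/14.04"
```

The task needs fact gathering to be enabled (the Ansible default) so that the `ansible_distribution*` variables are populated.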

Vagrant (vagrant)

You can use the Vagrantfile in vagrant to create a Hadoop environment provisioned by the playbook.

Requirements
