This repo contains the Vagrantfile script for generating virtual machines of Hadoop and Spark.
-
Install Vagrant for your specific operating system
-
Clone this repository
-
Open the Vagrantfile script using your favourite IDE and change the config as you need. The specific configuration can be found in the next part of this document.
-
From your console/terminal, inside the main directory, run the vagrant with following command and wait until it finishes.
vagrant up
- Once it is finished, enter the node1 machine by following command
vagrant ssh node1
- After it enters the shell of node1, run the script for configuring password-less SSH to slave nodes by executing following commands
sudo chmod +x /vagrant/scripts/setup-ssh.sh
/vagrant/script/setup-ssh.sh -t [n]
where [n] is amount of node you would like to setup in cluster
Following option can be configured in the Vagrant file
--memory
To spcifiy the memory for each machine
numNodes
To specify the amount of nodes in cluster