Bhaskar Shubham edited this page Sep 2, 2014 · 2 revisions

Hadoop

CDH3

Download the required version of Hadoop and install it using the following steps:

  1. Extract Hadoop: tar -xvf hadoop-0.20.2-cdh3u6.tar.gz
  2. Set HADOOP_HOME: Add the following lines to your environment file:
     export HADOOP_HOME=/location/where/you/extracted/hadoop
     export PATH=$PATH:$HADOOP_HOME/bin
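
Once the environment file has been reloaded, a quick sanity check can confirm that both settings took effect. The following is a minimal sketch rather than part of the original instructions: the function name check_hadoop_env is hypothetical, and it only inspects the two variables without starting Hadoop.

```shell
# Hypothetical helper: verifies that HADOOP_HOME is set and that
# $HADOOP_HOME/bin appears on PATH. It does not invoke Hadoop itself.
check_hadoop_env() {
    if [ -z "${HADOOP_HOME:-}" ]; then
        echo "HADOOP_HOME is not set"
        return 1
    fi
    # Wrap PATH in colons so the first and last entries match too.
    case ":$PATH:" in
        *":$HADOOP_HOME/bin:"*) echo "ok" ;;
        *) echo "\$HADOOP_HOME/bin is not on PATH"; return 1 ;;
    esac
}
```

Calling check_hadoop_env prints ok when both settings are in place. After that, running hadoop version should report the installed release.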

Edit the $HADOOP_HOME/conf/hadoop-env.sh file and set the following variables:

export JAVA_HOME=/location/where/you/extracted/jdk                    
export HADOOP_OPTS=-Djava.net.preferIPv4Stack=true                               
export HADOOP_LOG_DIR=/location/where/you/want/logs
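
Before starting any daemons, it can help to confirm that the JAVA_HOME value points at a usable JDK and that the log directory is writable. The sketch below is an illustration under those assumptions. The function name check_env_paths and its two arguments are hypothetical and are not part of Hadoop.

```shell
# Hypothetical pre-flight check for the values set in hadoop-env.sh.
# Usage: check_env_paths <java_home> <log_dir>
check_env_paths() {
    java_home=$1
    log_dir=$2
    # JAVA_HOME should contain an executable bin/java.
    if [ ! -x "$java_home/bin/java" ]; then
        echo "no executable java under $java_home/bin"
        return 1
    fi
    # Create the log directory if needed, then confirm it is writable.
    mkdir -p "$log_dir" || return 1
    if [ ! -w "$log_dir" ]; then
        echo "log directory $log_dir is not writable"
        return 1
    fi
    echo "ok"
}
```

For example, check_env_paths "$JAVA_HOME" "$HADOOP_LOG_DIR" can be run in a shell where both variables are already set to the values used in hadoop-env.sh.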

Edit the $HADOOP_HOME/conf/core-site.xml file and add the following properties:

<configuration>                                                                  
 <property>                                                                      
    <name>hadoop.tmp.dir</name>                                                  
    <value>/app/hadoop/tmp</value>                                               
    <description>A base for other temporary directories.</description>           
 </property>                                                                     
 <property>                                                                      
    <name>fs.default.name</name>                                                 
    <value>hdfs://localhost:54310</value>                                        
    <description>The name of the default file system.  A URI whose               
    scheme and authority determine the FileSystem implementation.  The           
    uri's scheme determines the config property (fs.SCHEME.impl) naming          
    the FileSystem implementation class.  The uri's authority is used to         
    determine the host, port, etc. for a filesystem.</description>
 </property>
</configuration>
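
To make the description of fs.default.name concrete, the sketch below splits the example value hdfs://localhost:54310 into its scheme, host, and port using plain shell parameter expansion. It illustrates how the URI is structured and is not how Hadoop parses it internally. The function name parse_fs_uri is hypothetical, and the sketch assumes the URI always includes an explicit port.

```shell
# Splits a file system URI such as hdfs://localhost:54310 into its parts.
# Assumes the authority always has the form host:port.
parse_fs_uri() {
    uri=$1
    scheme=${uri%%://*}          # text before "://"
    authority=${uri#*://}        # text after "://"
    authority=${authority%%/*}   # drop any trailing path
    host=${authority%%:*}        # text before the port separator
    port=${authority##*:}        # text after the port separator
    echo "scheme=$scheme host=$host port=$port"
}

parse_fs_uri "hdfs://localhost:54310"
# prints: scheme=hdfs host=localhost port=54310
```

With the scheme hdfs, the implementation class would be looked up under fs.hdfs.impl, following the fs.SCHEME.impl pattern named in the description. The host and port identify where the file system is expected to listen.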
