hadoop2 - Hadoop installation and configuration: multiple versions side-by-side on same host -
can install different versions of hadoop on same system in pseudo distributed mode?
actually want explore features of different version of hadoop-1.x , hadoop-2.x , have configured hadoop-1.x , hadoop-2.x on 2 different systems running linux. there way configure on same machine?
yes, possible install multiple versions of hadoop software side-by-side on same host.
- extract different versions of hadoop software separate sub-directories.
- create separate configuration files (core-site.xml, hdfs-site.xml, yarn-site.xml, etc.) different versions, , keep these sets of files in separate directories.
- make sure configure different directories things use local storage, such
dfs.namenode.name.dir
,dfs.datanode.data.dir
in hdfs-site.xml. 2 versions should have data isolated 1 another. alternatively, if want attempt share data between 2 different versions, , run 1 version @ time, might able achieve that. depending on software versions involved, might run data incompatibilities or need run upgrade commands. - if want run multiple versions simultaneously, make sure configured distinct server ports things
dfs.namenode.rpc-address
in hdfs-site.xml ,yarn.server.resourcemanager.address
in yarn-site.xml. - when invoking commands, set environment variables
hadoop_home
,hadoop_conf_dir
control version execute , configurations use.
everything above works fine manually deployed installation, such using apache hadoop distro. bigtop distros or vendor distros, rpm/deb/etc. packaging might less tolerant of side-by-side deployment.
Comments
Post a Comment