Hadoop集群部署,就是以Cluster mode方式进行部署。
Hadoop的节点构成如下:
HDFS daemon: NameNode, SecondaryNameNode, DataNode
YARN damones: ResourceManager, NodeManager, WebAppProxy
MapReduce Job History Server
本次测试的分布式环境为:Master 1台 (test166),Slave 1台(test167)
安装方法参照 Hadoop系列之(一):Hadoop单机部署
详细参照 Hadoop系列之(一):Hadoop单机部署
让设置生效
# source /etc/profile# vi etc/hadoop/hdfs-site.xml <configuration> <property> <name>dfs.replication</name> <value>1</value> </property> </configuration>
# vi etc/hadoop/mapred-site.xml <configuration> <property> <name>mapreduce.framework.name</name> <value>yarn</value> </property> </configuration>
Master节点: namenode
创建目录并赋予权限
# mkdir -p /usr/local/hadoop-2.7.0/tmp/dfs/name # chmod -R 777 /usr/local/hadoop-2.7.0/tmp# vi etc/hadoop/hdfs-site.xml <property> <name>dfs.namenode.name.dir</name> <value>file:///usr/local/hadoop-2.7.0/tmp/dfs/name</value> </property>
Slave节点:datanode
创建目录并赋予权限
# mkdir -p /usr/local/hadoop-2.7.0/tmp/dfs/data # chmod -R 777 /usr/local/hadoop-2.7.0/tmp# vi etc/hadoop/hdfs-site.xml <property> <name>dfs.datanode.data.dir</name> <value>file:///usr/local/hadoop-2.7.0/tmp/dfs/data</value> </property>
Master节点: resourcemanager
# vi etc/hadoop/yarn-site.xml <configuration> <property> <name>yarn.resourcemanager.hostname</name> <value>test166</value> </property> </configuration>
Slave节点: nodemanager
# vi etc/hadoop/yarn-site.xml <configuration> <property> <name>yarn.resourcemanager.hostname</name> <value>test166</value> </property> <property> <name>yarn.nodemanager.aux-services</name> <value>mapreduce_shuffle</value> </property> </configuration>Slave节点:
# vi etc/hadoop/mapred-site.xml <property> <name>mapreduce.jobhistory.address</name> <value>test166:10020</value> </property>启动HDFS
# sbin/start-dfs.sh启动YARN
# sbin/start-yarn.sh启动job history server
# sbin/mr-jobhistory-daemon.sh start historyserver确认
Master节点:
# jpsSlave节点:
# jps查看
# hdfs dfs -ls /user/test22/input确认执行结果
# hdfs dfs -cat output/*本次集群部署主要是为了测试验证,生产环境中的HA,安全等设定,接下来会进行介绍。
转载于:https://www.cnblogs.com/ee900222/p/hadoop_2.html
