為學習spark,虛擬機中開4台虛擬機安裝spark3.0.0底層hadoop集群已經安裝好,見ol7.7安裝部署4節點hadoop 3.2.1分散式集群學習環境首先,去http://spark.apache.org/downloads.html下載對應安裝包解壓[hadoop@master ~]$... ...
為學習spark,虛擬機中開4台虛擬機安裝spark3.0.0
底層hadoop集群已經安裝好,見ol7.7安裝部署4節點hadoop 3.2.1分散式集群學習環境
首先,去http://spark.apache.org/downloads.html下載對應安裝包
解壓
[hadoop@master ~]$ sudo tar -zxf spark-3.0.0-bin-without-hadoop.tgz -C /usr/local [hadoop@master ~]$ cd /usr/local [hadoop@master /usr/local]$ sudo mv ./spark-3.0.0-bin-without-hadoop/ spark [hadoop@master /usr/local]$ sudo chown -R hadoop: ./spark
四個節點都添加環境變數
export SPARK_HOME=/usr/local/spark export PATH=$PATH:$SPARK_HOME/bin:$SPARK_HOME/sbin
配置spark
spark目錄中的conf目錄下cp ./conf/spark-env.sh.template ./conf/spark-env.sh後面添加
export SPARK_MASTER_IP=192.168.168.11 export HADOOP_CONF_DIR=/usr/local/hadoop/etc/hadoop export SPARK_LOCAL_DIRS=/usr/local/hadoop export SPARK_DIST_CLASSPATH=$(/usr/local/hadoop/bin/hadoop classpath)
然後配置work節點,cp ./conf/slaves.template ./conf/slaves修改為
master slave1 slave2 slave3
寫死JAVA_HOME,sbin/spark-config.sh最後添加
export JAVA_HOME=/usr/lib/jvm/jdk1.8.0_191
複製spark目錄到其他節點
sudo scp -r /usr/local/spark/ slave1:/usr/local/ sudo scp -r /usr/local/spark/ slave2:/usr/local/ sudo scp -r /usr/local/spark/ slave3:/usr/local/ sudo chown -R hadoop ./spark/
...
啟動集群
先啟動hadoop集群/usr/local/hadoop/sbin/start-all.sh
然後啟動spark集群
通過master8080埠監控
完成安裝