Installing Spark on MacOS

From the Apache Software Foundation

Installing Spark on MacOS High Sierra

Apache provides several ways to install Spark, depending on your preferences: use Homebrew, download a prebuilt package, or build it yourself from source. This tutorial covers building it yourself from source. Spark's source tree includes a build script (build/mvn) that downloads the build tooling it needs (Maven, Scala, and Zinc), while the Maven profiles you pass to it pull in the Hadoop and YARN dependencies of your choice. So the only prerequisites you are responsible for are Java 8+ and, if you would rather use your own Maven than the bundled script, Maven 3.3.9+.
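Before cloning, it can help to confirm that the required tools are actually on your PATH. The following is a minimal sketch; check_tool is a hypothetical helper name, not something Spark ships. Note that macOS may provide a /usr/bin/java stub even when no JDK is installed, so running java -version afterwards is the more reliable check.

```shell
# check_tool is a hypothetical helper: it reports whether a command
# can be found on the PATH. It does not check the tool's version.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "$1: found"
  else
    echo "$1: missing"
  fi
}

# Tools this tutorial relies on.
check_tool java
check_tool git
```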

Clone from git

Next we can clone the source for the build.

%cd /usr/local
%sudo git clone https://github.com/apache/spark.git

You can run the build with sudo, but for later configuration it is easier to chown the folder to your own account so you can edit it without sudo (replace abe below with your own username).

%sudo chown -R abe:admin /usr/local/spark
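If you would rather not hard-code a username, the owner can be derived from the current login. The sketch below only prints the command so you can review it before running it; spark_chown_cmd is a hypothetical helper, and the admin group is carried over from the command above.

```shell
# Hypothetical helper: print (do not run) a chown command that uses the
# current user's name instead of a hard-coded one.
spark_chown_cmd() {
  dir="${1:-/usr/local/spark}"
  echo "sudo chown -R $(id -un):admin $dir"
}

# Review the printed command, then paste it into the terminal to run it.
spark_chown_cmd
```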

Building Spark with Maven

You can now build Spark with YARN support, Hadoop 2.7, and Scala 2.11.

%cd /usr/local/spark
%export MAVEN_OPTS="-Xmx1300M -XX:MaxPermSize=512M -XX:ReservedCodeCacheSize=512m"
%./build/mvn -Pyarn -Phadoop-2.7 -Dscala-2.11 -DskipTests clean package
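Maven prints BUILD SUCCESS when it finishes, but you can also check that the assembly jars were actually produced. This is a rough sketch: spark_build_ok is a hypothetical helper, and the assembly/target/scala-2.11/jars path is an assumption based on the Scala 2.11 build above, so it may differ for other Spark or Scala versions.

```shell
# Hypothetical helper: succeed if the given Spark checkout contains
# built assembly jars. The scala-2.11 path segment is an assumption
# tied to the build flags used in this tutorial.
spark_build_ok() {
  jars_dir="$1/assembly/target/scala-2.11/jars"
  ls "$jars_dir"/*.jar >/dev/null 2>&1
}

if spark_build_ok "${SPARK_HOME:-/usr/local/spark}"; then
  echo "build output found"
else
  echo "no jars found; check the Maven log"
fi
```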

Once the build succeeds, edit your ~/.bash_profile to include the following lines (without the leading % prompt character):

%export SPARK_HOME=/usr/local/spark
%export PATH=$PATH:$SPARK_HOME/bin
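If you prefer to script this step, one option is to append each line only when it is not already present, so that re-running the setup does not duplicate entries. The sketch below assumes a hypothetical helper named append_once; the usage lines are left commented out because they modify your real profile.

```shell
# Hypothetical helper: append a line to a file only if that exact
# line is not already in it.
append_once() {
  line="$1"
  file="$2"
  touch "$file"
  grep -qxF -- "$line" "$file" || printf '%s\n' "$line" >> "$file"
}

# Example usage (writes to your real profile, so review it first):
#   append_once 'export SPARK_HOME=/usr/local/spark' "$HOME/.bash_profile"
#   append_once 'export PATH=$PATH:$SPARK_HOME/bin' "$HOME/.bash_profile"
```

The single quotes keep $PATH and $SPARK_HOME unexpanded, so the profile receives the literal text and the variables are resolved each time a new shell starts.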

Then source your .bash_profile to apply the new settings.

%source ~/.bash_profile

Test the Installation

Now you can test the installation.

%spark-shell

 

Terminal View of Spark-Shell
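For a non-interactive check, Spark's bin directory also includes a run-example script that can launch the bundled SparkPi job. The sketch below wraps it in a hypothetical helper, spark_smoke_test, and assumes $SPARK_HOME/bin is on your PATH and that the examples module was built along with the rest of Spark.

```shell
# Hypothetical helper: run the bundled SparkPi example if Spark's
# run-example script can be found on the PATH.
spark_smoke_test() {
  if command -v run-example >/dev/null 2>&1; then
    # 10 is the number of partitions used to estimate Pi.
    run-example SparkPi 10
  else
    echo "run-example not found on PATH"
  fi
}

spark_smoke_test
```

If the job runs, its output should include a line with an approximation of Pi among the Spark log messages.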