Installation and Getting Started

Install and Run Spark from Docker

This Download is memory intensive. Stop WORD, PowerPoint, Docker, etc before going on.

Open a terminal window in your native machine.

Change Directory (cd) into Vagrant directory and start your VM

cd msbx5420vagrant
vagrant up

Open a terminal in your VM and do the following:

  1. Create a work directory that will contain all of your class Jupyter notebooks

  2. Download notebooks I have created so far for the class.

cd
mkdir work
cd work
rm -rf MSBX5420-001-FunctionalProgramming
git clone  https://github.com/marilynwaldman/MSBX5420-001-FunctionalProgramming.git && rm -rf /MSBX5420-001-FunctionalProgramming.git

Stop all running docker containers

If you do not have docker containers and images you will see error - ignore them.

docker kill $(docker ps -q)
docker rm $(docker ps -a -q)
docker rmi jupyter/all-spark-notebook

Pull the spark image

docker pull  jupyter/all-spark-notebook

Verify that Spark is running

docker run -d --name spark  -p 8888:8888  \
    -v $HOME/work:/home/jovyan/work:rw  \
     jupyter/all-spark-notebook \
     start-notebook.sh --NotebookApp.token='' 

Goto your notebook

Open a browser window and issue:

http://localhost:8888

You can exit the browser and stop the spark container

docker stop spark

Stop your VM from your local machine (Mac OS or Windows)

vagrant halt

Last updated

Was this helpful?