# Install HDFS

## Installation

### WARNING:  This download is memory intensive.  Before doing this exercise stop all unnecessary processes.  This means WORD, PowerPoint, Docker, etc.

### Start your VM from Vagrant

```
cd msbx5420vagrant
vagrant up
```

After the VM starts, enter the following from the VM's terminal window.  This will stop and remove all docker contains and remove any hadoop image.

If you do not have docker containers and images you will see error - ignore them

```
docker kill $(docker ps -q)
docker rm $(docker ps -a -q)
docker rmi sequenceiq/hadoop-docker
```

### Pull Hadoop <a href="#pull-and-run-hadoop" id="pull-and-run-hadoop"></a>

## ***NOTICE:***&#x20;

It was reported Sunday afternoon that students were unable to "pull" the docker container.  Please try the following instead of the pull.  If it succeeds you will see  a bash# prompt.  Type exit and then shutdown Vagrant

```
docker run -it sequenceiq/hadoop-docker:2.7.1 /etc/bootstrap.sh -bash
```

## *This command may fail per the above*

```
docker pull sequenceiq/hadoop-docker:2.7.1
```

### Stop the VM from Vagrant - from the NATIVE terminal <a href="#stop-the-vm-from-vagrant-from-the-native-terminal" id="stop-the-vm-from-vagrant-from-the-native-terminal"></a>

Be sure to restart Vagrant to relieve memory issues

```
vagrant halt
```
