Docker-Compose Demo¶
For demonstration and experimentation purposes, a docker-compose setup can be found here.
To run, first install docker and docker-compose, following the instructions
for your OS. You’ll also need to make sure that docker is started with
sufficient resources - we recommend having at least 4 GB allocated to your
docker-machine.
The demo cluster can then be started as follows:
# Clone the repository
$ git clone https://github.com/jupyterhub/jupyterhub-on-hadoop.git
# Enter the `docker-demo` directory
$ cd jupyterhub-on-hadoop/docker-demo
# Start the demo cluster
$ docker-compose up -d
JupyterHub will then be available on port 8888 at your docker-machine
IP address. This IP address can be found at:
$ docker-machine inspect --format {{.Driver.IPAddress}})
Once you’re done using the demo, it can be shutdown with (from the
docker-demo directory):
$ docker-compose down
The demo comes with the following features:
- A realistic Hadoop 3 (CDH 6) cluster (1 master, 1 worker, 1 edge node), with Kerberos security enabled.
- 3 user accounts (
alice,bob, andcarl). The password for each istestpass. - Both Jupyter Notebook and JupyterLab are available. The default upon login is
the Notebook interface, replace
/treewith/labin the URL to access JupyterLab. - Each user gets access to a Python 3.7 environment, with common packages like
numpyandpandasalready installed. - Dask and Spark are both installed and fully configured. See Integration with Dask and Integration with Spark for more information on use.
For a walkthrough using the same demo cluster, see this video: