This Dockerfile only build Zeppelin which has dependency on the rest of other dockers in "docker-spark-bde2020" including Hadoop, Spark, Hive, etc.
You can get from https://hub.docker.com/r/openkbs/docker-spark-bde2020-zeppelin/
docker pull openkbs/docker-spark-bde2020-zeppelin
To build,
./build.sh
docker-compose -f docker-compose-hive.yml up -d zeppelin
There two options to run the entire suite of "docker-spark-bde2020"
- start-hadoop-spark-workbench.sh (no Hive support)
- start-hadoop-spark-workbench-with-hive.sh (with Hive support)
For example, to start the entire "docker-spark-bde2020 and zeppelin with Hive support:
./start-hadoop-spark-workbench-with-hive.sh
For example, to start the entire "docker-spark-bde2020 and zeppelin without Hive support:
./start-hadoop-spark-workbench.sh
To see how this Container work with with the entire big-data-europe/docker-hadoop-spark-workbench, go to "./example-docker-spark-bde2020" directory to explore the entire suite build.
** For example usage see docker-compose.yml and SANSA-Notebooks repository.