Create a Apache Spark cluster in Docker.
Note that this is just a proof of concept I developed just for testing purposes. See the Sources section for more information about the places I checked out. Moreover, some cleanup activities and enhancements are required, which I will try to do in my spare time.
Build images (with base):
DEPLOY_BASE=1 ./build
Build images (reusing base image):
./build
Start cluster:
docker-compose up --scale spark-master=1 --scale spark-worker=6 --scale spark-submit=0
Submit task:
docker-compose run spark-submit
./run <script args>