Skip to content

ls12styler/docker-spark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Docker Apache Spark Cluster

A scalable Apache Spark cluster running in Docker.

Build: Build Status

Usage:

Cluster

Bring up a 4 node cluster (1 Master, 3 Workers):

docker-compose up --scale spark-slave=3

spark-shell

By default, running the image without a command will take you into the Spark shell:

docker run --rm -it -e SPARK_MASTER=spark://spark-master:7077 --network docker-spark_spark-network ls12styler/spark:latest

Passing Arguments to spark-shell

docker run --rm -it -e SPARK_MASTER=spark://spark-master:7077 --network docker-spark_spark-network ls12styler/spark:latest

bash shell

You can overide the command like so if you need anything else:

docker run --rm -it -e SPARK_MASTER=spark://spark-master:7077 --network docker-spark_spark-network ls12styler/spark:latest /bin/bash

Workflow

docker-compose up --scale spark-slave=3
docker build -t ls12styler/spark:latest .
docker run --rm -it -e SPARK_MASTER=spark://spark-master:7077 --network docker-spark_spark-network ls12styler/spark:latest

Used as a reference: https://medium.com/@marcovillarreal_40011/creating-a-spark-standalone-cluster-with-docker-and-docker-compose-ba9d743a157f

About

A docker based, scalable Apache Spark Cluster

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published