Spark - How to run a standalone cluster locally

luke

Is there the possibility to run the Spark standalone cluster locally on just one machine (which is basically different from just developing jobs locally (i.e., local[*]))?.

So far I am running 2 different VMs to build a cluster, what if I could run a standalone cluster on the very same machine, having for instance three different JVMs running?

Could something like having multiple loopback addresses do the trick?

banjara

yes you can do it, launch one master and one worker node and you are good to go

launch master

./sbin/start-master.sh

launch worker

./bin/spark-class org.apache.spark.deploy.worker.Worker  spark://localhost:7077 -c 1 -m 512M

run SparkPi example

./bin/spark-submit  --class org.apache.spark.examples.SparkPi   --master spark://localhost:7077  lib/spark-examples-1.2.1-hadoop2.4.0.jar 

Apache Spark Standalone Mode Documentation

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at
0

Comments

0 comments
Login to comment

Related

How to send Spark metrics to Graphite on Standalone cluster?

Authentication for Spark standalone cluster

How to check if code is run locally or on a cluster in Python

Forcing driver to run on specific slave in spark standalone cluster running with "--deploy-mode cluster"

How to set path to files in Apache Spark Standalone Cluster?

How multiple executors are managed on the worker nodes with a Spark standalone cluster?

SPARK standalone cluster: Executors exit, how to track the source of the error?

How can I see the aggregated logs for a Spark standalone cluster

How to install Apache Zeppelin on existing Apache Spark standalone cluster

Spark Standalone how to pass local .jar file to cluster

Standalone spark cluster Authorization with Ranger

Submitting Spark application on standalone cluster

How to submit spark job from within java program to standalone spark cluster without using spark-submit?

How to run Spark application assembled with Spark 2.1 on cluster with Spark 1.6?

How to run Spark Sql on a 10 Node cluster

How do I run multiple spark applications in parallel in standalone master

Error while running spark on standalone cluster

Spark Standalone Cluster :Configuring Distributed File System

zeppelin with spark 1.5.2 standalone cluster error

What are workers, executors, cores in Spark Standalone cluster?

Spark - Understanding a simple application on Standalone cluster

Spark resource scheduling - Standalone cluster manager

Spark running in local machine instead of standalone cluster

Spark tasks blockes randomly on standalone cluster

Docker Container with Apache Spark in standalone cluster mode

SnappyData: Connect Standalone Spark Job to Embedded Cluster

AWS Spark standalone cluster does not start

Find the leader Node in a Spark Standalone Cluster with Zookeeper

How to enable spark-history server for standalone cluster non hdfs mode