Standalone is a simple cluster manager included with Spark that makes it easy to set up a cluster.
The connection URL is:
Example with sparklyr:
sc <- sparklyr::spark_connect(master = "spark://nicoLaptop:7077")
where: master = Spark - Master (Connection URL )
./sbin/start-master.sh
# The start master script will call the following command
# ./bin/spark-class.cmd org.apache.spark.deploy.master.Master --host HI-LAPTOP-NGD1 --port 7077 --webui-port 8082
# Then
# C:\Program Files\Java\jdk1.8.0_45\jre\bin\java -cp C:/spark-2.1.1-bin-hadoop2.7/conf\;C:\spark-2.1.1-bin-hadoop2.7\bin\..\jars\* -Xmx1g org.apache.spark.deploy.master.Master --host HI-LAPTOP-NGD1 --port 7077 --webui-port 8080
./sbin/start-slave.sh <master-spark-URL>
For the worker and the master
The port of the UI by default is:
It can be change through:
The port of the service by default is:
It can be change through: