cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Spark - Cluster Mode - Driver

BST
New Contributor

When running a Spark Job in Cluster Mode, how does Spark decide which worker node to place the driver resources ?

 

1 ACCEPTED SOLUTION

Accepted Solutions

Kaniz
Community Manager
Community Manager

Hi @BSTWhen running a Spark job in cluster mode, it involves a central manager (e.g., YARN, Mesos, Kubernetes), a driver program, and worker nodes. The driver program is submitted to the central manager, which allocates resources and decides where to run the driver. Worker nodes execute the tasks. Data location influences driver placement, as Spark may aim to put it on the same or nearby machines to minimize data transfer. The central manager makes these decisions based on resource availability and data locality.

View solution in original post

1 REPLY 1

Kaniz
Community Manager
Community Manager

Hi @BSTWhen running a Spark job in cluster mode, it involves a central manager (e.g., YARN, Mesos, Kubernetes), a driver program, and worker nodes. The driver program is submitted to the central manager, which allocates resources and decides where to run the driver. Worker nodes execute the tasks. Data location influences driver placement, as Spark may aim to put it on the same or nearby machines to minimize data transfer. The central manager makes these decisions based on resource availability and data locality.