My whole code is running on the driver node; I want it to run on the worker nodes so that the driver's memory is not exhausted. Please suggest improvements to my code. My Spark job crashes frequently when the data pulled from S3 is large.
I am running a process that has 4 steps:
1. Query S3 file paths from DynamoDB based on certain parameters given by the user (the client provides a function for this; I just have to import it). This returns a list of files.
2. Check whether those file paths have already been queried…
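A common cause of this pattern is downloading the S3 files on the driver (e.g. with boto3) and only then building a DataFrame. Below is a minimal PySpark sketch of the alternative: hand the list of paths returned by the DynamoDB lookup straight to `spark.read`, so each executor reads its own share of the files and the driver never holds the data. The `client_lib.query_file_paths` helper, the bucket names, and the Parquet format are placeholders, not the actual client API.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("s3-distributed-read").getOrCreate()

# Step 1: the DynamoDB lookup (client-provided function) runs on the driver,
# which is fine -- it only returns a small list of path strings.
# paths = client_lib.query_file_paths(params)   # hypothetical client helper
paths = [
    "s3://my-bucket/data/part-0.parquet",       # placeholder paths
    "s3://my-bucket/data/part-1.parquet",
]

# Instead of downloading the files on the driver, pass the whole list to
# spark.read. Each executor reads its own split of the files, so no single
# node has to hold all of the data in memory.
df = spark.read.parquet(*paths)

# Keep the pipeline lazy and write results back to storage. Avoid
# df.collect() / df.toPandas(): both pull the full dataset to the driver,
# which is what exhausts driver memory on large S3 reads.
df.write.mode("overwrite").parquet("s3://my-bucket/output/")
```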
Latest Reply
Hi @uzair mustafa, thank you for posting your question in our community! We are happy to assist you. Does @Suteja Kanuri's answer help? If it does, would you be happy to mark it as best? This will help other community members who may have similar questions.