Data Engineering

by HariharaSam • Contributor

09-09-2022 5:51:59 AM

26268 Views
4 replies
2 kudos

Parallel Processing of Databricks Notebook

I have a scenario where I need to run the same Databricks notebook multiple times in parallel. What is the best approach to do this?
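
One commonly suggested pattern for this kind of question is to call the notebook from a driver notebook on a thread pool. The sketch below is only an illustration under assumptions, not an answer taken from this thread: the helper name `run_in_parallel`, the notebook path, and the `run_id` parameter are all hypothetical. The runner is passed in as an argument so the helper can be tried outside Databricks; inside a notebook it would typically be `dbutils.notebook.run`, which takes a path, a timeout in seconds, and a dictionary of arguments.

```python
from concurrent.futures import ThreadPoolExecutor


def run_in_parallel(run_notebook, path, param_sets, max_workers=4, timeout_seconds=3600):
    """Run one notebook once per parameter set, concurrently.

    run_notebook is any callable with the shape (path, timeout_seconds, arguments),
    which is the shape of dbutils.notebook.run on Databricks.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit one run per parameter set; at most max_workers run at a time.
        futures = [
            pool.submit(run_notebook, path, timeout_seconds, params)
            for params in param_sets
        ]
        # Collect results in submission order; result() re-raises a failed run.
        return [future.result() for future in futures]


# Inside a Databricks notebook (hypothetical path and parameters):
# results = run_in_parallel(
#     dbutils.notebook.run,
#     "/path/to/notebook",
#     [{"run_id": "1"}, {"run_id": "2"}],
# )
```

Threads are enough here because each call mostly waits on the child notebook run; `max_workers` caps how many runs share the cluster at once.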

Latest Reply

Anonymous
Not applicable

09-24-2022 1:08:54 AM

2 kudos

Hi @Hariharan Sambath, hope all is well! Just wanted to check in: were you able to resolve your issue, and if so, would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you....

by narek_margaryan • New Contributor II

10-06-2021 12:51:06 PM

3635 Views
1 reply
3 kudos

Resolved! Do Spark nodes read data from storage in a sequence?

I'm new to Spark and trying to understand how some of its components work. I understand that once the data is loaded into the memory of separate nodes, they process partitions in parallel, within their own memory (RAM). But I'm wondering whether the in...

Latest Reply

-werners-
Esteemed Contributor III

10-08-2021 12:11:36 AM

3 kudos

@Narek Margaryan, normally the reading is done in parallel because the underlying file system is already distributed (if you use HDFS-based storage or something similar, such as a data lake). The number of partitions in the file itself also matters. This l...

Databricks Community
