Spark read not working in parallel
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
06-20-2023 12:29 PM
Hi All,
Am trying to run a spark jdbc query and the read is running on only 1 worker and is taking a lot of time. Usually, it takes 4 minutes in aws lambda but now it is taking approx 10 minutes while using databricks. There are only 4m records (write is not taking long but read is consuming the max time)
Labels:
- Labels:
-
Parallel
-
Spark
-
Spark JDBC Query
1 REPLY 1

Anonymous
Not applicable
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
06-20-2023 09:04 PM
Hi @Saloni Bhatia
Great to meet you, and thanks for your question!
Let's see if your peers in the community have an answer to your question. Thanks.

