Python running far slower than locally, even with large cluster and multiple workers
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
11-21-2024 04:21 AM
I have a notebook that is running extremely slowly even when I try to do pretty basic python functions. It is running far slower than locally no matter what I try, this is in spite of using a 32gb 4 core cluster with 4-8 workers. For context, my data is around 7m rows with about 20 columns, so not huge! Any help would be much appreciated.
2 REPLIES 2
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
11-21-2024 04:23 AM
Just noticed it says Python, in title - it should say Spark.
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
11-26-2024 11:33 AM
Please share more information, for example:
- Type of data source
- Type of operations being executed (sharing code if possible)
- Timings of local runs and Databricks runs

