Topics with Label: Cluster Autoscaling

Forum Posts

Sorted by:

by Mr__D • New Contributor II

04-04-2023 10:55:57 AM

7306 Views
1 replies
0 kudos

Databricks Cluster Autoscaling

Hello All,Could anyone please suggest impact of Autoscaling in cluster cost ?Suppose if I have a cluster where min worker is 2 and max is 10 but most of the time active worker are 3 so the cluster will be billed for only 3 workers or for 10 worker(...

Data Engineering

7306 Views
1 replies
0 kudos

04-04-2023 10:55:57 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-06-2023 7:00:14 PM

0 kudos

@Deepak Bhatt :Autoscaling in Databricks can have a significant impact on cluster cost, as it allows the cluster to dynamically add or remove workers based on the workload.In the scenario you described, if the active worker count is consistently at ...

0 kudos

04-06-2023 7:00:14 PM

by KellenO • New Contributor II

12-08-2022 12:30:22 PM

2594 Views
2 replies
8 kudos

Resolved! How can I use cluster autoscaling with intensive subprocess calls?

I have a custom application/executable that I upload to DBFS and transfer to my cluster's local storage for execution. I want to call multiple instances of this application in parallel, which I've only been able to successfully do with Python's subpr...

Data Engineering

2594 Views
2 replies
8 kudos

12-08-2022 12:30:22 PM

View Replies

Latest Reply

Anonymous
Not applicable

12-08-2022 4:18:17 PM

8 kudos

Autoscaling works for spark jobs only. It works by monitoring the job queue, which python code won't go into. If it's just python code, try single node.https://docs.databricks.com/clusters/configure.html#cluster-size-and-autoscaling

8 kudos

12-08-2022 4:18:17 PM

1 More Replies

by dataslicer • Contributor

04-14-2022 3:29:53 PM

6096 Views
7 replies
2 kudos

Resolved! Exploring additional cost saving options for structured streaming 24x7x365 uptime workloads

I currently have multiple jobs (each running its own job cluster) for my spark structured streaming pipelines that are long running 24x7x365 on DBR 9.x/10.x LTS. My SLAs are 24x7x365 with 1 minute latency. I have already accomplished the following co...

Data Engineering

6096 Views
7 replies
2 kudos

04-14-2022 3:29:53 PM

View Replies

Latest Reply

Anonymous
Not applicable

05-18-2022 5:29:05 AM

2 kudos

http://doramasmp4.tv/

2 kudos

05-18-2022 5:29:05 AM

6 More Replies

by User16826992666 • Valued Contributor

06-15-2021 9:03:50 PM

2749 Views
1 replies
0 kudos

Resolved! How does cluster autoscaling work?

What determines when the cluster autoscaling activates to add and remove workers? Also, can it be adjusted?

Data Engineering

2749 Views
1 replies
0 kudos

06-15-2021 9:03:50 PM

View Replies

Latest Reply

sajith_appukutt
Honored Contributor II

06-17-2021 3:26:46 PM

0 kudos

> What determines when the cluster autoscaling activates to add and remove workersDuring scale-down, the service removes a worker only if it is idle and does not contain any shuffle data. This allows aggressive resizing without killing tasks or recom...

0 kudos

06-17-2021 3:26:46 PM

Databricks Community

Databricks Cluster Autoscaling

Resolved! How can I use cluster autoscaling with intensive subprocess calls?

Resolved! Exploring additional cost saving options for structured streaming 24x7x365 uptime workloads

Resolved! How does cluster autoscaling work?