cancel
Showing results for 
Search instead for 
Did you mean: 
Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.
cancel
Showing results for 
Search instead for 
Did you mean: 

How to implement automatic scaling of cluster size in Serverless Warehouse

fishingrod
New Contributor II

I would like to know if the cluster size of a Serverless Warehouse can automatically scale up and down, and what determines the number of workers used when executing queries. Does it use all workers within the cluster size fixedly, or does it use part or all of the workers in a cluster according to the query plan?

3 REPLIES 3

Alberto_Umana
Databricks Employee
Databricks Employee

Hello @fishingrod,

Yes, the cluster size of a Serverless Warehouse can automatically scale up and down. The autoscaling behavior is managed by Intelligent Workload Management (IWM), which dynamically adjusts the compute resources based on the workload demands. This system uses machine learning models to predict the resource needs of incoming queries and monitors the warehouse’s available compute capacity in real time.

The number of workers used when executing queries is not fixed. Instead, the system scales the number of resources to accommodate varying demands. The autoscaling mechanism ensures that compute resources are rapidly upscaled to maintain low latency and quickly downscaled to minimize costs when demand is low. This dynamic management allows the warehouse to handle large numbers of queries efficiently and cost-effectively.

Please see: https://docs.databricks.com/en/compute/sql-warehouse/warehouse-behavior.html

So what is the purpose of the `cluster size` I set when I created the Serverless Warehouse? Isn’t it automatically adjusted?

Takuya-Omi
Valued Contributor II

@fishingrod 

My understanding is that Intelligent Workload Management (IWM) in Serverless SQL Warehouses adjusts the number of clusters, but it does not automatically scale the cluster size.

This means that if you need to improve the execution performance of individual SQL queries, you will need to explicitly specify the cluster size yourself.

--------------------------
Takuya Omi (尾美拓哉)

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now