cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Cost estimation before query execution similar to google cloud Big Query equivalent of --dry_run

NehaR
New Contributor III

Hi ,

 

In databricks do we have a option to estimate cost of query before execution which is similar to Big Query equivalent of --dry_run.

Our use case is to estimate cost before execution and get alerted.

 

Regards

Neha 

 

 

1 REPLY 1

Alberto_Umana
Databricks Employee
Databricks Employee

Hello @NehaR,

Currently, Databricks does not have a direct equivalent to BigQuery's --dry_run feature for estimating the cost of a query before execution. However, there are some mechanisms and ongoing projects that aim to provide similar functionality. There is no ETA yet, I will update over here if any update on its implementations.

For now, you can monitor the DBU consumption of your clusters and use historical data to estimate the cost of similar queries. Additionally, you can run smaller versions of your queries to get an idea of their cost and then extrapolate for larger datasets

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group