cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Cost of running a job on databricks

Prem1902
New Contributor II

Hi All, 

I need assistance with the cost of running the job on Databricks where I have 20-30 TB (one-time job) and daily data would be around 2GB. 

The level of transformation would be medium.  Source and destination is AWS s3.

Looking for your quick response .

Thanking in anticipation

 

1 ACCEPTED SOLUTION

Accepted Solutions

Ayushi_Suthar
Databricks Employee
Databricks Employee

Hi @Prem1902, Good Day! 

Thank you for raising your concern with us. Can you please check this document on the Billable usage system table. With system tables, you can view your accountโ€™s global usage from whichever region your workspace is in.

Document : https://docs.databricks.com/en/administration-guide/system-tables/billing.html#billable-usage-system...

However please note that System Tables gives us the details on a per Job ID basis and not per run ID.

Apart from this another option is to configure overwatch which has jobruncostpotentialfact schema which contains the DBU costs along with the run ID.

jobruncostpotentialfact Data Dictionary : https://databrickslabs.github.io/overwatch/dataengineer/definitions/#jobruncostpotentialfact
Deploy OverWatch : https://databrickslabs.github.io/overwatch/deployoverwatch/

Please let me know if this helps and leave a like if this information is useful, followups are appreciated.
Kudos
Ayushi

View solution in original post

2 REPLIES 2

Ayushi_Suthar
Databricks Employee
Databricks Employee

Hi @Prem1902, Good Day! 

Thank you for raising your concern with us. Can you please check this document on the Billable usage system table. With system tables, you can view your accountโ€™s global usage from whichever region your workspace is in.

Document : https://docs.databricks.com/en/administration-guide/system-tables/billing.html#billable-usage-system...

However please note that System Tables gives us the details on a per Job ID basis and not per run ID.

Apart from this another option is to configure overwatch which has jobruncostpotentialfact schema which contains the DBU costs along with the run ID.

jobruncostpotentialfact Data Dictionary : https://databrickslabs.github.io/overwatch/dataengineer/definitions/#jobruncostpotentialfact
Deploy OverWatch : https://databrickslabs.github.io/overwatch/deployoverwatch/

Please let me know if this helps and leave a like if this information is useful, followups are appreciated.
Kudos
Ayushi

Prem1902
New Contributor II

Is there a way to predict the cost before building the solution? I mean we wanted to see our option on different platforms.

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group