โ07-04-2024 11:27 PM
I'm trying to run a workflow with job compute.
Job compute needs to be pending for about 5 to 7 minutes before executing the workflow. I think it takes time to find a suitable instance in the cloud, configure the environment, install libraries, etc.
And Databricks charges us by the time we use it. So, is the 5-7 minutes required for pending included in the cost? Or is only the time when pending ends and the workflow actually starts included in the cost?
Thank you for your answer.
โ07-08-2024 10:45 PM
There is a big discrepancy to take note of here. It is the difference between Databricks job costs (DBUs) and billing for the cloud provider (the actual VM doing the the work).
Billing for the Databricks job starts when the Spark context is being initiated (you can see so in the event log). It is usually at 4-5 minutes in.
Billing for the cloud provider starts the moment you press the start button on your pipeline (this will provision the VM in 5 minutes).
Hope this helps!
โ07-06-2024 08:27 AM
The cost of JOB COMPUTE starts to be calculated from the moment the job starts running.
You can monitor the cost of jobs that use serverless compute for workflows by querying the billable usage system table.
โ07-08-2024 10:00 PM
Thank you for your answer. That means pending time is also included in the cost, right?
โ07-08-2024 10:45 PM
There is a big discrepancy to take note of here. It is the difference between Databricks job costs (DBUs) and billing for the cloud provider (the actual VM doing the the work).
Billing for the Databricks job starts when the Spark context is being initiated (you can see so in the event log). It is usually at 4-5 minutes in.
Billing for the cloud provider starts the moment you press the start button on your pipeline (this will provision the VM in 5 minutes).
Hope this helps!
Passionate about hosting events and connecting people? Help us grow a vibrant local communityโsign up today to get started!
Sign Up Now