<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Does Databricks have a google cloud Big Query equivalent of --dry_run to estimate costs before executing? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18288#M12107</link>
    <description>&lt;P&gt;Databricks does give you a view on how many DBUs/hour a cluster consumes (from-to interval in case of autoscaling), see the cluster pane for this.&lt;/P&gt;&lt;P&gt;With that and a duration of the job, you can make an estimate.  But the duration... for that you need to run the program (perhaps on small data and extrapolate).&lt;/P&gt;&lt;P&gt;This is a pretty rough estimate though.  Maybe others have succeeded in doing this.&lt;/P&gt;</description>
    <pubDate>Thu, 09 Jun 2022 11:00:40 GMT</pubDate>
    <dc:creator>-werners-</dc:creator>
    <dc:date>2022-06-09T11:00:40Z</dc:date>
    <item>
      <title>Does Databricks have a google cloud Big Query equivalent of --dry_run to estimate costs before executing?</title>
      <link>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18285#M12104</link>
      <description>&lt;P&gt;Databricks uses DBU's as a costing unit whether based onto of AWS/Azure/GCP and I want to know if Databricks has a google cloud Big Query equivalent of --dry_run for estimating costs?&amp;nbsp;&lt;A href="https://cloud.google.com/bigquery/docs/estimate-costs" alt="https://cloud.google.com/bigquery/docs/estimate-costs" target="_blank"&gt;https://cloud.google.com/bigquery/docs/estimate-costs&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 08 Jun 2022 11:16:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18285#M12104</guid>
      <dc:creator>zach</dc:creator>
      <dc:date>2022-06-08T11:16:25Z</dc:date>
    </item>
    <item>
      <title>Re: Does Databricks have a google cloud Big Query equivalent of --dry_run to estimate costs before executing?</title>
      <link>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18286#M12105</link>
      <description>&lt;P&gt;Not that I know of.&lt;/P&gt;&lt;P&gt;Google uses number of bytes read to determine the cost.&lt;/P&gt;&lt;P&gt;Databricks uses DBU. The number of DBU's spent is not only dependent on the amount of bytes read (the more you read, the longer the program will run probably), but also the type of VM used.&lt;/P&gt;&lt;P&gt;Then there is also autoscaling which makes it harder to predict a price.&lt;/P&gt;&lt;P&gt;Also the total cost is not only DBU but also the provisioning cost of the VMs.&lt;/P&gt;&lt;P&gt;So that makes it pretty hard to predict a cost.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;It would of course be very cool to have such a prediction.&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jun 2022 10:35:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18286#M12105</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-06-09T10:35:17Z</dc:date>
    </item>
    <item>
      <title>Re: Does Databricks have a google cloud Big Query equivalent of --dry_run to estimate costs before executing?</title>
      <link>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18287#M12106</link>
      <description>&lt;P&gt;Hi @Werner Stinckens​&amp;nbsp;thank you for taking the time to reply and for the thoughtful response. I find it hard to believe that so many companies are using the type of compute when the price is hard to know. I understand there is some ambiguity with the bytes read and cluster type, do you know of a way to give a rough estimate?&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jun 2022 10:49:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18287#M12106</guid>
      <dc:creator>zach</dc:creator>
      <dc:date>2022-06-09T10:49:02Z</dc:date>
    </item>
    <item>
      <title>Re: Does Databricks have a google cloud Big Query equivalent of --dry_run to estimate costs before executing?</title>
      <link>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18288#M12107</link>
      <description>&lt;P&gt;Databricks does give you a view on how many DBUs/hour a cluster consumes (from-to interval in case of autoscaling), see the cluster pane for this.&lt;/P&gt;&lt;P&gt;With that and a duration of the job, you can make an estimate.  But the duration... for that you need to run the program (perhaps on small data and extrapolate).&lt;/P&gt;&lt;P&gt;This is a pretty rough estimate though.  Maybe others have succeeded in doing this.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jun 2022 11:00:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18288#M12107</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-06-09T11:00:40Z</dc:date>
    </item>
    <item>
      <title>Re: Does Databricks have a google cloud Big Query equivalent of --dry_run to estimate costs before executing?</title>
      <link>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18290#M12109</link>
      <description>&lt;P&gt;Hi Kaniz, unfortunately there are no answers in the thread. It would be good to get a steer from someone at Databricks if possible. &lt;/P&gt;</description>
      <pubDate>Fri, 17 Jun 2022 10:45:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-databricks-have-a-google-cloud-big-query-equivalent-of-dry/m-p/18290#M12109</guid>
      <dc:creator>zach</dc:creator>
      <dc:date>2022-06-17T10:45:41Z</dc:date>
    </item>
  </channel>
</rss>

