<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to get cost per job which runs on ALL_PURPOSE_COMPUTE ?? in Administration &amp; Architecture</title>
    <link>https://community.databricks.com/t5/administration-architecture/how-to-get-cost-per-job-which-runs-on-all-purpose-compute/m-p/96490#M2176</link>
    <description>&lt;P&gt;You’re right &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/129087"&gt;@KUMAR__111&lt;/a&gt;—tracking costs for jobs on &lt;STRONG&gt;ALL_PURPOSE_COMPUTE&lt;/STRONG&gt; clusters can be tricky since &lt;STRONG&gt;DBU usage isn’t directly tied to specific jobs&lt;/STRONG&gt;. When multiple jobs run in parallel on the same cluster, it’s challenging to allocate costs accurately. Consider using &lt;STRONG&gt;cluster tags&lt;/STRONG&gt; to label clusters by job, which can help with grouping costs even when jobs share clusters. Running &lt;STRONG&gt;job-specific clusters&lt;/STRONG&gt; for key workloads could provide clearer cost attribution. You could also &lt;STRONG&gt;cross-reference job logs with cluster usage metrics&lt;/STRONG&gt;, though this can be manual. Leveraging the &lt;STRONG&gt;Databricks REST API&lt;/STRONG&gt; can help gather more detailed metrics to better estimate costs per job.&lt;/P&gt;&lt;P&gt;Just a thought. Give a try and let me know.&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Brahma&lt;/P&gt;</description>
    <pubDate>Mon, 28 Oct 2024 12:44:15 GMT</pubDate>
    <dc:creator>Brahmareddy</dc:creator>
    <dc:date>2024-10-28T12:44:15Z</dc:date>
    <item>
      <title>How to get cost per job which runs on ALL_PURPOSE_COMPUTE ??</title>
      <link>https://community.databricks.com/t5/administration-architecture/how-to-get-cost-per-job-which-runs-on-all-purpose-compute/m-p/95942#M2154</link>
      <description>&lt;P&gt;with&amp;nbsp;&lt;SPAN&gt;system&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;billing&lt;/SPAN&gt;&lt;SPAN&gt;.usage table i could get cost per jobs which are runs on JOB_COMPUTE but not for jobs which runs on&amp;nbsp;ALL_PURPOSE_COMPUTE.&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 24 Oct 2024 11:09:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/how-to-get-cost-per-job-which-runs-on-all-purpose-compute/m-p/95942#M2154</guid>
      <dc:creator>KUMAR__111</dc:creator>
      <dc:date>2024-10-24T11:09:54Z</dc:date>
    </item>
    <item>
      <title>Re: How to get cost per job which runs on ALL_PURPOSE_COMPUTE ??</title>
      <link>https://community.databricks.com/t5/administration-architecture/how-to-get-cost-per-job-which-runs-on-all-purpose-compute/m-p/96064#M2164</link>
      <description>&lt;P&gt;Hi Kumar,&lt;/P&gt;&lt;P&gt;How are you? As per my understanding, please c&lt;SPAN&gt;onsider checking if your jobs running on &lt;/SPAN&gt;&lt;STRONG&gt;ALL_PURPOSE_COMPUTE&lt;/STRONG&gt;&lt;SPAN&gt; are being tracked properly in the &lt;/SPAN&gt;&lt;STRONG&gt;system.billing.usage&lt;/STRONG&gt;&lt;SPAN&gt; table. For &lt;/SPAN&gt;&lt;STRONG&gt;ALL_PURPOSE_COMPUTE&lt;/STRONG&gt;&lt;SPAN&gt; workloads, billing can sometimes be aggregated under &lt;/SPAN&gt;&lt;STRONG&gt;interactive clusters&lt;/STRONG&gt;&lt;SPAN&gt;, and the costs might not be attributed directly to specific jobs, making it harder to get a job-specific breakdown. You might want to &lt;/SPAN&gt;&lt;STRONG&gt;cross-reference cluster usage&lt;/STRONG&gt;&lt;SPAN&gt; with job runs using the &lt;/SPAN&gt;&lt;STRONG&gt;cluster usage metrics&lt;/STRONG&gt;&lt;SPAN&gt; or &lt;/SPAN&gt;&lt;STRONG&gt;cluster events logs&lt;/STRONG&gt;&lt;SPAN&gt;. This will help you map costs from ALL_PURPOSE_COMPUTE clusters to the jobs they are supporting. Alternatively, you can explore &lt;/SPAN&gt;&lt;STRONG&gt;Databricks' cost management tools&lt;/STRONG&gt;&lt;SPAN&gt; or integrate with external billing tools to get a more granular view of job-level costs on these compute types.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Give a try and let me know.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Regards,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Brahma&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 25 Oct 2024 03:21:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/how-to-get-cost-per-job-which-runs-on-all-purpose-compute/m-p/96064#M2164</guid>
      <dc:creator>Brahmareddy</dc:creator>
      <dc:date>2024-10-25T03:21:56Z</dc:date>
    </item>
    <item>
      <title>Re: How to get cost per job which runs on ALL_PURPOSE_COMPUTE ??</title>
      <link>https://community.databricks.com/t5/administration-architecture/how-to-get-cost-per-job-which-runs-on-all-purpose-compute/m-p/96362#M2172</link>
      <description>&lt;P&gt;If nowhere DBU is captured for jobs under&amp;nbsp;&lt;STRONG&gt;ALL_PURPOSE_COMPUTE &lt;/STRONG&gt;then&amp;nbsp;cost breakdown-based cluster events is very difficult as more than 2 jobs can parallel. So mapping is very difficult to break down cost for specific job.&lt;BR /&gt;let me know if I am missing anything.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Oct 2024 05:08:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/how-to-get-cost-per-job-which-runs-on-all-purpose-compute/m-p/96362#M2172</guid>
      <dc:creator>KUMAR__111</dc:creator>
      <dc:date>2024-10-28T05:08:37Z</dc:date>
    </item>
    <item>
      <title>Re: How to get cost per job which runs on ALL_PURPOSE_COMPUTE ??</title>
      <link>https://community.databricks.com/t5/administration-architecture/how-to-get-cost-per-job-which-runs-on-all-purpose-compute/m-p/96490#M2176</link>
      <description>&lt;P&gt;You’re right &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/129087"&gt;@KUMAR__111&lt;/a&gt;—tracking costs for jobs on &lt;STRONG&gt;ALL_PURPOSE_COMPUTE&lt;/STRONG&gt; clusters can be tricky since &lt;STRONG&gt;DBU usage isn’t directly tied to specific jobs&lt;/STRONG&gt;. When multiple jobs run in parallel on the same cluster, it’s challenging to allocate costs accurately. Consider using &lt;STRONG&gt;cluster tags&lt;/STRONG&gt; to label clusters by job, which can help with grouping costs even when jobs share clusters. Running &lt;STRONG&gt;job-specific clusters&lt;/STRONG&gt; for key workloads could provide clearer cost attribution. You could also &lt;STRONG&gt;cross-reference job logs with cluster usage metrics&lt;/STRONG&gt;, though this can be manual. Leveraging the &lt;STRONG&gt;Databricks REST API&lt;/STRONG&gt; can help gather more detailed metrics to better estimate costs per job.&lt;/P&gt;&lt;P&gt;Just a thought. Give a try and let me know.&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Brahma&lt;/P&gt;</description>
      <pubDate>Mon, 28 Oct 2024 12:44:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/how-to-get-cost-per-job-which-runs-on-all-purpose-compute/m-p/96490#M2176</guid>
      <dc:creator>Brahmareddy</dc:creator>
      <dc:date>2024-10-28T12:44:15Z</dc:date>
    </item>
  </channel>
</rss>

