<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Cost in Administration &amp; Architecture</title>
    <link>https://community.databricks.com/t5/administration-architecture/cost/m-p/121780#M3473</link>
    <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/167880"&gt;@Athul97&lt;/a&gt; provided a solid list of best practices. To go deeper into Budgets &amp;amp; Alerts, I have had good results with the Consumption and Budget feature in the Databricks Account Portal under the Usage menu. Once you apply tags consistently to all Databricks assets, you get a clear picture of usage and of where the spend is occurring. You can combine this with your general cloud consumption costs for things like storage and networking, and it gives you more granular reporting inside your workspaces.&lt;/P&gt;&lt;P&gt;The other area where I see opportunity is setting up an engineering code review and optimization process. I still see a lot of poor development practices, where incorrect use of libraries or inefficient data processing algorithms consume unnecessary cluster cycles. I recently audited a customer's worst-performing jobs and made a number of coding suggestions that led to significant reductions in execution time. Many of the jobs that used to run for hours now complete in 30-45 minutes without any changes to the cluster configurations.&lt;/P&gt;</description>
    <pubDate>Sat, 14 Jun 2025 13:59:43 GMT</pubDate>
    <dc:creator>jameshughes</dc:creator>
    <dc:date>2025-06-14T13:59:43Z</dc:date>
    <item>
      <title>Cost</title>
      <link>https://community.databricks.com/t5/administration-architecture/cost/m-p/121520#M3463</link>
      <description>&lt;P&gt;Do you have any information that would help me optimize costs and keep track of them?&lt;/P&gt;</description>
      <pubDate>Wed, 11 Jun 2025 18:26:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/cost/m-p/121520#M3463</guid>
      <dc:creator>Gmera</dc:creator>
      <dc:date>2025-06-11T18:26:56Z</dc:date>
    </item>
    <item>
      <title>Re: Cost</title>
      <link>https://community.databricks.com/t5/administration-architecture/cost/m-p/121581#M3465</link>
      <description>&lt;P&gt;1. Use job clusters instead of all-purpose clusters&lt;BR /&gt;2. Enable auto-termination to shut down idle clusters&lt;BR /&gt;3. Archive cold data to low-cost storage tiers (e.g., Azure Blob cool tier, AWS S3 Glacier)&lt;BR /&gt;4. Schedule jobs in off-peak hours and use spot instances where the workload tolerates interruption&lt;BR /&gt;5. Use the Photon engine for faster queries, which often lowers total cost&lt;BR /&gt;6. Set spending budgets and alerts in cloud cost tools&lt;BR /&gt;7. Regularly review cluster &amp;amp; job usage reports&lt;/P&gt;</description>
      <pubDate>Thu, 12 Jun 2025 09:53:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/cost/m-p/121581#M3465</guid>
      <dc:creator>Athul97</dc:creator>
      <dc:date>2025-06-12T09:53:27Z</dc:date>
    </item>
    <item>
      <title>Re: Cost</title>
      <link>https://community.databricks.com/t5/administration-architecture/cost/m-p/121780#M3473</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/167880"&gt;@Athul97&lt;/a&gt; provided a solid list of best practices. To go deeper into Budgets &amp;amp; Alerts, I have had good results with the Consumption and Budget feature in the Databricks Account Portal under the Usage menu. Once you apply tags consistently to all Databricks assets, you get a clear picture of usage and of where the spend is occurring. You can combine this with your general cloud consumption costs for things like storage and networking, and it gives you more granular reporting inside your workspaces.&lt;/P&gt;&lt;P&gt;The other area where I see opportunity is setting up an engineering code review and optimization process. I still see a lot of poor development practices, where incorrect use of libraries or inefficient data processing algorithms consume unnecessary cluster cycles. I recently audited a customer's worst-performing jobs and made a number of coding suggestions that led to significant reductions in execution time. Many of the jobs that used to run for hours now complete in 30-45 minutes without any changes to the cluster configurations.&lt;/P&gt;</description>
      <pubDate>Sat, 14 Jun 2025 13:59:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/cost/m-p/121780#M3473</guid>
      <dc:creator>jameshughes</dc:creator>
      <dc:date>2025-06-14T13:59:43Z</dc:date>
    </item>
  </channel>
</rss>

