<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to use serverless clusters in DAB deployments with Unity Catalog in private network? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-to-use-serverless-clusters-in-dab-deployments-with-unity/m-p/139010#M51070</link>
    <description></description>
    <pubDate>Fri, 14 Nov 2025 08:16:41 GMT</pubDate>
    <dc:creator>Coffee77</dc:creator>
    <dc:date>2025-11-14T08:16:41Z</dc:date>
    <item>
      <title>How to use serverless clusters in DAB deployments with Unity Catalog in private network?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-use-serverless-clusters-in-dab-deployments-with-unity/m-p/138977#M51067</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Hi everyone,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I’m deploying Jobs and Pipelines using &lt;STRONG&gt;Databricks Asset Bundles (DAB)&lt;/STRONG&gt; in an Azure Databricks workspace configured with &lt;STRONG&gt;private networking&lt;/STRONG&gt;. I’m trying to use &lt;STRONG&gt;serverless compute&lt;/STRONG&gt; for some workloads, but I’m running into issues when Unity Catalog-backed storage accounts are also in private networks.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Here’s what I’ve done so far:&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;&lt;SPAN&gt;Defined the cluster in databricks.yml using the compute block with mode: serverless&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;SPAN&gt;The workspace is configured with Unity Catalog, and the backing storage accounts (like sysdlh) are also private&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;SPAN&gt;During deployment, the cluster fails to access the Unity Catalog storage paths&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;SPAN&gt;&lt;STRONG&gt;Questions:&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;&lt;SPAN&gt;&lt;STRONG&gt;Where exactly should I define serverless compute in the &lt;/STRONG&gt;databricks.yml&lt;STRONG&gt;?&lt;/STRONG&gt; Is it under resources.jobs.job_name.compute or in a shared compute block?&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;SPAN&gt;&lt;STRONG&gt;Are there known limitations when using serverless clusters in private network setups?&lt;/STRONG&gt; Does serverless compute require public access to Unity Catalog storage accounts?&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;SPAN&gt;&lt;STRONG&gt;Is there a recommended workaround to use serverless clusters with Unity Catalog in private environments?&lt;/STRONG&gt; Should I switch to shared or single-user clusters with VNet injection instead?&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;SPAN&gt;&lt;STRONG&gt;Any best practices for configuring DAB deployments with Unity Catalog + private networking?&lt;/STRONG&gt; Especially for ensuring cluster access to storage paths during job execution.&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;SPAN&gt;Would appreciate any insights or examples from others who’ve tackled this setup!&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 13 Nov 2025 22:03:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-use-serverless-clusters-in-dab-deployments-with-unity/m-p/138977#M51067</guid>
      <dc:creator>Charansai</dc:creator>
      <dc:date>2025-11-13T22:03:34Z</dc:date>
    </item>
    <item>
      <title>Re: How to use serverless clusters in DAB deployments with Unity Catalog in private network?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-use-serverless-clusters-in-dab-deployments-with-unity/m-p/139010#M51070</link>
      <description>&lt;P&gt;A lot of questions &lt;span class="lia-unicode-emoji" title=":grinning_face:"&gt;😀&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Concerning the use of &lt;STRONG&gt;serverless clusters&lt;/STRONG&gt; in &lt;STRONG&gt;databricks.yml&lt;/STRONG&gt;, and assuming you are using those clusters in jobs, you must define them in the job definition. Take a look here: &lt;A href="https://github.com/databricks/bundle-examples/tree/main/knowledge_base/serverless_job" target="_blank"&gt;https://github.com/databricks/bundle-examples/tree/main/knowledge_base/serverless_job&lt;/A&gt; Notice that there is no explicit reference to an "existing all-purpose" or classic "jobs compute" cluster.&lt;/P&gt;&lt;P&gt;Concerning the configuration needed to &lt;STRONG&gt;access your private storage accounts backing Unity Catalog managed tables&lt;/STRONG&gt;, you must configure the firewall on those storage accounts to allow access from serverless compute. Otherwise, serverless clusters are not allowed to reach them. This applies equally to serverless clusters for jobs and notebooks and to serverless SQL warehouses. Take a look here: &lt;A href="https://docs.databricks.com/aws/en/security/network/serverless-network-security/serverless-firewall" target="_blank"&gt;https://docs.databricks.com/aws/en/security/network/serverless-network-security/serverless-firewall&lt;/A&gt; (that link is the AWS version of the page; since you are on Azure, check the equivalent page in the Azure Databricks documentation).&lt;/P&gt;&lt;P&gt;If you switch to all-purpose or jobs compute, there are pros and cons. I really like that serverless compute starts workloads very quickly; jobs compute does not, as it takes minutes to start. In my case, that delay is completely unacceptable, so I use all-purpose clusters that are already running and/or serverless compute, depending on the type of job workload. Concerning pros and cons, there is a lot to talk about. I'm not going to copy/paste content from ChatGPT xDD Take a look here: &lt;A href="https://docs.databricks.com/gcp/en/compute/choose-compute" target="_blank"&gt;https://docs.databricks.com/gcp/en/compute/choose-compute&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I would recommend that you use service principals to run DAB from CI/CD pipelines, or even manually via the Databricks CLI while you are learning how to use it.&lt;/P&gt;</description>
      <pubDate>Fri, 14 Nov 2025 08:16:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-use-serverless-clusters-in-dab-deployments-with-unity/m-p/139010#M51070</guid>
      <dc:creator>Coffee77</dc:creator>
      <dc:date>2025-11-14T08:16:41Z</dc:date>
    </item>
  </channel>
</rss>

