<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Ingress/Egress private endpoint in Administration &amp; Architecture</title>
    <link>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97648#M2237</link>
    <description>&lt;P&gt;I see this article :&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/virtual-network/virtual-network-service-endpoints-overview" target="_blank"&gt;Azure virtual network service endpoints | Microsoft Learn&lt;/A&gt;&lt;/P&gt;&lt;P&gt;i'm guessing if i move from private link to Virtual network service endpoint that could be a good replacement to reduce the cost&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 04 Nov 2024 22:25:39 GMT</pubDate>
    <dc:creator>Fkebbati</dc:creator>
    <dc:date>2024-11-04T22:25:39Z</dc:date>
    <item>
      <title>Ingress/Egress private endpoint</title>
      <link>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97588#M2230</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Hello ,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;We have configured our Databricks environment with private endpoint connections injected into our VNET, which includes two subnets (public and private). We have disabled public IPs and are using Network Security Groups (NSGs) on the subnet, as suggested by Microsoft. Additionally, we have a private endpoint for our Azure Data Lake Storage account, where our tables are created, and this storage is located within the same VNET.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;We also utilize a private endpoint for authentication in a separate VNET that has been successfully peered with our main VNET. Currently, our developers are running shared or job compute clusters to create tables by transferring data from Storage Account A to Storage Account B.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;However, we are seeing many ingress traffic that we believe should not be occurring. Given that both the cluster and the boths storage accounts &amp;nbsp;are in the same VNET, my understanding is that there should be no costs associated with ingress/egress traffic between these resources.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Could you please provide guidance on whether other teams have encountered similar issues? Additionally, any insights into how we might resolve this unexpected ingress traffic would be greatly appreciated.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thank you for your assistance.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 04 Nov 2024 17:28:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97588#M2230</guid>
      <dc:creator>Fkebbati</dc:creator>
      <dc:date>2024-11-04T17:28:10Z</dc:date>
    </item>
    <item>
      <title>Re: Ingress/Egress private endpoint</title>
      <link>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97608#M2231</link>
      <description>&lt;P&gt;Are all the resources created within the same region?&amp;nbsp;If there is any cross-region traffic, even within the same VNET, it could incur additional costs.&lt;/P&gt;</description>
      <pubDate>Mon, 04 Nov 2024 20:08:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97608#M2231</guid>
      <dc:creator>Walter_C</dc:creator>
      <dc:date>2024-11-04T20:08:09Z</dc:date>
    </item>
    <item>
      <title>Re: Ingress/Egress private endpoint</title>
      <link>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97610#M2232</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/131075"&gt;@Fkebbati&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;There always be some costs related to data transfer between those account. Let's have a look at private link pricing page. So it's expected, but MS likes to hide this kind of information &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="szymon_dybczak_0-1730750844650.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/12620i1783753F7BBA247A/image-size/medium?v=v2&amp;amp;px=400" role="button" title="szymon_dybczak_0-1730750844650.png" alt="szymon_dybczak_0-1730750844650.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 04 Nov 2024 20:10:35 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97610#M2232</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-11-04T20:10:35Z</dc:date>
    </item>
    <item>
      <title>Re: Ingress/Egress private endpoint</title>
      <link>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97616#M2234</link>
      <description>&lt;P&gt;All in&amp;nbsp; same regions actually , i just ran this there 3 minutes , this job workflow to read from strorage A and create table in storage B all same vnet same region, when i sort ressource cost by job tag it classified as databricks cost ,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Fkebbati_0-1730751394430.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/12621i13CDF0BC632B3F45/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Fkebbati_0-1730751394430.png" alt="Fkebbati_0-1730751394430.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 04 Nov 2024 20:19:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97616#M2234</guid>
      <dc:creator>Fkebbati</dc:creator>
      <dc:date>2024-11-04T20:19:03Z</dc:date>
    </item>
    <item>
      <title>Re: Ingress/Egress private endpoint</title>
      <link>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97648#M2237</link>
      <description>&lt;P&gt;I see this article :&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/virtual-network/virtual-network-service-endpoints-overview" target="_blank"&gt;Azure virtual network service endpoints | Microsoft Learn&lt;/A&gt;&lt;/P&gt;&lt;P&gt;i'm guessing if i move from private link to Virtual network service endpoint that could be a good replacement to reduce the cost&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 04 Nov 2024 22:25:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97648#M2237</guid>
      <dc:creator>Fkebbati</dc:creator>
      <dc:date>2024-11-04T22:25:39Z</dc:date>
    </item>
    <item>
      <title>Re: Ingress/Egress private endpoint</title>
      <link>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97906#M2243</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/131075"&gt;@Fkebbati&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;First, traffic cost in Azure are not reported as a separate Resource Type, but appended to main resource causing the traffic. If you want to distinguish them use for instance Service Name. In this case traffic cost is appended to Databricks and not Storage Accounts.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cost for network traffic for Databricks and Storage Account with Private Endpoints is not a trivial case.&lt;/P&gt;&lt;P&gt;Simple use of Databricks clusters to read and write data over Private Endpoint incurs Inbound and Outbound cost. Those cost are not &lt;EM&gt;some&lt;/EM&gt; in volume as&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/110502"&gt;@szymon_dybczak&lt;/a&gt;&amp;nbsp;mentioned, but in my experience can double overall Databricks cost, and will scale as traffic volume changes.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also from experience Inbound cost will be much higher than Outbound cost. Circa 6 times more inbound. I imagine that Databricks/Spark makes a big overhead of data read, and to put data to worker nodes, or reading entire Delta Lake parquets. Read a lot, write some, i imagine.&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also it is worth reminding that if you do work between different peered Vnets you will be charged with peering transit Private Link cost. VMs of worker nodes simply do not have a Private Endpoint and storage account in your case do.&lt;/P&gt;&lt;P&gt;And this is all regarding transfer within a single region.&amp;nbsp;&lt;/P&gt;&lt;P&gt;If you need to use Private Endpoint you need to accept those extra transfer cost.&lt;/P&gt;&lt;P&gt;Alternative to get some security, and avoid transit cost are Service Endpoint as you mentioned.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 06 Nov 2024 10:48:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/ingress-egress-private-endpoint/m-p/97906#M2243</guid>
      <dc:creator>JakubSkibicki</dc:creator>
      <dc:date>2024-11-06T10:48:41Z</dc:date>
    </item>
  </channel>
</rss>

