<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Unity Catalog Shared compute Issues in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/unity-catalog-shared-compute-issues/m-p/74599#M34762</link>
    <description>&lt;P&gt;Am I the only one experiencing challenges in migrating to Databricks Unity Catalog? I observed that in Unity Catalog-enabled compute, the "Shared" access mode is still tagged as a Preview feature. This means it is not yet safe for use in production workloads. Having a compute resource that can be shared in production is crucial because various developers and service principals need to be able to execute queries on the cluster. I'm wondering how others are working around this issue since it is a major blocker to effectively migrating all workloads to Unity Catalog.&lt;/P&gt;&lt;P&gt;Additionally, when I tested my code using the Shared access mode compute, I noticed that it gets stuck when trying to read a file stored in an external location into a data frame..&lt;BR /&gt;Watch this video for the demo of the issue&amp;nbsp;&lt;A href="https://www.youtube.com/watch?v=J1bn6P7elKI&amp;amp;ab_channel=AfroInfoTech" target="_blank"&gt;https://www.youtube.com/watch?v=J1bn6P7elKI&amp;amp;ab_channel=AfroInfoTech&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 17 Jun 2024 12:13:25 GMT</pubDate>
    <dc:creator>Zume</dc:creator>
    <dc:date>2024-06-17T12:13:25Z</dc:date>
    <item>
      <title>Unity Catalog Shared compute Issues</title>
      <link>https://community.databricks.com/t5/data-engineering/unity-catalog-shared-compute-issues/m-p/74599#M34762</link>
      <description>&lt;P&gt;Am I the only one experiencing challenges in migrating to Databricks Unity Catalog? I observed that in Unity Catalog-enabled compute, the "Shared" access mode is still tagged as a Preview feature. This means it is not yet safe for use in production workloads. Having a compute resource that can be shared in production is crucial because various developers and service principals need to be able to execute queries on the cluster. I'm wondering how others are working around this issue since it is a major blocker to effectively migrating all workloads to Unity Catalog.&lt;/P&gt;&lt;P&gt;Additionally, when I tested my code using the Shared access mode compute, I noticed that it gets stuck when trying to read a file stored in an external location into a data frame..&lt;BR /&gt;Watch this video for the demo of the issue&amp;nbsp;&lt;A href="https://www.youtube.com/watch?v=J1bn6P7elKI&amp;amp;ab_channel=AfroInfoTech" target="_blank"&gt;https://www.youtube.com/watch?v=J1bn6P7elKI&amp;amp;ab_channel=AfroInfoTech&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 17 Jun 2024 12:13:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/unity-catalog-shared-compute-issues/m-p/74599#M34762</guid>
      <dc:creator>Zume</dc:creator>
      <dc:date>2024-06-17T12:13:25Z</dc:date>
    </item>
    <item>
      <title>Re: Unity Catalog Shared compute Issues</title>
      <link>https://community.databricks.com/t5/data-engineering/unity-catalog-shared-compute-issues/m-p/75792#M35061</link>
      <description>&lt;P&gt;Have you tried creating a volume on top of the external location, and using the volume in spark.read.parquet?&lt;BR /&gt;&lt;BR /&gt;i.e.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;spark.read.parquet('/Volumes/&amp;lt;volume_name&amp;gt;/&amp;lt;folder_name&amp;gt;/&amp;lt;file_name.parquet&amp;gt;')&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Edit: also, not sure why the Databricks community manager here said Shared access mode is "in preview" and that its "&lt;SPAN&gt;not recommended for production workloads", because this is completely false. It is not in preview and completely safe for production workloads. It has been for almost 2 years. The only thing in preview for shared access mode clusters right now are scala workloads.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 26 Jun 2024 08:17:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/unity-catalog-shared-compute-issues/m-p/75792#M35061</guid>
      <dc:creator>jacovangelder</dc:creator>
      <dc:date>2024-06-26T08:17:39Z</dc:date>
    </item>
  </channel>
</rss>

