<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Unity Catalog in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/unity-catalog/m-p/96442#M39279</link>
    <description>&lt;P&gt;Unfortunately, as of now there is no direct integration between Unity Catalog and Athena that automatically synchronizes table updates.&lt;/P&gt;&lt;P&gt;However, here are a few approaches that can get you close:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;1. AWS Glue Data Catalog:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Manual Synchronization:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Create a Glue Crawler to scan the S3 location where your Delta Lake tables are stored.&lt;/LI&gt;&lt;LI&gt;Schedule the crawler so that it updates the Glue Data Catalog periodically.&lt;/LI&gt;&lt;LI&gt;Athena reads its table definitions from the Glue Data Catalog, so it sees the latest definitions after each crawler run.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Semi-Automated Synchronization:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Use a script (for example, in Python or Scala) to trigger the Glue Crawler whenever tables change in Unity Catalog.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;2. Databricks Delta Sharing:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Share Delta Tables:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Share your Delta tables from Databricks with external users or applications.&lt;/LI&gt;&lt;LI&gt;Configure Athena to access these shared Delta tables. To my knowledge Athena has no built-in Delta Sharing client, so check that a suitable connector exists for your setup first.&lt;/LI&gt;&lt;LI&gt;This approach can be more seamless, but it requires careful management of access controls and data security.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;3. Custom Connectors or APIs:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Develop a Custom Connector:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Build a custom connector to integrate Athena with Databricks Unity Catalog.&lt;/LI&gt;&lt;LI&gt;This approach requires significant development effort and may not be feasible for all use cases.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Use APIs:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Use the Databricks REST API to retrieve metadata about tables and schemas from Unity Catalog.&lt;/LI&gt;&lt;LI&gt;This metadata can then be used to update the Glue Data Catalog or to create custom Athena views.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;</description>
    <pubDate>Mon, 28 Oct 2024 09:58:35 GMT</pubDate>
    <dc:creator>ArunKhandelwal</dc:creator>
    <dc:date>2024-10-28T09:58:35Z</dc:date>
    <item>
      <title>Unity Catalog</title>
      <link>https://community.databricks.com/t5/data-engineering/unity-catalog/m-p/96432#M39278</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I have a requirement to set up Athena tables. We have Unity Catalog set up in our Databricks workspace, and I would like to know whether Athena can be pointed at Unity Catalog so that all the tables are available in Athena.&lt;/P&gt;&lt;P&gt;Ideally, whenever we create or update a table in Unity Catalog, the change would be reflected in AWS Athena as well.&lt;/P&gt;&lt;P&gt;I'm interested in the available options.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Mon, 28 Oct 2024 09:33:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/unity-catalog/m-p/96432#M39278</guid>
      <dc:creator>Bhuvnesh</dc:creator>
      <dc:date>2024-10-28T09:33:54Z</dc:date>
    </item>
    <item>
      <title>Re: Unity Catalog</title>
      <link>https://community.databricks.com/t5/data-engineering/unity-catalog/m-p/96442#M39279</link>
      <description>&lt;P&gt;Unfortunately, as of now there is no direct integration between Unity Catalog and Athena that automatically synchronizes table updates.&lt;/P&gt;&lt;P&gt;However, here are a few approaches that can get you close:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;1. AWS Glue Data Catalog:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Manual Synchronization:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Create a Glue Crawler to scan the S3 location where your Delta Lake tables are stored.&lt;/LI&gt;&lt;LI&gt;Schedule the crawler so that it updates the Glue Data Catalog periodically.&lt;/LI&gt;&lt;LI&gt;Athena reads its table definitions from the Glue Data Catalog, so it sees the latest definitions after each crawler run.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Semi-Automated Synchronization:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Use a script (for example, in Python or Scala) to trigger the Glue Crawler whenever tables change in Unity Catalog.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;2. Databricks Delta Sharing:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Share Delta Tables:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Share your Delta tables from Databricks with external users or applications.&lt;/LI&gt;&lt;LI&gt;Configure Athena to access these shared Delta tables. To my knowledge Athena has no built-in Delta Sharing client, so check that a suitable connector exists for your setup first.&lt;/LI&gt;&lt;LI&gt;This approach can be more seamless, but it requires careful management of access controls and data security.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;3. Custom Connectors or APIs:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Develop a Custom Connector:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Build a custom connector to integrate Athena with Databricks Unity Catalog.&lt;/LI&gt;&lt;LI&gt;This approach requires significant development effort and may not be feasible for all use cases.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Use APIs:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Use the Databricks REST API to retrieve metadata about tables and schemas from Unity Catalog.&lt;/LI&gt;&lt;LI&gt;This metadata can then be used to update the Glue Data Catalog or to create custom Athena views.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;</description>
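The semi-automated route in the reply (poll Unity Catalog over REST, then trigger the Glue Crawler) might look roughly like the sketch below. It is an illustration under assumptions, not a tested integration: the function names, the `fetch` hook, and the crawler name are hypothetical, the endpoint used is the Unity Catalog "list tables" REST call, and `glue_client` is expected to behave like `boto3.client("glue")`. It only notices tables being added or removed, not column-level schema changes.

```python
import json
import urllib.parse
import urllib.request


def _http_get_json(url, headers):
    """Default fetcher: plain HTTPS GET that returns the parsed JSON body."""
    request = urllib.request.Request(url, headers=headers)
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def list_uc_tables(host, token, catalog, schema, fetch=None):
    """Return the full names of all tables in one Unity Catalog schema.

    Calls GET /api/2.1/unity-catalog/tables and follows next_page_token.
    `fetch` is a hypothetical hook, (url, headers) -> dict, so the
    function can be exercised without a real workspace.
    """
    fetch = fetch or _http_get_json
    names = []
    page_token = None
    while True:
        params = {"catalog_name": catalog, "schema_name": schema}
        if page_token:
            params["page_token"] = page_token
        query = urllib.parse.urlencode(params)
        url = f"{host}/api/2.1/unity-catalog/tables?{query}"
        body = fetch(url, {"Authorization": f"Bearer {token}"})
        names.extend(table["full_name"] for table in body.get("tables", []))
        page_token = body.get("next_page_token")
        if not page_token:
            return names


def trigger_crawler_if_changed(current, previous, glue_client, crawler_name):
    """Start the Glue crawler only when the set of table names has changed.

    Returns True if a crawler run was requested, False otherwise.
    """
    if set(current) == set(previous):
        return False
    # start_crawler is the boto3 Glue client call; it raises
    # CrawlerRunningException if a run is already in progress.
    glue_client.start_crawler(Name=crawler_name)
    return True
```

In practice this would run on a schedule (a Databricks job, for example), keeping the previous list of table names somewhere durable between runs and passing `boto3.client("glue")` as `glue_client`.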
      <pubDate>Mon, 28 Oct 2024 09:58:35 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/unity-catalog/m-p/96442#M39279</guid>
      <dc:creator>ArunKhandelwal</dc:creator>
      <dc:date>2024-10-28T09:58:35Z</dc:date>
    </item>
  </channel>
</rss>

