<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Data profiling monitoring with foreign catalog in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/data-profiling-monitoring-with-foreign-catalog/m-p/134756#M50189</link>
    <description>&lt;P&gt;Hi szymon,&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Thank you for your quick response. I understand that data quality can be more complex. However, I believe that for “Data Profiling” monitoring, this approach could still be valid, as Unity Catalog generates predefined SQL queries to extract statistical and other relevant metrics and this could be done with SQL pushdowns.&lt;/P&gt;</description>
    <pubDate>Mon, 13 Oct 2025 14:03:03 GMT</pubDate>
    <dc:creator>sta_gas</dc:creator>
    <dc:date>2025-10-13T14:03:03Z</dc:date>
    <item>
      <title>Data profiling monitoring with foreign catalog</title>
      <link>https://community.databricks.com/t5/data-engineering/data-profiling-monitoring-with-foreign-catalog/m-p/134742#M50182</link>
      <description>&lt;P&gt;Hi team,&lt;/P&gt;&lt;P&gt;I’m currently working with &lt;STRONG&gt;Azure Databricks&lt;/STRONG&gt; and have created a &lt;STRONG&gt;foreign catalog&lt;/STRONG&gt; for my source database in &lt;STRONG&gt;Azure SQL&lt;/STRONG&gt;. I can successfully run SELECT statements from Databricks to the Azure SQL database.&lt;/P&gt;&lt;P&gt;However, I would like to set up &lt;STRONG&gt;data profiling monitoring&lt;/STRONG&gt; using the &lt;STRONG&gt;Quality tab&lt;/STRONG&gt;, but I’m facing limitations in terms of availability and functionality.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="sta_gas_0-1760357690503.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20698iB1133A9121446F10/image-size/medium?v=v2&amp;amp;px=400" role="button" title="sta_gas_0-1760357690503.png" alt="sta_gas_0-1760357690503.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The table t&lt;SPAN class=""&gt;ype is&amp;nbsp;&lt;/SPAN&gt;FOREIGN and the catalog type is&amp;nbsp;&lt;SPAN&gt;FOREIGN_CATALOG.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Could you please advise on the best approach or any recommended steps to enable this feature in this catalog? I acknowledge that i can create materialize views or replicate the data into managed tables on another catalog, however I would like not to replicate all the data.&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2025 12:20:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-profiling-monitoring-with-foreign-catalog/m-p/134742#M50182</guid>
      <dc:creator>sta_gas</dc:creator>
      <dc:date>2025-10-13T12:20:44Z</dc:date>
    </item>
    <item>
      <title>Re: Data profiling monitoring with foreign catalog</title>
      <link>https://community.databricks.com/t5/data-engineering/data-profiling-monitoring-with-foreign-catalog/m-p/134747#M50187</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/191286"&gt;@sta_gas&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Since data quality monitoring is in beta I'm quite sure they don't support foreign tables as of now (but they forgot to mentioned it in docs).&lt;/P&gt;&lt;P&gt;But more important question if they ever will be supported. For me data quality monitoring applies only to Delta Tables. According to docs description of how it works, we can see that they leverage delta properties to build this functionality. So I guess it won't work for foreign tables (at least there won't be the same feature parity).&lt;/P&gt;&lt;P&gt;"Databricks creates a background job that monitors tables for&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;EM&gt;freshness&lt;/EM&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;and&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;EM&gt;completeness&lt;/EM&gt;. Databricks uses smart scanning to determine when to scan tables.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Freshness&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;refers to how recently a table has been updated. Data quality monitoring analyzes the history of commits to a table and builds a per-table model to predict the time of the next commit. If a commit is unusually late, the table is marked as stale."&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2025 12:49:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-profiling-monitoring-with-foreign-catalog/m-p/134747#M50187</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-10-13T12:49:54Z</dc:date>
    </item>
    <item>
      <title>Re: Data profiling monitoring with foreign catalog</title>
      <link>https://community.databricks.com/t5/data-engineering/data-profiling-monitoring-with-foreign-catalog/m-p/134756#M50189</link>
      <description>&lt;P&gt;Hi szymon,&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Thank you for your quick response. I understand that data quality can be more complex. However, I believe that for “Data Profiling” monitoring, this approach could still be valid, as Unity Catalog generates predefined SQL queries to extract statistical and other relevant metrics and this could be done with SQL pushdowns.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2025 14:03:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-profiling-monitoring-with-foreign-catalog/m-p/134756#M50189</guid>
      <dc:creator>sta_gas</dc:creator>
      <dc:date>2025-10-13T14:03:03Z</dc:date>
    </item>
  </channel>
</rss>

