<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Lakehouse Monitoring &amp;amp; Expectations in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/lakehouse-monitoring-amp-expectations/m-p/101843#M40857</link>
    <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/124839"&gt;@noorbasha534&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you for reaching out and for your patience with this reply; below are some of the best practices:&lt;/P&gt;
&lt;OL&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Monitor Data, Not Just Processes&lt;/STRONG&gt;: Focus on monitoring the quality of your data, not just the processes that handle it. This approach helps catch issues early in the data pipeline.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Set Expectation Rules&lt;/STRONG&gt;: Expectations can help manage data quality, especially when using Delta Live Tables (DLT). You can drop, warn, or quarantine rows that violate expectations or fail the pipeline altogether.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Leverage Unity Catalog Integration&lt;/STRONG&gt;: Since Lakehouse Monitoring is built on Unity Catalog, it can track quality alongside governance, building toward a self-serve data platform.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Utilize Automated Profiling&lt;/STRONG&gt;: Use the automated profiling feature for any Delta table in Unity Catalog to quickly identify potential issues across your entire data estate.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Implement Proactive Alerting&lt;/STRONG&gt;: Use the Expectations feature to set up notifications for quality issues as they arise, shifting from reactive to proactive monitoring.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Customize Dashboards&lt;/STRONG&gt;: Leverage Lakeview dashboard capabilities to create custom visualizations and collaborate across workspaces, teams, and stakeholders.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Monitor Throughout the Data Lifecycle&lt;/STRONG&gt;: Apply monitoring techniques at every step of the medallion architecture (bronze-silver-gold) to ensure data quality throughout the entire data lifecycle.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Leverage Custom Metrics&lt;/STRONG&gt;: Incorporate custom metrics tailored to your specific use case to gain more profound, more relevant insights, performance, and data quality.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&lt;SPAN&gt;Here are some articles and videos about Lakehouse monitoring for your review:&lt;/SPAN&gt;&lt;/P&gt;
&lt;OL&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;Navigating the Waters of Lakehouse Monitoring and Observability by eeezee (Databricks) - &lt;/SPAN&gt;&lt;A href="https://community.databricks.com/t5/technical-blog/navigating-the-waters-of-lakehouse-monitoring-and-observability/ba-p/54655" target="_blank"&gt;&lt;SPAN&gt;https://community.databricks.com/t5/technical-blog/navigating-the-waters-of-lakehouse-monitoring-and-observability/ba-p/54655&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;Ensuring Quality Forecasts with Databricks Lakehouse Monitoring by Peter Park - &lt;/SPAN&gt;&lt;A href="https://www.databricks.com/blog/ensuring-quality-forecasts-databricks-lakehouse-monitoring" target="_blank"&gt;&lt;SPAN&gt;https://www.databricks.com/blog/ensuring-quality-forecasts-databricks-lakehouse-monitoring&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;Lakehouse Monitoring GA: Profiling, Diagnosing, and Enforcing Data Quality with Intelligence -&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;OL&gt;
&lt;LI style="font-weight: 400;" aria-level="2"&gt;&lt;A href="https://www.youtube.com/watch?v=aDBPoKyA0DQ" target="_blank"&gt;&lt;SPAN&gt;https://www.youtube.com/watch?v=aDBPoKyA0DQ&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="2"&gt;&lt;A href="https://www.databricks.com/blog/lakehouse-monitoring-ga-profiling-diagnosing-and-enforcing-data-quality-intelligence" target="_blank"&gt;&lt;SPAN&gt;https://www.databricks.com/blog/lakehouse-monitoring-ga-profiling-diagnosing-and-enforcing-data-quality-intelligence&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;/OL&gt;
&lt;P&gt;I hope this helps!&lt;/P&gt;</description>
    <pubDate>Thu, 12 Dec 2024 00:40:36 GMT</pubDate>
    <dc:creator>mmayorga</dc:creator>
    <dc:date>2024-12-12T00:40:36Z</dc:date>
    <item>
      <title>Lakehouse Monitoring &amp; Expectations</title>
      <link>https://community.databricks.com/t5/data-engineering/lakehouse-monitoring-amp-expectations/m-p/92851#M38559</link>
      <description>&lt;P&gt;Dears&lt;/P&gt;&lt;P&gt;Has anyone successfully used at scale the lakehouse monitoring &amp;amp; expectations features together to measure data quality of data tables - example, to conduct freshness checks, consistency checks etc.&lt;/P&gt;&lt;P&gt;Appreciate if you could share the lessons learnt, best practices. I would have a need to execute several consistency checks against several tables, and have those red/green lights be updated against them ((as shown during the annual conference &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; ).&lt;/P&gt;</description>
      <pubDate>Sat, 05 Oct 2024 19:35:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/lakehouse-monitoring-amp-expectations/m-p/92851#M38559</guid>
      <dc:creator>noorbasha534</dc:creator>
      <dc:date>2024-10-05T19:35:21Z</dc:date>
    </item>
    <item>
      <title>Re: Lakehouse Monitoring &amp; Expectations</title>
      <link>https://community.databricks.com/t5/data-engineering/lakehouse-monitoring-amp-expectations/m-p/101843#M40857</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/124839"&gt;@noorbasha534&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you for reaching out and for your patience with this reply; below are some of the best practices:&lt;/P&gt;
&lt;OL&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Monitor Data, Not Just Processes&lt;/STRONG&gt;: Focus on monitoring the quality of your data, not just the processes that handle it. This approach helps catch issues early in the data pipeline.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Set Expectation Rules&lt;/STRONG&gt;: Expectations can help manage data quality, especially when using Delta Live Tables (DLT). You can drop, warn, or quarantine rows that violate expectations or fail the pipeline altogether.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Leverage Unity Catalog Integration&lt;/STRONG&gt;: Since Lakehouse Monitoring is built on Unity Catalog, it can track quality alongside governance, building toward a self-serve data platform.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Utilize Automated Profiling&lt;/STRONG&gt;: Use the automated profiling feature for any Delta table in Unity Catalog to quickly identify potential issues across your entire data estate.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Implement Proactive Alerting&lt;/STRONG&gt;: Use the Expectations feature to set up notifications for quality issues as they arise, shifting from reactive to proactive monitoring.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Customize Dashboards&lt;/STRONG&gt;: Leverage Lakeview dashboard capabilities to create custom visualizations and collaborate across workspaces, teams, and stakeholders.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Monitor Throughout the Data Lifecycle&lt;/STRONG&gt;: Apply monitoring techniques at every step of the medallion architecture (bronze-silver-gold) to ensure data quality throughout the entire data lifecycle.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Leverage Custom Metrics&lt;/STRONG&gt;: Incorporate custom metrics tailored to your specific use case to gain more profound, more relevant insights, performance, and data quality.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&lt;SPAN&gt;Here are some articles and videos about Lakehouse monitoring for your review:&lt;/SPAN&gt;&lt;/P&gt;
&lt;OL&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;Navigating the Waters of Lakehouse Monitoring and Observability by eeezee (Databricks) - &lt;/SPAN&gt;&lt;A href="https://community.databricks.com/t5/technical-blog/navigating-the-waters-of-lakehouse-monitoring-and-observability/ba-p/54655" target="_blank"&gt;&lt;SPAN&gt;https://community.databricks.com/t5/technical-blog/navigating-the-waters-of-lakehouse-monitoring-and-observability/ba-p/54655&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;Ensuring Quality Forecasts with Databricks Lakehouse Monitoring by Peter Park - &lt;/SPAN&gt;&lt;A href="https://www.databricks.com/blog/ensuring-quality-forecasts-databricks-lakehouse-monitoring" target="_blank"&gt;&lt;SPAN&gt;https://www.databricks.com/blog/ensuring-quality-forecasts-databricks-lakehouse-monitoring&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;Lakehouse Monitoring GA: Profiling, Diagnosing, and Enforcing Data Quality with Intelligence -&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;OL&gt;
&lt;LI style="font-weight: 400;" aria-level="2"&gt;&lt;A href="https://www.youtube.com/watch?v=aDBPoKyA0DQ" target="_blank"&gt;&lt;SPAN&gt;https://www.youtube.com/watch?v=aDBPoKyA0DQ&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="2"&gt;&lt;A href="https://www.databricks.com/blog/lakehouse-monitoring-ga-profiling-diagnosing-and-enforcing-data-quality-intelligence" target="_blank"&gt;&lt;SPAN&gt;https://www.databricks.com/blog/lakehouse-monitoring-ga-profiling-diagnosing-and-enforcing-data-quality-intelligence&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;/OL&gt;
&lt;P&gt;I hope this helps!&lt;/P&gt;</description>
      <pubDate>Thu, 12 Dec 2024 00:40:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/lakehouse-monitoring-amp-expectations/m-p/101843#M40857</guid>
      <dc:creator>mmayorga</dc:creator>
      <dc:date>2024-12-12T00:40:36Z</dc:date>
    </item>
    <item>
      <title>Re: Lakehouse Monitoring &amp; Expectations</title>
      <link>https://community.databricks.com/t5/data-engineering/lakehouse-monitoring-amp-expectations/m-p/107676#M42884</link>
      <description>&lt;P&gt;Not sure if you are still looking for the same. Here is a medium article -&amp;nbsp;&lt;A href="https://piethein.medium.com/data-quality-within-lakehouses-0c9417ce0487" target="_blank" rel="noopener"&gt;https://piethein.medium.com/data-quality-within-lakehouses-0c9417ce0487&lt;/A&gt;&amp;nbsp; that you can see the detailed implementation&lt;/P&gt;</description>
      <pubDate>Thu, 30 Jan 2025 00:19:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/lakehouse-monitoring-amp-expectations/m-p/107676#M42884</guid>
      <dc:creator>Satyadeepak</dc:creator>
      <dc:date>2025-01-30T00:19:36Z</dc:date>
    </item>
  </channel>
</rss>

