<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/116032#M45228</link>
    <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/112550"&gt;@TheSmike&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In the DLT Pipeline's top right cornor, you can click on settings and scroll down to Advanced and click on Add Configuration and give the key as `&lt;SPAN&gt;pipelines.enzyme.numberOfJoinsThreshold` and value as 5.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Hope this helps.&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 21 Apr 2025 07:28:24 GMT</pubDate>
    <dc:creator>RiyazAliM</dc:creator>
    <dc:date>2025-04-21T07:28:24Z</dc:date>
    <item>
      <title>Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED?</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/100761#M40410</link>
      <description>&lt;P&gt;I have a very basic view with 3 inner joins that will only do a full refresh. Is there a limit to the number of joins you can have and still get an incremental refresh?&lt;/P&gt;&lt;P&gt;"incrementalization_issues": [&lt;BR /&gt;{&lt;BR /&gt;"issue_type": "INCREMENTAL_PLAN_REJECTED_BY_COST_MODEL",&lt;BR /&gt;"prevent_incrementalization": true,&lt;BR /&gt;"cost_model_rejection_subtype": "NUM_JOINS_THRESHOLD_EXCEEDED"&lt;BR /&gt;}&lt;BR /&gt;]&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Tue, 03 Dec 2024 13:43:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/100761#M40410</guid>
      <dc:creator>GregTyndall</dc:creator>
      <dc:date>2024-12-03T13:43:50Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED?</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/100918#M40474</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/117038"&gt;@GregTyndall&lt;/a&gt;&amp;nbsp;We have a limit of 2 joins&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Dec 2024 13:50:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/100918#M40474</guid>
      <dc:creator>MuthuLakshmi</dc:creator>
      <dc:date>2024-12-04T13:50:44Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED?</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/100928#M40477</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/117038"&gt;@GregTyndall&lt;/a&gt;&amp;nbsp;Yes, the current limit is 2 by default. But we can increase up to 5 with the below flag added to the pipeline settings.&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;pipelines.enzyme.numberOfJoinsThreshold 5&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Dec 2024 14:07:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/100928#M40477</guid>
      <dc:creator>PotnuruSiva</dc:creator>
      <dc:date>2024-12-04T14:07:17Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED?</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/115775#M45182</link>
      <description>&lt;P&gt;I have the same issue.&lt;/P&gt;&lt;P&gt;What do you mean exactly with "&lt;SPAN&gt;added to the &lt;STRONG&gt;pipeline settings&lt;/STRONG&gt;&lt;/SPAN&gt;"? How can I set it?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 17 Apr 2025 15:52:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/115775#M45182</guid>
      <dc:creator>TheSmike</dc:creator>
      <dc:date>2025-04-17T15:52:10Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED?</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/116032#M45228</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/112550"&gt;@TheSmike&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In the DLT Pipeline's top right cornor, you can click on settings and scroll down to Advanced and click on Add Configuration and give the key as `&lt;SPAN&gt;pipelines.enzyme.numberOfJoinsThreshold` and value as 5.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Hope this helps.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Apr 2025 07:28:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/116032#M45228</guid>
      <dc:creator>RiyazAliM</dc:creator>
      <dc:date>2025-04-21T07:28:24Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED?</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/116439#M45314</link>
      <description>&lt;P&gt;Thanks, it works.&lt;/P&gt;</description>
      <pubDate>Thu, 24 Apr 2025 07:50:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/116439#M45314</guid>
      <dc:creator>TheSmike</dc:creator>
      <dc:date>2025-04-24T07:50:08Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED?</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/119213#M45818</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/117038"&gt;@GregTyndall&lt;/a&gt;- how did you get those level of details (incrementalization_issues) for the MV build?&lt;/P&gt;</description>
      <pubDate>Wed, 14 May 2025 15:56:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/119213#M45818</guid>
      <dc:creator>_DatabricksUser</dc:creator>
      <dc:date>2025-05-14T15:56:06Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED?</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/120337#M46143</link>
      <description>&lt;P&gt;To determine which refresh strategy is being used (incremental vs full), refer to the final section of the documentation: &lt;A href="https://docs.databricks.com/aws/en/optimizations/incremental-refresh#determine-the-refresh-type-of-an-update" target="_blank"&gt;https://docs.databricks.com/aws/en/optimizations/incremental-refresh#determine-the-refresh-type-of-an-update&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;According to the docs:&lt;/P&gt;&lt;P&gt;To determine the technique used, query the DLT event log where the "event_type" is "planning_information"...&lt;/P&gt;&lt;P&gt;Note:&lt;BR /&gt;There's a typo in the official documentation. To query the event log correctly, use FROM event_log_[NORMALIZED_DLT_ID]. Where [NORMALIZED_DLT_ID] is your pipeline ID with "_" instead of "-".&lt;/P&gt;&lt;P&gt;Look at the "details" column — if a **full refresh** is triggered, it often contains helpful insights into the reason.&lt;/P&gt;</description>
      <pubDate>Tue, 27 May 2025 15:29:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/120337#M46143</guid>
      <dc:creator>TheSmike</dc:creator>
      <dc:date>2025-05-27T15:29:43Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED?</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/122535#M46804</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/60544"&gt;@PotnuruSiva&lt;/a&gt;&amp;nbsp;I set&amp;nbsp;&lt;SPAN&gt;pipelines.enzyme.numberOfJoinsThreshold 5 for a MV with 4 joins. But still I am getting&lt;BR /&gt;"incrementalization_issues": [&lt;BR /&gt;{&lt;BR /&gt;"issue_type": "INCREMENTAL_PLAN_REJECTED_BY_COST_MODEL",&lt;BR /&gt;"prevent_incrementalization": true,&lt;BR /&gt;"cost_model_rejection_subtype": "NUM_JOINS_THRESHOLD_EXCEEDED"&lt;BR /&gt;}&lt;BR /&gt;]&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jun 2025 12:54:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/122535#M46804</guid>
      <dc:creator>maarko</dc:creator>
      <dc:date>2025-06-23T12:54:53Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized View Refresh - NUM_JOINS_THRESHOLD_EXCEEDED?</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/122539#M46806</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/171023"&gt;@maarko&lt;/a&gt;- I'd create a separate thread for your issue. But I can tell you though that I had the same problem- the Enzyme number of joins threshold was seemingly not being respected. I reported to Databricks support and they transferred the ticket to the Databricks Spark team to investigate. Currently they do not have any answer for me.&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jun 2025 13:21:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-refresh-num-joins-threshold-exceeded/m-p/122539#M46806</guid>
      <dc:creator>_DatabricksUser</dc:creator>
      <dc:date>2025-06-23T13:21:04Z</dc:date>
    </item>
  </channel>
</rss>

