<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Dynamic Partition Pruning override in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/dynamic-partition-pruning-override/m-p/10959#M6001</link>
    <description>&lt;P&gt;Hello everybody,&lt;/P&gt;&lt;P&gt;I have run into another strange issue and would like to confirm whether it is a bug or expected behavior:&lt;/P&gt;&lt;P&gt;I'm joining a large dataset with a dimension table and, as expected, DPP is activated.&lt;/P&gt;&lt;P&gt;I was trying to deactivate the feature because it changes the partitions that are read, so I disabled it through spark.sql.optimizer.dynamicPartitionPruning.enabled and spark.databricks.optimizer.dynamicPartitionPruning, but I STILL got dynamic partition pruning.&lt;/P&gt;&lt;P&gt;Finally, I discovered that by raising spark.databricks.optimizer.deltaTableFilesThreshold to a large number, my SQL query stopped using DPP.&lt;/P&gt;&lt;P&gt;Is this behavior expected? I would say no, given the DPP documentation &lt;A href="https://docs.databricks.com/delta/optimizations/dynamic-file-pruning.html?_ga=2.82269526.1572747842.1636542199-1650800278.1636126378" alt="https://docs.databricks.com/delta/optimizations/dynamic-file-pruning.html?_ga=2.82269526.1572747842.1636542199-1650800278.1636126378" target="_blank"&gt;&lt;U&gt;here&lt;/U&gt;&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;Tested on both DBR 9.1 and 10.&lt;/P&gt;</description>
    <pubDate>Thu, 11 Nov 2021 16:00:31 GMT</pubDate>
    <dc:creator>pantelis_mare</dc:creator>
    <dc:date>2021-11-11T16:00:31Z</dc:date>
    <item>
      <title>Dynamic Partition Pruning override</title>
      <link>https://community.databricks.com/t5/data-engineering/dynamic-partition-pruning-override/m-p/10959#M6001</link>
      <description>&lt;P&gt;Hello everybody,&lt;/P&gt;&lt;P&gt;I have run into another strange issue and would like to confirm whether it is a bug or expected behavior:&lt;/P&gt;&lt;P&gt;I'm joining a large dataset with a dimension table and, as expected, DPP is activated.&lt;/P&gt;&lt;P&gt;I was trying to deactivate the feature because it changes the partitions that are read, so I disabled it through spark.sql.optimizer.dynamicPartitionPruning.enabled and spark.databricks.optimizer.dynamicPartitionPruning, but I STILL got dynamic partition pruning.&lt;/P&gt;&lt;P&gt;Finally, I discovered that by raising spark.databricks.optimizer.deltaTableFilesThreshold to a large number, my SQL query stopped using DPP.&lt;/P&gt;&lt;P&gt;Is this behavior expected? I would say no, given the DPP documentation &lt;A href="https://docs.databricks.com/delta/optimizations/dynamic-file-pruning.html?_ga=2.82269526.1572747842.1636542199-1650800278.1636126378" alt="https://docs.databricks.com/delta/optimizations/dynamic-file-pruning.html?_ga=2.82269526.1572747842.1636542199-1650800278.1636126378" target="_blank"&gt;&lt;U&gt;here&lt;/U&gt;&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;Tested on both DBR 9.1 and 10.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Nov 2021 16:00:31 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dynamic-partition-pruning-override/m-p/10959#M6001</guid>
      <dc:creator>pantelis_mare</dc:creator>
      <dc:date>2021-11-11T16:00:31Z</dc:date>
    </item>
    <item>
      <title>Re: Dynamic Partition Pruning override</title>
      <link>https://community.databricks.com/t5/data-engineering/dynamic-partition-pruning-override/m-p/10960#M6002</link>
      <description>&lt;P&gt;Hello there! Thanks for your question. I'd like to give this a bit longer to see what the community comes up with. Otherwise, I'll bump this to the SMEs.&lt;/P&gt;</description>
      <pubDate>Fri, 12 Nov 2021 16:48:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dynamic-partition-pruning-override/m-p/10960#M6002</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-11-12T16:48:55Z</dc:date>
    </item>
    <item>
      <title>Re: Dynamic Partition Pruning override</title>
      <link>https://community.databricks.com/t5/data-engineering/dynamic-partition-pruning-override/m-p/10962#M6004</link>
      <description>&lt;P&gt;Hello @Kaniz Fatma,&lt;/P&gt;&lt;P&gt;Thank you for taking the time to answer.&lt;/P&gt;&lt;P&gt;The issue in this case was that spark.databricks.optimizer.deltaTableFilesThreshold was activating DPP even though it had been formally deactivated by setting all available "enabled" properties to false (see my initial post).&lt;/P&gt;</description>
      <pubDate>Mon, 22 Nov 2021 15:45:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dynamic-partition-pruning-override/m-p/10962#M6004</guid>
      <dc:creator>pantelis_mare</dc:creator>
      <dc:date>2021-11-22T15:45:02Z</dc:date>
    </item>
  </channel>
</rss>

