<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2) in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28529#M20313</link>
    <description>&lt;P&gt;@Kaniz Fatma​&amp;nbsp; could you maybe involve a Databricks expert?&lt;/P&gt;</description>
    <pubDate>Fri, 04 Mar 2022 15:51:13 GMT</pubDate>
    <dc:creator>MartinB</dc:creator>
    <dc:date>2022-03-04T15:51:13Z</dc:date>
    <item>
      <title>Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28523#M20307</link>
      <description>&lt;P&gt;Imagine the following setup:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have log files stored as JSON files &lt;B&gt;partitioned &lt;/B&gt;by year, month, day and hour in &lt;B&gt;physical folders&lt;/B&gt;:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;"""
/logs
|-- year=2020
|-- year=2021
`-- year=2022
    |-- month=01
    `-- month=02
        |-- day=01
        |-- day=...
        `-- day=13
            |-- hour=0000
            |-- hour=...
            `-- hour=0900
                |-- log000001.json
                |-- &amp;lt;many files&amp;gt;
                `-- log000133.json
""""&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;Spark supports partition discovery for folder structures like this ("All built-in file sources (including Text/CSV/JSON/ORC/Parquet) are able to discover and infer partitioning information automatically" &lt;A href="https://spark.apache.org/docs/latest/sql-data-sources-parquet.html#partition-discovery)" target="test_blank"&gt;https://spark.apache.org/docs/latest/sql-data-sources-parquet.html#partition-discovery)&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;However, in contrast to PARQUET files, I found that Spark does not uses this meta information for &lt;B&gt;partition pruning&lt;/B&gt; / partition elimination &lt;B&gt;when reading JSON files&lt;/B&gt;.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In my use case I am only interested in logs from a specific time window (see &lt;I&gt;filter&lt;/I&gt;&lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt;&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;(spark
  .read
  .format('json')
  .load('/logs')
  .filter('year=2022 AND month=02 AND day=13 AND hour=0900')
)&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;I'd expect that Spark would be able to &lt;B&gt;apply the filters on the partition columns&lt;/B&gt; "early" and only scan folders matching the filters (e.g. Spark would not need to scan folders and read files under '/logs/year=2020').&lt;/P&gt;&lt;P&gt;However, in practice the execution of my query takes a lot of time. It looks to me as if Spark &lt;B&gt;scans first the whole filesystem&lt;/B&gt; starting at '/logs' reads all files and then applies the filters (on the already read data). Due to the nested folder structure and the large number of folders/files this is very expensive.&lt;/P&gt;&lt;P&gt;Apparently Spark does not push down the filter (applies partition pruning / partition elimination).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;For me it is weird that the behavior for processing JSON files differs from Parquet.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;&lt;U&gt;Is this &lt;/U&gt;&lt;I&gt;&lt;U&gt;as-designed&lt;/U&gt;&lt;/I&gt;&lt;U&gt; or a &lt;/U&gt;&lt;I&gt;&lt;U&gt;bug&lt;/U&gt;&lt;/I&gt;&lt;U&gt;?&lt;/U&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;For now, I ended up implementing partition pruning myself in a pre-processing step by using &lt;B&gt;dbutils.fs.ls&lt;/B&gt; for scanning the "right" folders iteratively and assembling an explicit file list that I then pass on to the spark read command.&lt;/P&gt;</description>
      <pubDate>Sun, 13 Feb 2022 15:59:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28523#M20307</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-02-13T15:59:14Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28524#M20308</link>
      <description>&lt;P&gt;Instead of nested directories, could you try single level partition and have you partition names as `year_month_day_hour` (assuming that you have your JSON files in hour directory only). In that way spark knows in one shot which partition it has to look at.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Querying could be expensive if your JSON files are very small in size (in KBs probably).&lt;/P&gt;&lt;P&gt;Maybe check the file sizes and instead of having log files per hour, you would be better off by having them partitioned by per day.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Last, maybe try querying using col function. Not sure if it'll help, but worth giving a try.&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;from pyspark.sql.functions import col
&amp;nbsp;
spark
      .read
      .format('json')
      .load('/logs')
      .filter( (col('year')=2022) &amp;amp; (col('month')=02) &amp;amp; (col('day')=13)  &amp;amp; (col('hour')=0900'))&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 13 Feb 2022 22:16:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28524#M20308</guid>
      <dc:creator>AmanSehgal</dc:creator>
      <dc:date>2022-02-13T22:16:01Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28525#M20309</link>
      <description>&lt;P&gt;Hi @Aman Sehgal​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;thanks for your advice.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Unfortunately I have no influence on the partitioning of the data, I'm just a consumer &lt;span class="lia-unicode-emoji" title=":persevering_face:"&gt;😣&lt;/span&gt; &lt;/P&gt;&lt;P&gt;Anyhow, I'd like to know why you think that Spark would be able to apply partition elimination if there would be just &lt;B&gt;one &lt;/B&gt;partitioning level.&lt;/P&gt;&lt;P&gt;Imagine there would be data of 3 years, this would mean, that there would be 3*365*24=26,280 folders under \logs. As far as I can tell, Spark would still discover all those directories and load all found JSON files to memory before applying the filter.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Or are you suggesting determining the right folder manually and then loading from the correct folder?&lt;/P&gt;&lt;P&gt;This would be "manual" partition elimination, in my opinion.&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;(spark
  .read
  .format('json')
  .load('/logs/2022_02_13_0900')
)&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I also tried using the &lt;B&gt;col &lt;/B&gt;function in the &lt;B&gt;filter&lt;/B&gt;. Unfortunately it had no performance impact over specifying the filter als "SQL condition string". &lt;span class="lia-unicode-emoji" title=":worried_face:"&gt;😟&lt;/span&gt; &lt;/P&gt;</description>
      <pubDate>Mon, 14 Feb 2022 19:19:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28525#M20309</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-02-14T19:19:41Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28526#M20310</link>
      <description>&lt;P&gt;Spark wouldn't discover all the directories. It'll straightaway go to partition value.&lt;/P&gt;&lt;P&gt;Could you give more proof of your hypothesis? Like spark logs or DAG?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My next guess would be that the files in log files are small in size. Could you check that and post the file size in the final partition?&lt;/P&gt;</description>
      <pubDate>Tue, 15 Feb 2022 14:10:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28526#M20310</guid>
      <dc:creator>AmanSehgal</dc:creator>
      <dc:date>2022-02-15T14:10:50Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28527#M20311</link>
      <description>&lt;P&gt;I created a test setup. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I generated lots of exemplary rows simulating log entries; for 90 days, each day with 600,000 log entries (=54,000,000 rows). Each entry has a log timestamp. I created another column for "binning" all entries in the nearest "5 minute" window.&lt;/P&gt;&lt;P&gt;I saved this data frame as JSON, partitioned by the 5min-timestamp column.&lt;/P&gt;&lt;P&gt;So I ended up with 12,960&amp;nbsp;folders containing each one JSON file.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Then I tried:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;(
    spark
    .read
    .format('json')
    .load(f'{path_base_folder}/some_timestamp_bin_05m=2022-03-23 18%3A00%3A00')
    .explain(extended=True)
)&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2087i5CEA2B6CB5F7756C/image-size/large?v=v2&amp;amp;px=999" role="button" title="image" alt="image" /&gt;&lt;/span&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As well as&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;(
    spark
    .read
    .format('json')
    .load(f'{path_base_folder}')
    .filter( F.col('some_timestamp_bin_05m')=="2022-03-23 18:00:00")
    .explain(extended=True)
)&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2091i1EB8361870A27F47/image-size/large?v=v2&amp;amp;px=999" role="button" title="image" alt="image" /&gt;&lt;/span&gt;As you can see, from the "Input Size" metric, the second statement actually &lt;B&gt;read all files&lt;/B&gt; and then applied the filter.&lt;/P&gt;&lt;P&gt;Interestingly, prior to the read job, two more jobs are carried out; to scan the file system:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2089i9340215DC4FC479C/image-size/large?v=v2&amp;amp;px=999" role="button" title="image" alt="image" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 15 Feb 2022 18:31:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28527#M20311</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-02-15T18:31:44Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28528#M20312</link>
      <description>&lt;P&gt;Any thoughts, @Aman Sehgal​&amp;nbsp;?&lt;/P&gt;</description>
      <pubDate>Mon, 28 Feb 2022 16:55:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28528#M20312</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-02-28T16:55:25Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28529#M20313</link>
      <description>&lt;P&gt;@Kaniz Fatma​&amp;nbsp; could you maybe involve a Databricks expert?&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2022 15:51:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28529#M20313</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-03-04T15:51:13Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28531#M20315</link>
      <description>&lt;P&gt;At this stage the only thing I can think of is the file format -JSON. Since you've the test setup can you write all the data in Parquet format or Delta format and then run the query?&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2022 20:32:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28531#M20315</guid>
      <dc:creator>AmanSehgal</dc:creator>
      <dc:date>2022-03-04T20:32:34Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28532#M20316</link>
      <description>&lt;P&gt;Hi @Aman Sehgal​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Yes, when I change the test setup to Parquet format, the partition elimination works.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;This is my original question: &lt;B&gt;Is the partition elimination &lt;I&gt;supposed &lt;/I&gt;to work &lt;U&gt;only &lt;/U&gt;for &lt;U&gt;Parquet / ORC&lt;/U&gt; or also for JSON&lt;/B&gt;?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My findings show that it does not work for JSON - is this a bug or a not supported feature?&lt;/P&gt;</description>
      <pubDate>Sat, 05 Mar 2022 10:45:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28532#M20316</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-03-05T10:45:47Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28535#M20319</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Yes, I'd like to open a feature request.&lt;/P&gt;&lt;P&gt;However, I cannot access the &lt;A href="https://ideas.databricks.com/" alt="https://ideas.databricks.com/" target="_blank"&gt;ideas portal&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;I'm just a Community Edition user and don't have a&lt;B&gt; workspace domain&lt;/B&gt;...&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2086i9E17863BA0681DD0/image-size/large?v=v2&amp;amp;px=999" role="button" title="image" alt="image" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Mar 2022 15:58:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28535#M20319</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-03-11T15:58:04Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28537#M20321</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Are you positive that the ideas portal should work for &lt;A href="https://community.cloud.databricks.com/" alt="https://community.cloud.databricks.com/" target="_blank"&gt;&lt;B&gt;&lt;U&gt;Community Edition&lt;/U&gt;&lt;/B&gt;&lt;/A&gt; users?&lt;/P&gt;&lt;P&gt;When I try to log into the ideas portal using "community" workspace I always get an error message:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2099i89DA979960C6FA73/image-size/large?v=v2&amp;amp;px=999" role="button" title="image" alt="image" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Mar 2022 16:55:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28537#M20321</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-03-11T16:55:00Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28538#M20322</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma​&amp;nbsp;, any updates?&lt;/P&gt;</description>
      <pubDate>Thu, 17 Mar 2022 17:30:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28538#M20322</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-03-17T17:30:59Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28540#M20324</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Yes, this is what I did - and I end up with the same error every time after the login:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2093i4C74D1C6128D1D25/image-size/large?v=v2&amp;amp;px=999" role="button" title="image" alt="image" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 23 Mar 2022 17:14:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28540#M20324</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-03-23T17:14:57Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28542#M20326</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Any updates on this one?&lt;/P&gt;</description>
      <pubDate>Fri, 15 Apr 2022 10:42:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28542#M20326</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-04-15T10:42:18Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28544#M20328</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Yes I did. This time no more error is displayed as before.&lt;/P&gt;&lt;P&gt;But following your link &lt;A href="https://databricks.com/feedback" target="test_blank"&gt;https://databricks.com/feedback&lt;/A&gt; I end up on the landing page in my community workspace; I had expected a &lt;B&gt;&lt;I&gt;feedback portal&lt;/I&gt;&lt;/B&gt;.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I my workspace, under "help" there is another "feedback" button:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2110i65714D2E4BD2DD77/image-size/large?v=v2&amp;amp;px=999" role="button" title="image" alt="image" /&gt;&lt;/span&gt;But this is just a mailto- link for the address &lt;B&gt;&lt;U&gt;feedback@databricks.com&lt;/U&gt;&lt;/B&gt;.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is this the intended way to make feature requests?&lt;/P&gt;</description>
      <pubDate>Mon, 18 Apr 2022 12:49:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28544#M20328</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-04-18T12:49:50Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28546#M20330</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;When I try to access &lt;A href="https://ideas.databricks.com/" target="test_blank"&gt;https://ideas.databricks.com/&lt;/A&gt; the situation just as I described a month ago: after the login an error is displayed:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image.png"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2098i5F5A73F2A1FB79DF/image-size/large?v=v2&amp;amp;px=999" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;Last month, I had the understanding that you are going to check, why that is the case. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt; &lt;B&gt;&lt;I&gt;&lt;U&gt;Are you positive that &lt;/U&gt;&lt;/I&gt;&lt;/B&gt;&lt;A href="https://databricks.com/product/faq/community-edition" alt="https://databricks.com/product/faq/community-edition" target="_blank"&gt;&lt;B&gt;&lt;I&gt;&lt;U&gt;Databricks community edition&lt;/U&gt;&lt;/I&gt;&lt;/B&gt;&lt;/A&gt;&lt;B&gt;&lt;I&gt;&lt;U&gt; (=free) users are allowed to access the ideas portal?&lt;/U&gt;&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 18 Apr 2022 20:02:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28546#M20330</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-04-18T20:02:28Z</dc:date>
    </item>
    <item>
      <title>Re: Does partition pruning / partition elimination not work for folder partitioned JSON files? (Spark 3.1.2)</title>
      <link>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28548#M20332</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;That's unfortunate; but thanks for the answer.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Apr 2022 17:47:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/does-partition-pruning-partition-elimination-not-work-for-folder/m-p/28548#M20332</guid>
      <dc:creator>MartinB</dc:creator>
      <dc:date>2022-04-21T17:47:54Z</dc:date>
    </item>
  </channel>
</rss>

