<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: how to partition my Delta table? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-to-partition-my-delta-table/m-p/26277#M18379</link>
    <description>&lt;P&gt;Partitioning depends on how the table is read and written. If reads and writes target a limited set of dates, partitioning by date helps speed up those operations.&lt;/P&gt;</description>
    <pubDate>Mon, 07 Jun 2021 18:09:45 GMT</pubDate>
    <dc:creator>User16783854357</dc:creator>
    <dc:date>2021-06-07T18:09:45Z</dc:date>
    <item>
      <title>how to partition my Delta table?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-partition-my-delta-table/m-p/26276#M18378</link>
      <description>&lt;P&gt;I would like to follow best practices when partitioning my Delta table. Should I partition by unique ID or by date?&lt;/P&gt;</description>
      <pubDate>Fri, 04 Jun 2021 19:12:26 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-partition-my-delta-table/m-p/26276#M18378</guid>
      <dc:creator>jose_gonzalez</dc:creator>
      <dc:date>2021-06-04T19:12:26Z</dc:date>
    </item>
    <item>
      <title>Re: how to partition my Delta table?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-partition-my-delta-table/m-p/26277#M18379</link>
      <description>&lt;P&gt;Partitioning depends on how the table is read and written. If reads and writes target a limited set of dates, partitioning by date helps speed up those operations.&lt;/P&gt;</description>
      <pubDate>Mon, 07 Jun 2021 18:09:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-partition-my-delta-table/m-p/26277#M18379</guid>
      <dc:creator>User16783854357</dc:creator>
      <dc:date>2021-06-07T18:09:45Z</dc:date>
    </item>
    <item>
      <title>Re: how to partition my Delta table?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-partition-my-delta-table/m-p/26278#M18380</link>
      <description>&lt;P&gt;Depending on the amount of data per partition, you may also want to consider partitioning by week, month, or quarter.&lt;/P&gt;&lt;P&gt;The partitioning decision is often tied to the tiering model of data storage. For a Bronze ingest layer, the optimal approach is to partition by some time value so that all data for a particular ingest lands in the same partition.&lt;/P&gt;&lt;P&gt;For serving data, such as that provided by the Gold tier, the optimal strategy is to partition so that queries retrieve data from the fewest partitions.&lt;/P&gt;&lt;P&gt;Depending on your consumption patterns, this may be best achieved by partitioning by time, region, or some other business criterion.&lt;/P&gt;</description>
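      <content:encoded><![CDATA[
The trade-off between day, week, month, and quarter partitions can be sketched in plain Python, without Spark or Delta. This is a minimal illustration under stated assumptions: `partition_key` and `partitions_touched` are hypothetical helper names, the partition value formats are invented for the example, and weeks are assumed to follow the ISO calendar. It only counts how many partitions a date-range query would need to read at each granularity.

```python
from datetime import date, timedelta


def partition_key(d: date, granularity: str) -> str:
    """Map a date to a hypothetical partition value at the given granularity."""
    if granularity == "day":
        return d.isoformat()
    if granularity == "week":
        # ISO calendar: weeks start on Monday and may belong to the adjacent year.
        iso_year, iso_week, _ = d.isocalendar()
        return f"{iso_year}-W{iso_week:02d}"
    if granularity == "month":
        return f"{d.year}-{d.month:02d}"
    if granularity == "quarter":
        return f"{d.year}-Q{(d.month - 1) // 3 + 1}"
    raise ValueError(f"unknown granularity: {granularity}")


def partitions_touched(start: date, end: date, granularity: str) -> set:
    """Return the distinct partition values a query over [start, end] would read."""
    days = (end - start).days
    return {
        partition_key(start + timedelta(days=i), granularity)
        for i in range(days + 1)
    }


if __name__ == "__main__":
    # Compare how many partitions a one-month query reads at each granularity.
    for g in ("day", "week", "month", "quarter"):
        n = len(partitions_touched(date(2021, 6, 1), date(2021, 6, 30), g))
        print(g, n)
```

For a query covering June 2021, this counts 30 daily partitions, 5 ISO-week partitions, and a single monthly or quarterly partition. Fewer partitions per query is the goal for a serving layer, but a coarser granularity also means more data per partition, so the right choice still depends on data volume and query patterns.
]]></content:encoded>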
      <pubDate>Mon, 07 Jun 2021 18:21:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-partition-my-delta-table/m-p/26278#M18380</guid>
      <dc:creator>RonanStokes_DB</dc:creator>
      <dc:date>2021-06-07T18:21:37Z</dc:date>
    </item>
  </channel>
</rss>

