<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Which file size is better 1 GB file size in target or 128 MB or lesser than that in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/which-file-size-is-better-1-gb-file-size-in-target-or-128-mb-or/m-p/23227#M1316</link>
    <description>&lt;P&gt;If data is primarily appended to the Delta table and reads outnumber writes, larger files (1 GB) are ideal.&lt;/P&gt;&lt;P&gt;However, if your Delta table undergoes frequent upserts/merges, files smaller than the default 1 GB can improve MERGE performance, because less data has to be rewritten each time.&lt;/P&gt;&lt;P&gt;Check out file size auto-tuning for MERGE as well.&lt;/P&gt;</description>
    <pubDate>Wed, 23 Jun 2021 05:35:26 GMT</pubDate>
    <dc:creator>sajith_appukutt</dc:creator>
    <dc:date>2021-06-23T05:35:26Z</dc:date>
    <item>
      <title>Which file size is better 1 GB file size in target or 128 MB or lesser than that</title>
      <link>https://community.databricks.com/t5/machine-learning/which-file-size-is-better-1-gb-file-size-in-target-or-128-mb-or/m-p/23226#M1315</link>
      <description>&lt;P&gt;Which target file size is better: 1 GB, 128 MB, or something smaller? I would also like to understand the concept behind the choice.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Jun 2021 15:05:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/which-file-size-is-better-1-gb-file-size-in-target-or-128-mb-or/m-p/23226#M1315</guid>
      <dc:creator>User16826994223</dc:creator>
      <dc:date>2021-06-17T15:05:07Z</dc:date>
    </item>
    <item>
      <title>Re: Which file size is better 1 GB file size in target or 128 MB or lesser than that</title>
      <link>https://community.databricks.com/t5/machine-learning/which-file-size-is-better-1-gb-file-size-in-target-or-128-mb-or/m-p/23227#M1316</link>
      <description>&lt;P&gt;If data is primarily appended to the Delta table and reads outnumber writes, larger files (1 GB) are ideal.&lt;/P&gt;&lt;P&gt;However, if your Delta table undergoes frequent upserts/merges, files smaller than the default 1 GB can improve MERGE performance, because less data has to be rewritten each time.&lt;/P&gt;&lt;P&gt;Check out file size auto-tuning for MERGE as well.&lt;/P&gt;</description>
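      <!--
The reply above can be sketched as a small helper that builds the table-property statement for either approach. This is only an illustration, not the poster's code: the function name, the table name `events`, and the `128mb` value are hypothetical, and the two property names (`delta.targetFileSize`, `delta.tuneFileSizesForRewrites`) are the ones I understand Databricks to document for file size tuning, so check them against the docs for your runtime.

```python
# Sketch only: builds the SQL text; on Databricks you would pass the
# result to spark.sql(...). Nothing here talks to a cluster.
# Assumption: table name and size values are illustrative placeholders.

def file_size_tuning_sql(table, target_file_size=None, tune_for_rewrites=False):
    """Return an ALTER TABLE statement that sets Delta file-size properties."""
    props = {}
    if target_file_size is not None:
        # Explicit target size, e.g. "128mb" for a merge-heavy table.
        props["delta.targetFileSize"] = target_file_size
    if tune_for_rewrites:
        # Let the platform choose smaller files for frequently rewritten tables.
        props["delta.tuneFileSizesForRewrites"] = "true"
    if not props:
        raise ValueError("no file-size property requested")
    assignments = ", ".join(f"'{key}' = '{value}'" for key, value in props.items())
    return f"ALTER TABLE {table} SET TBLPROPERTIES ({assignments})"


if __name__ == "__main__":
    # Merge-heavy table: ask for smaller files explicitly.
    print(file_size_tuning_sql("events", target_file_size="128mb"))
    # Or opt in to auto-tuning for rewrites instead of fixing a size.
    print(file_size_tuning_sql("events", tune_for_rewrites=True))
```

For an append-mostly, read-heavy table you would typically leave both properties unset and keep the larger default, as the reply suggests.
      -->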
      <pubDate>Wed, 23 Jun 2021 05:35:26 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/which-file-size-is-better-1-gb-file-size-in-target-or-128-mb-or/m-p/23227#M1316</guid>
      <dc:creator>sajith_appukutt</dc:creator>
      <dc:date>2021-06-23T05:35:26Z</dc:date>
    </item>
  </channel>
</rss>

