<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Refreshing DELTA external table in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/refreshing-delta-external-table/m-p/92967#M38595</link>
    <description>&lt;P&gt;Step 3: Insert the Data; don't add it directly to the S3 folder.&lt;/P&gt;
&lt;P&gt;Once it's converted to Delta, it maintains the transaction log. Inserting a Parquet file (followed by another convert /refresh) won't work, as the rest of the dataset is already Delta.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 07 Oct 2024 12:52:58 GMT</pubDate>
    <dc:creator>gchandra</dc:creator>
    <dc:date>2024-10-07T12:52:58Z</dc:date>
    <item>
      <title>Refreshing DELTA external table</title>
      <link>https://community.databricks.com/t5/data-engineering/refreshing-delta-external-table/m-p/92672#M38500</link>
      <description>&lt;P&gt;I'm having trouble with the REFRESH TABLE command - does it work with DELTA external tables?&amp;nbsp; I'm doing the following steps:&lt;/P&gt;&lt;DIV&gt;&lt;OL&gt;&lt;LI&gt;Create table: &lt;FONT face="courier new,courier"&gt;CREATE TABLE IF NOT EXISTS `catalog`.`default`.`table_name` ( &lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;KEY DOUBLE&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;, CUSTKEY DOUBLE&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;, STATUS STRING&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;, PRICE DOUBLE&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;, DATE TIMESTAMP&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;, PRIORITY STRING&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;) &lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;USING PARQUET LOCATION 's3://bucket-name/folder-name/';&lt;/FONT&gt;&lt;/LI&gt;&lt;LI&gt;&amp;nbsp;Convert to Delta: &lt;FONT face="courier new,courier"&gt;convert to delta `catalog`.`default`.`table_&lt;SPAN&gt;name`;&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/LI&gt;&lt;LI&gt;Add new parquet file to s3 folder&lt;/LI&gt;&lt;LI&gt;I tried &lt;FONT face="courier new,courier"&gt;REFRESH TABLE&lt;/FONT&gt;&lt;SPAN&gt;, then &lt;FONT face="courier new,courier"&gt;CONVERT TO DELTA&lt;/FONT&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;again, which didn't work.&amp;nbsp; I also tried converting to delta first and then &lt;FONT face="courier new,courier"&gt;REFRESH TABLE&lt;/FONT&gt;, and I can't get the new file recognized.&lt;/SPAN&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;I can't get the new file to show up in the created delta table without dropping and recreating the external table - is &lt;FONT face="courier new,courier"&gt;REFRESH TABLE&lt;/FONT&gt; supposed to work for DELTA external tables? Is there another order of operations I need to do to get the new file to be recognized in the existing delta external table?&lt;/DIV&gt;</description>
      <pubDate>Thu, 03 Oct 2024 16:17:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/refreshing-delta-external-table/m-p/92672#M38500</guid>
      <dc:creator>turtleXturtle</dc:creator>
      <dc:date>2024-10-03T16:17:45Z</dc:date>
    </item>
    <item>
      <title>Re: Refreshing DELTA external table</title>
      <link>https://community.databricks.com/t5/data-engineering/refreshing-delta-external-table/m-p/92967#M38595</link>
      <description>&lt;P&gt;Step 3: Insert the Data; don't add it directly to the S3 folder.&lt;/P&gt;
&lt;P&gt;Once it's converted to Delta, it maintains the transaction log. Inserting a Parquet file (followed by another convert /refresh) won't work, as the rest of the dataset is already Delta.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Oct 2024 12:52:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/refreshing-delta-external-table/m-p/92967#M38595</guid>
      <dc:creator>gchandra</dc:creator>
      <dc:date>2024-10-07T12:52:58Z</dc:date>
    </item>
  </channel>
</rss>

