<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Creating external tables using gzipped CSV file - S3 URI without extensions in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/creating-external-tables-using-gzipped-csv-file-s3-uri-without/m-p/68449#M33682</link>
    <description>&lt;P&gt;Hey&amp;nbsp;&amp;nbsp;, thanks for your response.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;I tried using a Serde(I think the&amp;nbsp;OpenCSVSerde&lt;SPAN&gt;&amp;nbsp;should work for me) but unfortunately im getting the below from the Unity Catalog:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;[UC_DATASOURCE_NOT_SUPPORTED] Data source format hive is not supported in Unity Catalog. SQLSTATE: 0AKUC&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Can you please suggest any other workarounds for the above?&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 07 May 2024 12:44:12 GMT</pubDate>
    <dc:creator>AdityaM</dc:creator>
    <dc:date>2024-05-07T12:44:12Z</dc:date>
    <item>
      <title>Creating external tables using gzipped CSV file - S3 URI without extensions</title>
      <link>https://community.databricks.com/t5/data-engineering/creating-external-tables-using-gzipped-csv-file-s3-uri-without/m-p/67745#M33431</link>
      <description>&lt;P&gt;Hi Databricks community,&lt;/P&gt;&lt;P&gt;Hope you are doing well.&lt;BR /&gt;I am trying to create an external table using a Gzipped CSV file uploaded to an S3 bucket.&lt;BR /&gt;The S3 URI of the resource doesn't have any file extensions, but the content of the file is a Gzipped comma separated file that I want to read into the External Table.&lt;BR /&gt;&lt;BR /&gt;The command I'm using is:&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;CREATE&lt;/SPAN&gt; &lt;SPAN&gt;EXTERNAL&lt;/SPAN&gt; &lt;SPAN&gt;TABLE&lt;/SPAN&gt; &lt;SPAN&gt;`mycatalog`&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;`myExternalTable`&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;`ID`&lt;/SPAN&gt; &lt;SPAN&gt;STRING&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;`value`&lt;/SPAN&gt; &lt;SPAN&gt;STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;USING&lt;/SPAN&gt;&lt;SPAN&gt; CSV &lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;OPTIONS&lt;/SPAN&gt;&lt;SPAN&gt; (&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; PATH &lt;/SPAN&gt;&lt;SPAN&gt;'s3://mybucket/filename'&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; HEADER &lt;/SPAN&gt;&lt;SPAN&gt;'false'&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; encoding &lt;/SPAN&gt;&lt;SPAN&gt;'UTF-8'&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; compression &lt;/SPAN&gt;&lt;SPAN&gt;'gzip'&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; delimiter &lt;/SPAN&gt;&lt;SPAN&gt;','&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;);&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;If I try to create the table using that exact same file, in the same bucket, with the .gz extension, it works.&lt;BR /&gt;But without that extension, it gives me a weird jumbled output(on doing select * on the table) indicating that decompression is not happening properly.&lt;BR /&gt;Is there a way to create the table without adding any extensions to the S3 file path?&lt;BR /&gt;&lt;BR /&gt;Thanks for your time,&lt;BR /&gt;Aditya&lt;/P&gt;</description>
      <pubDate>Tue, 30 Apr 2024 20:26:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/creating-external-tables-using-gzipped-csv-file-s3-uri-without/m-p/67745#M33431</guid>
      <dc:creator>AdityaM</dc:creator>
      <dc:date>2024-04-30T20:26:10Z</dc:date>
    </item>
    <item>
      <title>Re: Creating external tables using gzipped CSV file - S3 URI without extensions</title>
      <link>https://community.databricks.com/t5/data-engineering/creating-external-tables-using-gzipped-csv-file-s3-uri-without/m-p/68449#M33682</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&amp;nbsp;, thanks for your response.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;I tried using a Serde(I think the&amp;nbsp;OpenCSVSerde&lt;SPAN&gt;&amp;nbsp;should work for me) but unfortunately im getting the below from the Unity Catalog:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;[UC_DATASOURCE_NOT_SUPPORTED] Data source format hive is not supported in Unity Catalog. SQLSTATE: 0AKUC&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Can you please suggest any other workarounds for the above?&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 07 May 2024 12:44:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/creating-external-tables-using-gzipped-csv-file-s3-uri-without/m-p/68449#M33682</guid>
      <dc:creator>AdityaM</dc:creator>
      <dc:date>2024-05-07T12:44:12Z</dc:date>
    </item>
  </channel>
</rss>

