<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Unable to create Iceberg tables pointing to data in S3 and run queries against the tables. in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/105099#M9474</link>
    <description>&lt;P&gt;It looks like Databricks making things difficult to use iceberg tables. There is no clear online documentation or steps provided to use with plain spark &amp;amp; spark sql,&amp;nbsp; and the errors thrown in the Databricks environment are very cryptic.&lt;/P&gt;&lt;P&gt;They wanted to make things difficult for the customers&lt;/P&gt;</description>
    <pubDate>Fri, 10 Jan 2025 02:33:42 GMT</pubDate>
    <dc:creator>Venkat5</dc:creator>
    <dc:date>2025-01-10T02:33:42Z</dc:date>
    <item>
      <title>Unable to create Iceberg tables pointing to data in S3 and run queries against the tables.</title>
      <link>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/58224#M9472</link>
      <description>&lt;P&gt;I need to&amp;nbsp;to set up &lt;STRONG&gt;Iceberg&lt;/STRONG&gt; tables in Databricks environment, but the data resides in an S3 bucket. Then read these tables by running SQL queries.&lt;/P&gt;&lt;P&gt;Databricks environment has access to S3. This is done by&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;setting up the access by mapping the &lt;STRONG&gt;Instance Profile&lt;/STRONG&gt; to the &lt;STRONG&gt;compute cluster&lt;/STRONG&gt;&lt;/LI&gt;&lt;LI&gt;AWS access key and secret key are used to connect via Spark code.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Note: Unity catalog has been enabled in our environment.&lt;/P&gt;&lt;P&gt;Access to S3 from databricks environment was tested by copying from S3 into DBFS. This operation was successful.&lt;/P&gt;&lt;P&gt;Tried to create Iceberg tables in running SQL commands&amp;nbsp; from SQL Editor and from Databricks notebook environment by running Python code and calling &lt;STRONG&gt;spark.sql()&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;However, we were unsuccessful in setting up Icebergs.&lt;/P&gt;&lt;P&gt;When PySpark code was run to create iceberg table by providing the location of S3 and access key and secret key, encountered an error “Data source format iceberg is not supported in Unit Catalog” See below screenshot.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="JohnsonBDSouza_0-1705982713662.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/5958i43BB6C484FFE4BF3/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="JohnsonBDSouza_0-1705982713662.png" alt="JohnsonBDSouza_0-1705982713662.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;When the code was run against Hive metastore&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="JohnsonBDSouza_1-1705982713665.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/5960i85208CF618C18D6E/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="JohnsonBDSouza_1-1705982713665.png" alt="JohnsonBDSouza_1-1705982713665.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I got a java exception “Iceberg is not valid Spark SQL data source”&lt;/P&gt;&lt;P&gt;Also, we tried iceberg and apache-iceber Python packages. That did not work as well.&lt;/P&gt;&lt;P&gt;Tried many things from various tech foruns including Demio and Community.databricks.com, but in vain.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;References used:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.dremio.com/blog/getting-started-with-apache-iceberg-in-databricks/" target="_blank"&gt;https://www.dremio.com/blog/getting-started-with-apache-iceberg-in-databricks/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.databricks.com/t5/data-engineering/reading-iceberg-table-present-in-s3-from-databricks-console/td-p/4640" target="_blank"&gt;https://community.databricks.com/t5/data-engineering/reading-iceberg-table-present-in-s3-from-databricks-console/td-p/4640&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Cluster configurations:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="JohnsonBDSouza_2-1705982713667.jpeg" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/5959iAC7CB63A942515A9/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="JohnsonBDSouza_2-1705982713667.jpeg" alt="JohnsonBDSouza_2-1705982713667.jpeg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="JohnsonBDSouza_3-1705982713676.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/5961i93C37E0DF6EDA326/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="JohnsonBDSouza_3-1705982713676.png" alt="JohnsonBDSouza_3-1705982713676.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;What support I need from Databricks community&lt;/STRONG&gt;&lt;STRONG&gt;?&lt;/STRONG&gt;&lt;STRONG&gt;&amp;nbsp;&lt;/STRONG&gt;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Detailed and specific steps to create Iceberg table and point to data in S3 via SQL or Pyspark code.&lt;/LI&gt;&lt;LI&gt;List of libraries to attach to &lt;STRONG&gt;Compute resource&lt;/STRONG&gt;, Spark variables and Environment variables to set.&lt;/LI&gt;&lt;LI&gt;Configuration required on &lt;STRONG&gt;SQL Compute resource&lt;/STRONG&gt;&lt;/LI&gt;&lt;LI&gt;List of Python libraries required and location of repository.&lt;/LI&gt;&lt;/OL&gt;</description>
      <pubDate>Tue, 23 Jan 2024 04:08:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/58224#M9472</guid>
      <dc:creator>JohnsonBDSouza</dc:creator>
      <dc:date>2024-01-23T04:08:30Z</dc:date>
    </item>
    <item>
      <title>Re: Unable to create Iceberg tables pointing to data in S3 and run queries against the tables.</title>
      <link>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/58454#M9473</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/98583"&gt;@JohnsonBDSouza&lt;/a&gt;&amp;nbsp; - could you please let me know if you had a chance to review the Uniform feature that allows to create iceberg tables from the delta format.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Based on what i could understand from the above, you can create a delta table and use the below example&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;CREATE TABLE T
TBLPROPERTIES(
  'delta.columnMapping.mode' = 'name',
  'delta.universalFormat.enabledFormats' = 'iceberg')
AS
  SELECT * FROM source_table;&lt;/LI-CODE&gt;
&lt;P&gt;Please refer to the documentation on pre-requisites, configs to use and limitations associated with using uniform &lt;A href="https://docs.databricks.com/en/delta/uniform.html" target="_blank"&gt;https://docs.databricks.com/en/delta/uniform.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jan 2024 20:06:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/58454#M9473</guid>
      <dc:creator>shan_chandra</dc:creator>
      <dc:date>2024-01-25T20:06:17Z</dc:date>
    </item>
    <item>
      <title>Re: Unable to create Iceberg tables pointing to data in S3 and run queries against the tables.</title>
      <link>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/105099#M9474</link>
      <description>&lt;P&gt;It looks like Databricks making things difficult to use iceberg tables. There is no clear online documentation or steps provided to use with plain spark &amp;amp; spark sql,&amp;nbsp; and the errors thrown in the Databricks environment are very cryptic.&lt;/P&gt;&lt;P&gt;They wanted to make things difficult for the customers&lt;/P&gt;</description>
      <pubDate>Fri, 10 Jan 2025 02:33:42 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/105099#M9474</guid>
      <dc:creator>Venkat5</dc:creator>
      <dc:date>2025-01-10T02:33:42Z</dc:date>
    </item>
    <item>
      <title>Re: Unable to create Iceberg tables pointing to data in S3 and run queries against the tables.</title>
      <link>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/125697#M10382</link>
      <description>&lt;P&gt;Hi @&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;A class="" href="https://community.databricks.com/t5/user/viewprofilepage/user-id/616" target="_self"&gt;&lt;SPAN class=""&gt;shan_chandra&lt;/SPAN&gt;&lt;/A&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;While I am doing a POC on databricks Iceberg Format ,The below code is working Fine for me&amp;nbsp;&lt;BR /&gt;In databricks Notebook&lt;BR /&gt;&lt;BR /&gt;%sql&lt;BR /&gt;CREATE TABLE genai_demo.default.iceberg_table2&lt;BR /&gt;TBLPROPERTIES(&lt;BR /&gt;'delta.columnMapping.mode' = 'name',&lt;BR /&gt;'delta.universalFormat.enabledFormats' = 'iceberg',&lt;BR /&gt;'delta.enableIcebergCompatV2' = 'true')&lt;BR /&gt;AS&lt;BR /&gt;SELECT * FROM catalog.db.table;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Details which I used are :&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Databricks Cluster of 16.4LT&lt;/LI&gt;&lt;LI&gt;Iceberg jar which need to compactible&amp;nbsp;&lt;BR /&gt;org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.2&amp;nbsp;&lt;/LI&gt;&lt;LI&gt;The catalog need to be MANAGED type This is Mandatory (Both read and write need to MANAGED Type )&amp;nbsp;&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Fri, 18 Jul 2025 09:55:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/125697#M10382</guid>
      <dc:creator>PujithaKarnati</dc:creator>
      <dc:date>2025-07-18T09:55:59Z</dc:date>
    </item>
    <item>
      <title>Re: Unable to create Iceberg tables pointing to data in S3 and run queries against the tables.</title>
      <link>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/125760#M10387</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/98583"&gt;@JohnsonBDSouza&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/157693"&gt;@PujithaKarnati&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/142126"&gt;@Venkat5&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;There are 3 concepts to use Iceberg format in databricks based on recent updates in DAIS 2025.&lt;/P&gt;&lt;P&gt;1) Managed Iceberg tables 2) Foreign Iceberg tables 3) Enabling Iceberg reads on delta tables. Please refer below links for detailed explanation and reference slides also.&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;A href="https://www.databricks.com/dataaisummit/session/databricks-apache-icebergtm-managed-and-foreign-tables-unity-catalog" target="_blank"&gt;https://www.databricks.com/dataaisummit/session/databricks-apache-icebergtm-managed-and-foreign-tables-unity-catalog&lt;/A&gt;&lt;/LI&gt;&lt;LI&gt;&lt;A href="https://docs.databricks.com/aws/en/iceberg/" target="_blank"&gt;https://docs.databricks.com/aws/en/iceberg/&lt;/A&gt;&lt;/LI&gt;&lt;LI&gt;&lt;A href="https://www.databricks.com/blog/announcing-full-apache-iceberg-support-databricks" target="_blank"&gt;https://www.databricks.com/blog/announcing-full-apache-iceberg-support-databricks&lt;/A&gt;&lt;/LI&gt;&lt;LI&gt;&lt;A href="https://community.databricks.com/t5/technical-blog/read-delta-tables-with-snowflake-via-unity-catalog/ba-p/115877" target="_blank"&gt;https://community.databricks.com/t5/technical-blog/read-delta-tables-with-snowflake-via-unity-catalog/ba-p/115877&lt;/A&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Sat, 19 Jul 2025 17:32:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/unable-to-create-iceberg-tables-pointing-to-data-in-s3-and-run/m-p/125760#M10387</guid>
      <dc:creator>sridharplv</dc:creator>
      <dc:date>2025-07-19T17:32:09Z</dc:date>
    </item>
  </channel>
</rss>

