<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Potential Challenges of Using Iceberg Format (Databricks + Iceberg) in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/119445#M45880</link>
    <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hi Team,&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;What are the potential challenges of using Iceberg format instead of Delta for saving data in databricks?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Phani&lt;/P&gt;</description>
    <pubDate>Fri, 16 May 2025 10:45:52 GMT</pubDate>
    <dc:creator>Phani1</dc:creator>
    <dc:date>2025-05-16T10:45:52Z</dc:date>
    <item>
      <title>Potential Challenges of Using Iceberg Format (Databricks + Iceberg)</title>
      <link>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/119445#M45880</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hi Team,&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;What are the potential challenges of using Iceberg format instead of Delta for saving data in databricks?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Phani&lt;/P&gt;</description>
      <pubDate>Fri, 16 May 2025 10:45:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/119445#M45880</guid>
      <dc:creator>Phani1</dc:creator>
      <dc:date>2025-05-16T10:45:52Z</dc:date>
    </item>
    <item>
      <title>Re: Potential Challenges of Using Iceberg Format (Databricks + Iceberg)</title>
      <link>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/119487#M45889</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/36892"&gt;@Phani1&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;From my understanding, Delta Lake tends to perform better when it comes to frequent data modifications and operations. It also integrates more seamlessly with various Databricks features and tools compared to Iceberg.&lt;/P&gt;</description>
      <pubDate>Fri, 16 May 2025 15:06:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/119487#M45889</guid>
      <dc:creator>SP_6721</dc:creator>
      <dc:date>2025-05-16T15:06:11Z</dc:date>
    </item>
    <item>
      <title>Re: Potential Challenges of Using Iceberg Format (Databricks + Iceberg)</title>
      <link>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/119495#M45892</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/36892"&gt;@Phani1&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Using Apache Iceberg instead of Delta Lake for saving data in Databricks can unlock cross-platform compatibility but comes with several potential challenges,&lt;BR /&gt;especially within the Databricks ecosystem which is natively optimized for Delta Lake.&lt;/P&gt;&lt;P&gt;Key Challenges When Using Iceberg Instead of Delta in Databricks&lt;BR /&gt;&lt;STRONG&gt;1. Feature Parity and Platform Support&lt;/STRONG&gt;&lt;BR /&gt;Delta Lake is first-class in Databricks with full support for advanced features like:&lt;BR /&gt;- Time travel, schema evolution, Z-ordering&lt;BR /&gt;- Change data capture (CDC)&lt;BR /&gt;- Unity Catalog lineage, data masking, constraints&lt;BR /&gt;Iceberg support is newer and more limited in Databricks:&lt;BR /&gt;- Not all Delta-native features are supported yet.&lt;BR /&gt;- CDC and Z-order are not currently available for Iceberg.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;2. Performance Optimization Limitations&lt;/STRONG&gt;&lt;BR /&gt;- Delta benefits from Databricks-specific optimizations (e.g., Photon, dynamic file pruning, OPTIMIZE/ZORDER).&lt;BR /&gt;- Iceberg tables may see slower query performance due to:&lt;BR /&gt;Lack of automatic file compaction&lt;BR /&gt;Weaker runtime query optimizations on Databricks&lt;BR /&gt;Inability to leverage features like Delta caching&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;3. Limited Write &amp;amp; Maintenance Commands&lt;/STRONG&gt;&lt;BR /&gt;- Supports commands like:&lt;BR /&gt;OPTIMIZE, VACUUM, MERGE INTO, DELETE&lt;BR /&gt;- Iceberg in Databricks only partially supports these, and some may behave differently or&lt;BR /&gt;be missing altogether depending on the workspace version and configuration.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;4. Unity Catalog Constraints&lt;/STRONG&gt;&lt;BR /&gt;-As of mid-2025:&lt;BR /&gt;1. Iceberg tables must reside in Unity Catalog to be fully supported.&lt;BR /&gt;2. There may be limited support for fine-grained governance, lineage, or row-level security vs Delta tables.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;5. Tooling and Compatibility&lt;/STRONG&gt;&lt;BR /&gt;- While Iceberg is designed for multi-engine interoperability (e.g., Trino, Snowflake, Flink), in Databricks pipelines,&lt;BR /&gt;many tools (e.g., MLflow, Auto Loader, Streaming) still assume Delta tables.&lt;BR /&gt;- Using Iceberg can break expectations in Delta-native workflows, such as:&lt;BR /&gt;- Streaming reads/writes&lt;BR /&gt;- ML feature store integrations&lt;BR /&gt;- Delta Sharing&lt;BR /&gt;&lt;STRONG&gt;6. Fewer Validations &amp;amp; API Support&lt;/STRONG&gt;&lt;BR /&gt;- Spark APIs in Databricks are richer and more stable for Delta.&lt;BR /&gt;- Iceberg operations may not support full schema enforcement, constraints, or write audit capabilities.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;When to Consider Iceberg&lt;/STRONG&gt;&lt;BR /&gt;- Use Apache Iceberg only if:&lt;BR /&gt;- You need interoperability across multiple engines (e.g., Trino, Presto, Snowflake).&lt;BR /&gt;- You have an enterprise data lake architecture that mandates open table formats beyond Spark.&lt;BR /&gt;- You're ready to invest in managing performance manually (compaction, snapshot cleanup, etc.).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 May 2025 15:25:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/119495#M45892</guid>
      <dc:creator>lingareddy_Alva</dc:creator>
      <dc:date>2025-05-16T15:25:30Z</dc:date>
    </item>
    <item>
      <title>Re: Potential Challenges of Using Iceberg Format (Databricks + Iceberg)</title>
      <link>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/124330#M47155</link>
      <description>&lt;P&gt;Late to the party but also noteworthy to mention you&amp;nbsp;&lt;EM&gt;cannot&lt;/EM&gt; apply masking policies on iceberg tables, atleast in what I found testing&lt;/P&gt;</description>
      <pubDate>Mon, 07 Jul 2025 14:10:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/124330#M47155</guid>
      <dc:creator>andreapeterson</dc:creator>
      <dc:date>2025-07-07T14:10:40Z</dc:date>
    </item>
    <item>
      <title>Re: Potential Challenges of Using Iceberg Format (Databricks + Iceberg)</title>
      <link>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/124353#M47163</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/36892"&gt;@Phani1&lt;/a&gt;&amp;nbsp;, Please find the below link which details out maintaining icerberg metadata along with delta metadata.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.databricks.com/t5/technical-blog/read-delta-tables-with-snowflake-via-unity-catalog/ba-p/115877" target="_blank"&gt;https://community.databricks.com/t5/technical-blog/read-delta-tables-with-snowflake-via-unity-catalog/ba-p/115877&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Jul 2025 19:06:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/potential-challenges-of-using-iceberg-format-databricks-iceberg/m-p/124353#M47163</guid>
      <dc:creator>sridharplv</dc:creator>
      <dc:date>2025-07-07T19:06:16Z</dc:date>
    </item>
  </channel>
</rss>

