12-05-2023 02:40 PM
Hello all,
We are building a data warehouse on Unity Catalog and we use the SHALLOW CLONE command to allow folks to spin up their own dev environments by light copying the prod tables.
We also started using Liquid Clustering on our feature tables, though we are running into an error when trying to shallow clone these tables. It seems CLONE command is not allowed for tables that have been liquid clustered.
We'd like to know if this was a temporary restriction / is it in the roadmap at some point in the future to allow a cloning operations on tables that have been liquid clustered ?
Exception :
UnsupportedOperationException: CLONE is not supported for Delta table with Liquid clustering.
Runtime : 13.3
Thanks !
Seydou
12-07-2023 03:15 PM
Thanks Kaniz for your reply. I was able to get it make it work using runtime 14.0.
Regards,
12-05-2023 09:58 PM
Hi @seydouHR, The SHALLOW CLONE command in Unity Catalog allows you to create lightweight copies of production tables for development purposes. These clones share metadata with the original tables but do not duplicate the underlying data files, which helps save storage space. It’s a handy feature for creating dev environments.
Now, let’s address the issue you’re encountering with Liquid Clustering. Liquid clustering is a powerful feature available in Delta Lake that simplifies data layout decisions and optimizes query performance. It allows you to redefine clustering keys without rewriting existing data, which is beneficial for evolving analytic needs over time.
However, there are some important considerations:
Liquid Clustering Benefits:
Enabling Liquid Clustering:
Important Notes:
Limitations:
In summary, while liquid clustering offers significant benefits, it currently restricts the use of the CLONE command. Keep an eye on updates from Databricks for any changes in this regard! 🚀🔍
12-07-2023 03:15 PM
Thanks Kaniz for your reply. I was able to get it make it work using runtime 14.0.
Regards,
03-02-2024 07:48 PM
03-03-2024 05:46 AM
Hi @Wolfoflag ,
Yes, whithin Databricks I was able to shallow clone using runtime 14.0.
Also, outside Databricks, it worked when using the databricks sdk with a SQL Warehouse of version 2023.50 or above.
Example with the sdk:
Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections.
Click here to register and join today!
Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.