cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Setting up a snowflake catalog via spark config next to unity catalog

Laurens
New Contributor II

Im trying to set up a connection to Iceberg on S3 via Snowflake as described https://medium.com/snowflake/how-to-integrate-databricks-with-snowflake-managed-iceberg-tables-7a889... and https://docs.snowflake.com/en/user-guide/tables-iceberg-catalog

On a cluster with Unity Catalog disabled this works fine and I can read from the tables without connecting to a snowflake warehouse. When Unity Catalog is enabled however, the `snowflake_catalog` is nowhere to be found.

Is there a way to setup a catalog via Spark config like described above while using Unity Catalog?

Thanks in advance!

3 REPLIES 3

Kaniz_Fatma
Community Manager
Community Manager

Hi @Laurens, Integrating Iceberg tables on S3 with Snowflake can be done effectively, even when using the Unity Catalog.

Letโ€™s break down the steps to achieve this:

  1. Configure an External Volume to Amazon S3:

    • Set up an external volume that points to the location of your Iceberg tables on Amazon S3. This volume will grant Snowflake read-only access to the Iceberg data.
  2. Catalog Integration with AWS Glue:

    • Create a catalog integration with AWS Glue. Ensure that the integration has the necessary permissions to access the database containing your Iceberg tables. This step allows Snowflake to seamlessly interact with the Iceberg metadata.
  3. Create Iceberg Tables:

    • Now you can create Iceberg tables within Snowflake, using AWS Glue as the catalog. These tables will be linked to the Iceberg datasets stored in S3.

By following these steps, youโ€™ll be able to query your Iceberg datasets from Snowflake, even with Unity Catalog enabled. The integration ensures that you can work with Iceberg tables efficiently while benefiting from the power of Snowflakeโ€™s data platform. ๐ŸŒŸ

For more detailed information, you can refer to the resources you mentioned:

Feel free to explore these links for deeper insights into the process. If you have any further questions, donโ€™t hesitate to ask! ๐Ÿ˜Š

 

Laurens
New Contributor II

Hi Kaniz,

Thanks for your elaborate reply! The iceberg data that we have to interact with uses Snowflake as catalog, not glue. Can we also use a catalog integration with Snowflake?

Thanks!

Laurens
New Contributor II

Hi @Kaniz_Fatma ,

We've been working on setting up Glue as catalog, which is working fine so far. However, Glue takes place of the hive_metastore, which appears to be a legacy way of setting this up. Is the way proposed here the recommended way to set it up, or is there a more "unity catalog" way of setting it up? https://docs.databricks.com/en/archive/external-metastores/aws-glue-metastore.html

Thanks in advance,

Laurens

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group