Create delta files from Unity Catalog Objects
08-06-2024 08:57 AM
Hello,
I have tables in Unity Catalog that point to the raw area. From these tables I need to build a data model (facts and dimensions) that aggregates the data and applies some transformations, and then store the result in Azure Data Lake in Delta format. What script is used to do that? I'm only used to doing the reverse (creating external tables, not external sources).
Thanks in advance
08-08-2024 08:16 AM
Hi @ad_k, to create a data model from Unity Catalog tables and store it in Azure Data Lake in Delta format, use a Databricks notebook with PySpark or SQL. The process involves reading the raw data from Unity Catalog, transforming it into fact and dimension tables, and then writing the transformed data to Azure Data Lake in Delta format. A PySpark script for this follows four steps: initialize a Spark session (in a Databricks notebook, `spark` is already available), read the data from Unity Catalog, perform the necessary transformations, and save the results to Azure Data Lake in Delta format, replacing the placeholders with your Azure details.
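The steps above can be sketched roughly as follows. This is a minimal sketch rather than a definitive implementation: the table name `main.raw.sales`, the columns (`customer_id`, `customer_name`, `order_date`, `amount`), the container `gold`, and the storage account `mystorageacct` are all hypothetical placeholders, and the helper names `abfss_path`, `write_delta`, and `build_model` are invented for illustration. It also assumes the workspace already has write access to the target path (in Unity Catalog that usually means an external location covering it).

```python
def abfss_path(container, storage_account, relative_path):
    """Build an ADLS Gen2 URI (abfss scheme) for a folder in a container."""
    return (
        f"abfss://{container}@{storage_account}.dfs.core.windows.net/"
        f"{relative_path.lstrip('/')}"
    )


def write_delta(df, path, mode="overwrite"):
    """Write a DataFrame to the given path as Delta files."""
    df.write.format("delta").mode(mode).save(path)


def build_model(spark, source_table, container, storage_account):
    """Read a Unity Catalog table, derive one dimension and one fact, write both.

    All column names below are assumptions; adapt them to your raw schema.
    """
    # 1. Read the raw data through its three-level Unity Catalog name.
    raw = spark.table(source_table)

    # 2a. Dimension: one row per customer.
    dim_customer = raw.select("customer_id", "customer_name").dropDuplicates(
        ["customer_id"]
    )

    # 2b. Fact: amount summed per customer and day.
    fact_sales = (
        raw.groupBy("customer_id", "order_date")
        .agg({"amount": "sum"})
        .withColumnRenamed("sum(amount)", "total_amount")
    )

    # 3. Write each result to the data lake in Delta format.
    write_delta(
        dim_customer, abfss_path(container, storage_account, "model/dim_customer")
    )
    write_delta(
        fact_sales, abfss_path(container, storage_account, "model/fact_sales")
    )


# In a Databricks notebook, where `spark` is predefined (placeholder values):
# build_model(spark, "main.raw.sales", "gold", "mystorageacct")
```

Writing by path produces plain Delta folders in the lake. If the results should also be queryable from Unity Catalog, they could additionally be registered as external tables pointing at those paths.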

