Re: Migrating data from Synapse to Databricks

Rishabh-Pandey · 08-17-2024

Use Azure Data Factory (ADF):

Configure an ADF pipeline with a Copy activity that moves the data out of Synapse SQL and into Azure Data Lake:
- Source Dataset: Azure Synapse SQL table.
- Sink Dataset: Azure Data Lake Storage or Azure Blob Storage.
- Data Movement Activity: the Copy Data activity in ADF.
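
As a rough illustration of the copy step above, here is a minimal sketch that builds the JSON body of such a Copy activity in Python. The activity name and the dataset names (`SynapseSalesTable`, `LakeSalesParquet`) are made up for the example, and I am assuming the sink dataset is Parquet; `SqlDWSource` and `ParquetSink` are the type names I would expect for a Synapse source and a Parquet sink, but check them against your own pipeline JSON.

```python
import json


def synapse_to_lake_copy_activity(source_dataset: str, sink_dataset: str) -> dict:
    """Build the JSON body of an ADF Copy activity (Synapse SQL -> Parquet files)."""
    return {
        "name": "CopySynapseToLake",  # hypothetical activity name
        "type": "Copy",
        # Both datasets must already exist in the data factory.
        "inputs": [{"referenceName": source_dataset, "type": "DatasetReference"}],
        "outputs": [{"referenceName": sink_dataset, "type": "DatasetReference"}],
        "typeProperties": {
            "source": {"type": "SqlDWSource"},  # assumed: Azure Synapse source
            "sink": {"type": "ParquetSink"},    # assumed: Parquet files in the lake
        },
    }


# Hypothetical dataset names, for illustration only.
activity = synapse_to_lake_copy_activity("SynapseSalesTable", "LakeSalesParquet")
print(json.dumps(activity, indent=2))
```

The printed JSON can be pasted into the `activities` array of a pipeline definition, or you can build the same thing in the ADF UI.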

Create Delta Tables in Databricks:
- Store the copied data as Delta tables in Databricks. Delta Lake provides ACID transactions and supports efficient queries.
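
A minimal sketch of this step, assuming the files landed in ADLS Gen2 as Parquet. The container, storage account, folder, and table names are placeholders, and `spark` is the session that a Databricks notebook provides.

```python
def abfss_path(container: str, account: str, folder: str) -> str:
    """Build an ABFS URI for a folder in an ADLS Gen2 storage account."""
    return f"abfss://{container}@{account}.dfs.core.windows.net/{folder.strip('/')}"


def load_parquet_into_delta(spark, source_path: str, table_name: str) -> None:
    """Read the exported Parquet files and save them as a managed Delta table."""
    df = spark.read.parquet(source_path)
    # mode("overwrite") replaces the table contents on every run.
    df.write.format("delta").mode("overwrite").saveAsTable(table_name)


# Placeholder names, for illustration only.
source = abfss_path("landing", "mystorageaccount", "/synapse_export/sales/")

# In a Databricks notebook, where `spark` is already defined:
# load_parquet_into_delta(spark, source, "main.migrated.sales")
```

Overwrite mode suits a one-time historical load; for repeated loads you would more likely append or merge.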

Automate Data Ingestion:
- Set up Databricks jobs or Delta Live Tables to automate the ingestion of data from Azure Data Lake into Delta tables.
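
One way to do this from a scheduled Databricks job is Auto Loader, which picks up only the files that arrived since the last run. That is my suggestion rather than the only option; the sketch below assumes Parquet files, and the paths and table name you pass in are your own.

```python
def auto_loader_options(schema_location: str) -> dict:
    """Reader options for Auto Loader (the `cloudFiles` streaming source)."""
    return {
        "cloudFiles.format": "parquet",                # assumed file format
        "cloudFiles.schemaLocation": schema_location,  # where the inferred schema is tracked
    }


def ingest_new_files(spark, source_path: str, table_name: str, checkpoint_path: str):
    """Append files not yet processed to a Delta table, then stop."""
    stream = (
        spark.readStream.format("cloudFiles")
        .options(**auto_loader_options(f"{checkpoint_path}/schema"))
        .load(source_path)
    )
    return (
        stream.writeStream
        .option("checkpointLocation", checkpoint_path)  # remembers which files were ingested
        .trigger(availableNow=True)                     # process the backlog and finish
        .toTable(table_name)
    )
```

Calling `ingest_new_files(spark, ...)` from a notebook task and scheduling that notebook as a job gives you incremental loads without tracking file names yourself.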

Update Data Pipeline:
1. Adjust ADF Pipelines:
  - Update your Azure Data Factory pipelines so that new data is loaded into Databricks instead of Synapse.
  - Source Dataset: Azure Data Lake Storage.
  - Sink Dataset: Databricks Delta table.
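
For the adjusted pipeline, the Copy activity would look roughly like the sketch below, again built as a Python dict. The activity and dataset names are invented, and I am assuming Parquet files in the lake and the ADF Azure Databricks Delta Lake connector on the sink side; depending on your setup that connector may also need a staging configuration, which is not shown here.

```python
import json

# Hypothetical names throughout; the datasets must exist in your data factory.
lake_to_delta_activity = {
    "name": "CopyLakeToDelta",
    "type": "Copy",
    "inputs": [{"referenceName": "LakeSalesParquet", "type": "DatasetReference"}],
    "outputs": [{"referenceName": "DatabricksSalesDelta", "type": "DatasetReference"}],
    "typeProperties": {
        "source": {"type": "ParquetSource"},               # assumed: Parquet files in ADLS
        "sink": {"type": "AzureDatabricksDeltaLakeSink"},  # assumed: Delta Lake connector sink
    },
}

print(json.dumps(lake_to_delta_activity, indent=2))
```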

Rishabh Pandey