Rishabh-Pandey
Databricks MVP

Use Azure Data Factory (ADF):

  • Configure a pipeline in ADF to copy data from Synapse SQL to Azure Data Lake. Set up an ADF Copy Activity to handle this data transfer.

  • Source Dataset: Azure Synapse SQL table.

  • Sink Dataset: Azure Data Lake Storage or Azure Blob Storage.

  • Data Movement Activity: Use the Copy Data activity in ADF.

 

Load Data into Databricks

  1. Create Delta Tables in Databricks:

    • Use Delta Lake for storing and managing data in Databricks. Delta Lake provides ACID transactions and supports efficient queries.
  2. Automate Data Ingestion:

    • Set up Databricks jobs or Delta Live Tables to automate the ingestion of data from Azure Data Lake into Delta tables.
  3. Update Data Pipeline

    1. Adjust ADF Pipelines:

      • Update your Azure Data Factory pipelines to load new data directly into Databricks instead of Synapse.

      • Source Dataset: Azure Data Lake Storage.

      • Sink Dataset: Databricks Delta table.

Rishabh Pandey