cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

Prevent Duplicate Entries to enter to delta lake Storage

User16826994223
Honored Contributor III

I have a data frame and I write this data frame to adls table, next day I get an updated data frame which has some records from the past also and i want to update the delta table without creating duplicate

1 REPLY 1

User16826994223
Honored Contributor III

This is a task for Merge command - you define condition for merge (your unique column) and then actions.

MERGE INTO target
USING  src
ON target.column = source.column
WHEN MATCHED THEN
  UPDATE SET *
WHEN NOT MATCHED
  THEN INSERT *

could be your dataframe registered as temporary view):

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.