Data Engineering

Forum Posts

Sorted by:

by pskchai • New Contributor

06-16-2023 12:42:20 AM

2241 Views
2 replies
0 kudos

Resolved! Using DLT with a non-streaming large table

We have a source table that receives daily append operations, but the rows created within the last 30 days in this table can be updated or deleted. Thus, the source table is not exactly a streaming source.Our processing workflow involves performing "...

Data Engineering

2241 Views
2 replies
0 kudos

06-16-2023 12:42:20 AM

View Replies

Latest Reply

Anonymous
Not applicable

06-17-2023 1:48:46 AM

0 kudos

Hi @Pongsakorn Chairatanakul Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please...

0 kudos

06-17-2023 1:48:46 AM

1 More Replies

by gilo12 • New Contributor III

05-12-2023 1:42:22 PM

10272 Views
3 replies
2 kudos

merge into deletes from SOURCE

I am using the following query to make an upsert:MERGE INTO my_target_table AS target USING (SELECT MAX(__my_timestamp) AS checkpoint FROM my_source_table) AS source ON target.name = 'some_name' AND target.address = 'some_address' WHEN MATCHED AN...

Data Engineering

10272 Views
3 replies
2 kudos

05-12-2023 1:42:22 PM

View Replies

Latest Reply

gilo12
New Contributor III

05-12-2023 6:46:46 PM

2 kudos

I was using a view for my_source_table, once I changed that to be a table the issue stoped.That unblocked me, but I think Databricks has a bug with using MERGE INTO from a VIEW

2 kudos

05-12-2023 6:46:46 PM

2 More Replies

by gg_047320_gg_94 • New Contributor II

05-27-2023 9:09:48 PM

8385 Views
1 replies
1 kudos

DLT Spark readstream fails on the source table which is overwritten

I am reading the source table which gets updated every day. It is usually append/merge with updates and is occasionally overwritten for other reasons. df = spark.readStream.schema(schema).format("delta").option("ignoreChanges", True).option('starting...

Data Engineering

8385 Views
1 replies
1 kudos

05-27-2023 9:09:48 PM

View Replies

Latest Reply

Debayan
Databricks Employee

06-05-2023 12:31:43 AM

1 kudos

Hi, Could you please confirm DLT and DBR versions? Also please tag @Debayan with your next response which will notify me, Thank you!

1 kudos

06-05-2023 12:31:43 AM

by weldermartins • Honored Contributor

08-19-2022 4:35:10 AM

9087 Views
9 replies
13 kudos

Resolved! Delta table upsert - databricks community

Hello guys,I'm trying to use upsert via delta lake following the documentation, but the command doesn't update or insert newlines.scenario: my source table is separated in bronze layer and updates or inserts are in silver layer.from delta.tables impo...

Data Engineering

9087 Views
9 replies
13 kudos

08-19-2022 4:35:10 AM

View Replies

Latest Reply

weldermartins
Honored Contributor

08-22-2022 11:55:40 AM

13 kudos

I managed to find the solution. In insert and update I was setting the target.tanks @Werner Stinckens !delta_df = DeltaTable.forPath(spark, 'dbfs:/mnt/silver/vendas/') delta_df.alias('target').m...

13 kudos

08-22-2022 11:55:40 AM

8 More Replies

by 577391 • New Contributor II

07-20-2022 4:58:03 PM

2699 Views
2 replies
0 kudos

Resolved! How do I merge two tables and track changes to missing rows as well as new rows

In my scenario, the new data coming in are the current, valid records. Any records that are not in the new data should be labeled as 'Gone", any matching records should be labeled with "Updated". And finally, any new records should be added.So in sum...

Data Engineering

2699 Views
2 replies
0 kudos

07-20-2022 4:58:03 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

07-25-2022 1:04:59 AM

0 kudos

Detection deletions does not work out of the box.The merge statement will evaluate the incoming data against the existing data. It will not check the existing data against the incoming data.To mark deletions, you will have to specifically update tho...

0 kudos

07-25-2022 1:04:59 AM

1 More Replies

by _Orc • New Contributor

03-02-2022 12:19:52 PM

3955 Views
2 replies
1 kudos

Resolved! Checkpoint is getting created even the though the microbatch append has failed

Use caseRead data from source table using structured spark streaming(Round the clock).Apply transformation logic etc etc and finally merge the dataframe in the target table.If there is any failure during transformation or merge ,databricks job should...

Data Engineering

3955 Views
2 replies
1 kudos

03-02-2022 12:19:52 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-12-2022 9:34:32 AM

1 kudos

Hi @Om Singh Hope you are doing well. Just wanted to check in and see if you were able to find a solution to your question?Cheers

1 kudos

04-12-2022 9:34:32 AM

1 More Replies

Databricks Community

Resolved! Using DLT with a non-streaming large table

merge into deletes from SOURCE

DLT Spark readstream fails on the source table which is overwritten

Resolved! Delta table upsert - databricks community

Resolved! How do I merge two tables and track changes to missing rows as well as new rows

Resolved! Checkpoint is getting created even the though the microbatch append has failed