You were running out of memory using .dropDuplicates on a stream because you need to specify a streaming watermark. The watermark defines a threshold for how late data is allowed to arrive before it is ignored, which lets Spark evict deduplication state older than that threshold instead of keeping it indefinitely.
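For example, a minimal PySpark sketch, assuming a Delta source table named events with an event-time column eventTime and a key column id (all names here are placeholders):

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Placeholder streaming source: a Delta table with an event-time column "eventTime"
events = spark.readStream.format("delta").table("events")

deduped = (
    events
    # Records more than 10 minutes behind the max eventTime seen so far are
    # treated as too late, so dedup state older than that can be dropped
    .withWatermark("eventTime", "10 minutes")
    # Include the watermark column in the dedup keys so the state stays bounded
    .dropDuplicates(["id", "eventTime"])
)

query = (
    deduped.writeStream
    .format("memory")  # sink choice is illustrative only
    .queryName("deduped_events")
    .outputMode("append")
    .start()
)
```

Without the withWatermark call, dropDuplicates has to keep the state for every key it has ever seen, which is what eventually exhausts memory.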
@Kaniz Fatma Hi Kaniz, can we please circle back to this? Like @Zachary Higgins, I am unsure how to set the ignoreDeletes or ignoreChanges Spark configuration for my Delta Live Tables pipeline defined in SQL. Thanks
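For context on what I'm trying to achieve: in a Python DLT pipeline, ignoreDeletes and ignoreChanges are passed as Delta streaming reader options rather than as spark.sql confs. A rough sketch of that Python equivalent, assuming a hypothetical source table source_table (I'm looking for the SQL-pipeline counterpart of this):

```python
import dlt

@dlt.table(
    name="silver_table",
    comment="Streams from a Delta source while tolerating deletes/updates upstream."
)
def silver_table():
    return (
        spark.readStream.format("delta")
        # These are reader options on the Delta stream, not spark.sql confs.
        # ignoreDeletes skips commits that only delete data; ignoreChanges
        # (a superset of ignoreDeletes) also tolerates rewritten files from
        # updates, at the cost of possibly re-emitting unchanged rows.
        .option("ignoreChanges", "true")
        .table("source_table")  # placeholder source name
    )
```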