Hello, I have a question regarding the full refresh of a DLT pipeline, where the data source is an external table.
When the pipeline runs without a full refresh, the stream pulls the data currently present in the external source table: if the table has N rows, those N rows are processed by the pipeline.
However, when running a full refresh, it seems that older data is reprocessed as well. That is, the pipeline appears to pull all data from the source table, limited by the source table's retention period but ignoring any DELETE statements executed on it.
Is my understanding correct?
Thanks in advance.