SteveOstrowski
Databricks Employee

Hi zahid6793,

Thanks for engaging with the thread. To address your question about the void datatype persisting in the Unity Catalog UI after a full refresh: this can happen when the target table's metadata in Unity Catalog has not been fully rebuilt. A full refresh of the streaming table clears the checkpoint and reprocesses the data, but if the previous table definition still carries stale column metadata, the dropped column can keep showing up with type void as a leftover.

A few things to verify after running a full refresh:

  1. Run DESCRIBE TABLE EXTENDED <catalog.schema.table> to confirm the current schema matches expectations. If the dropped column still appears as void, the metadata has not fully updated.
  2. If the stale column persists, dropping and recreating the target table will force a clean metadata rebuild. Alternatively, let the pipeline recreate it via a full refresh; this requires that the table property pipelines.reset.allowed is not set to false.
  3. Check that the source table's Delta log no longer contains the non-additive schema change in the range the stream needs to replay. The error is a read-side failure triggered by encountering that transaction in the log history, not by the current schema state.
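
As a rough sketch of check 1, the helper below scans the rows of a DESCRIBE TABLE EXTENDED result for columns reported as void. It assumes each row is a (col_name, data_type) pair and that the column listing comes before the metadata sections; find_void_columns and the sample rows are hypothetical names used only for illustration.

```python
from typing import Iterable, List, Tuple


def find_void_columns(rows: Iterable[Tuple[str, str]]) -> List[str]:
    """Return the names of columns whose reported data type is void."""
    void_cols = []
    for col_name, data_type in rows:
        # Assumption: the column listing ends at the first blank row or
        # at the first section header row (a name starting with '#').
        if not col_name or col_name.startswith("#"):
            break
        if (data_type or "").strip().lower() == "void":
            void_cols.append(col_name)
    return void_cols


# Hypothetical DESCRIBE output, reduced to (col_name, data_type) pairs.
sample_rows = [
    ("id", "bigint"),
    ("legacy_code", "void"),
    ("updated_at", "timestamp"),
    ("", ""),
    ("# Detailed Table Information", ""),
]
print(find_void_columns(sample_rows))  # ['legacy_code']
```

In a notebook, the rows could come from spark.sql("DESCRIBE TABLE EXTENDED <catalog.schema.table>").collect(), taking the col_name and data_type fields of each row. An empty result after the refresh suggests the metadata was rebuilt cleanly.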

The key distinction is that the streaming engine reads the Delta log sequentially from the checkpoint offset. Even if the current schema looks correct, the stream will fail if the log still contains the drop/rename transaction within the replay range. A full refresh resets the checkpoint, so the stream starts from the source table's current state instead of replaying from the old offset, which should bypass the problematic log entry.
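
To make the replay-range idea concrete, here is a simplified sketch that flags commit versions where a column name disappears from the schema (a drop or a rename). It assumes you have already collected (version, schemaString) pairs from the metaData actions in the source table's Delta log; non_additive_versions, make_schema, and the sample history are hypothetical, and the check only looks at top-level column names, not type changes or nested fields.

```python
import json
from typing import Dict, List, Tuple


def column_names(schema_string: str) -> List[str]:
    """Extract top-level column names from a Delta schemaString (JSON)."""
    return [field["name"] for field in json.loads(schema_string)["fields"]]


def non_additive_versions(history: List[Tuple[int, str]]) -> Dict[int, List[str]]:
    """Map commit version -> columns missing relative to the prior schema."""
    flagged: Dict[int, List[str]] = {}
    previous = None
    for version, schema_string in sorted(history, key=lambda item: item[0]):
        current = set(column_names(schema_string))
        if previous is not None:
            # A name that vanished means a drop or rename happened here.
            missing = sorted(previous - current)
            if missing:
                flagged[version] = missing
        previous = current
    return flagged


def make_schema(*names: str) -> str:
    """Build a minimal schemaString for the illustration below."""
    fields = [
        {"name": n, "type": "string", "nullable": True, "metadata": {}}
        for n in names
    ]
    return json.dumps({"type": "struct", "fields": fields})


# Hypothetical history: version 9 drops legacy_code.
history = [
    (0, make_schema("id", "legacy_code")),
    (5, make_schema("id", "legacy_code", "note")),
    (9, make_schema("id", "note")),
]
print(non_additive_versions(history))  # {9: ['legacy_code']}
```

If the stream's checkpoint offset is below a flagged version, the replay has to pass through that transaction, which matches the failure described above even when the current schema looks fine.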

Sources: