The underlying data format is Parquet. Parquet files are immutable, so if one of them has to be changed, a new version of that file is written (and the old one is kept for a while). The transaction log keeps track of which file is the current one.
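To make that concrete, here is a rough sketch in plain Python of how replaying a log of add/remove actions yields the set of current files. It is a toy model, not the real Delta reader: the file names are made up, and an actual log entry carries many more fields (sizes, stats, timestamps, metadata) than the bare `path` used here.

```python
def current_files(commits):
    """Replay commits in order and return the set of files that are live."""
    live = set()
    for commit in commits:
        for action in commit:
            if "add" in action:
                live.add(action["add"]["path"])
            elif "remove" in action:
                # The file is only dropped from the live set here;
                # nothing is deleted from storage at this point.
                live.discard(action["remove"]["path"])
    return live


# Commit 0: the initial write creates two Parquet files (hypothetical names).
commit_0 = [
    {"add": {"path": "part-0000.parquet"}},
    {"add": {"path": "part-0001.parquet"}},
]

# Commit 1: an update touches rows in part-0001. The file cannot be edited
# in place, so a rewritten copy is added and the old one is marked removed.
commit_1 = [
    {"remove": {"path": "part-0001.parquet"}},
    {"add": {"path": "part-0002.parquet"}},
]

log = [commit_0, commit_1]

print(sorted(current_files(log)))      # ['part-0000.parquet', 'part-0002.parquet']
print(sorted(current_files(log[:1])))  # ['part-0000.parquet', 'part-0001.parquet']
```

Replaying only the first commit gives the table as it looked before the update, which is why keeping the old file around for a while is useful.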
Just a friendly follow-up. Did any of the responses help resolve your question? If so, please mark it as best. Otherwise, please let us know if you still need help.