Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
the underlying data format is parquet. parquet is immutable so if one of the parquet files has to be changed, a new version of that file is created (and the old one is kept for a while). The transaction log keeps track on what file is the current one.
the underlying data format is parquet. parquet is immutable so if one of the parquet files has to be changed, a new version of that file is created (and the old one is kept for a while). The transaction log keeps track on what file is the current one.
Just a friendly follow-up. Did any of the responses help you to resolve your question? if it did, please mark it as best. Otherwise, please let us know if you still need help.
Connect with Databricks Users in Your Area
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโt want to miss the chance to attend and share knowledge.
If there isnโt a group near you, start one and help create a community that brings people together.