Data Engineering

How to set file size for MERGE

Brad
Contributor II

Hi team,

I use MERGE to merge a source into a target table. The source is an incremental read, with a checkpoint, from a Delta table. The target is a Delta table without any partitions. When the target is empty, with spark.databricks.delta.optimizeWrite.enabled set, the write creates files of around 128 MB each. However, the next run produces many small files. Is there a way to control the file size after MERGE?

Thanks

1 REPLY

filipniziol
Contributor III

Hi @Brad ,

There are a couple of considerations here, the main ones being your runtime version and whether you are using Unity Catalog.

Check this document:
https://docs.databricks.com/en/delta/tune-file-size.html
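
As a rough sketch of the kind of settings that page covers, the snippet below builds the SQL for setting a target file size, enabling optimized writes and auto compaction on the table, and compacting the small files that already exist. The table name `main.default.target_table`, the `128mb` value, and the helper function are placeholders for illustration only. Which of these settings actually apply depends on your runtime version and whether the table is in Unity Catalog, so treat it as a starting point rather than a guaranteed fix.

```python
def file_size_tuning_statements(table_name: str, target_file_size: str = "128mb") -> list[str]:
    """Build SQL statements that ask Delta to aim for a given file size.

    Hypothetical helper: the table name and size are placeholders.
    The property names are Delta table properties on Databricks.
    """
    return [
        # Best-effort target size used by OPTIMIZE, auto compaction,
        # and optimized writes.
        f"ALTER TABLE {table_name} SET TBLPROPERTIES "
        f"('delta.targetFileSize' = '{target_file_size}')",
        # Turn on optimized writes and auto compaction for this table.
        f"ALTER TABLE {table_name} SET TBLPROPERTIES ("
        "'delta.autoOptimize.optimizeWrite' = 'true', "
        "'delta.autoOptimize.autoCompact' = 'true')",
        # Compact the small files left behind by earlier MERGE runs.
        f"OPTIMIZE {table_name}",
    ]


statements = file_size_tuning_statements("main.default.target_table")
for stmt in statements:
    # On a Databricks cluster you would run each one with spark.sql(stmt).
    print(stmt)
```

Setting the properties on the table, rather than only in the session config, means every job that writes to the table picks them up. Running OPTIMIZE after the MERGE (or on a schedule) handles the files that were already written small.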

