cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Which file size is better 1 GB file size in target or 128 MB or lesser than that

User16826994223
Honored Contributor III

Which file size is better 1 GB file size in target or 128 MB or lesser than that , I am interested in knowing concept too.

1 REPLY 1

sajith_appukutt
Honored Contributor II

If data is getting appended primarily to the delta table and read ratio is higher than writes ratio - larger file sizes ( 1GB) would be ideal.

However, if your delta table undergoes frequent upserts/merges, having smaller files than the default 1GB can improve MERGE performance as only smaller amounts of data would have to be rewritten.

Check out file size auto tuning for MERGE as well

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group