cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Delta table partition folder names is getting changed

SakthiGanesh
New Contributor II

I am facing an issue where the expected date partition folder should be named in format like "campaign_created_date=2024-01-17", but instead it is writing as "ad", "8B" looks like a random folder names.

Usually it will be like below:
SakthiGanesh_0-1751013736357.png

Now it changed like below:

SakthiGanesh_1-1751013840570.png

When I searched on net I got some info like if MinWorkerVersion is greater than 2 will make this variation in folder name. But it is not clear for me. 

Can anyone let me know why this is happening suddenly? I would like to know how the MinWorkerVersion changes will affect the folder names and its Pros and Cons. What is the reason behind it.



 

2 REPLIES 2

UmaMahesh1
Honored Contributor III

Can you attach the delta logs or check the partition column using desc extended once.

To me it looks like the data is partitioned on a diff column.

Uma Mahesh D

Krishnamatta
Contributor

Hi Satish,

This is due to the column mapping enabled on the table

From Databricks Docs:

When you enable column mapping for a Delta table, random prefixes replace column names in partition directories for Hive-style partitioning. See Rename and drop columns with Delta Lake column mapping.

Reference:

https://docs.databricks.com/gcp/en/tables/partitions#do-delta-lake-and-parquet-share-partitioning-st...

 

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now