Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Inserting data into a Delta table in Databricks creates unexpected folders in Dev

NandaKishoreI
New Contributor II

We have a Delta table in Databricks. When we insert data into the table, it creates folders in the storage account with names such as 05, 0H, 0F, 0O, 1T, 1W, and so on, and writes the Parquet files there.

We have not defined any partitions. We are inserting data using the following code:

df.write \
    .format('delta') \
    .mode('append') \
    .option("mergeSchema", "true") \
    .option("ignoreLeadingWhiteSpace", "false") \
    .option("ignoreTrailingWhiteSpace", "false") \
    .save(prepped_path)
 
The Table Properties are:
delta.columnMapping.mode=name
delta.minReaderVersion=2
delta.minWriterVersion=6
delta.columnMapping.maxColumnId=18
1 ACCEPTED SOLUTION

Accepted Solutions

Kaniz_Fatma
Community Manager

Hi @NandaKishoreI

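  • The folders you’re observing are not partitions, and they do not correspond to transaction log versions or checkpoints. The transaction log and its checkpoints live in the _delta_log directory at the table root; the two-character folders contain only Parquet data files.
  • The most likely cause is the table property delta.columnMapping.mode=name. With column mapping enabled, Delta writes data files under short random alphanumeric prefixes instead of directly under the table root. The names are not hexadecimal: H, O, T, and W are not hex digits.
  • This layout is expected and does not affect reads. Queries and time travel locate data files through the paths recorded in the transaction log, not through the folder names, so the files should not be moved or renamed by hand. If another environment does not show these folders, compare the table properties there.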
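One way to check what these folders hold is to read the add actions in the table's _delta_log commit files, which record each data file's path relative to the table root. The following is a minimal sketch in plain Python, not a Databricks API: the function name data_file_folders and the sample commit lines are made up for illustration, and real commit files contain more fields than shown here.

```python
import json
from collections import Counter


def data_file_folders(commit_lines):
    """Count data files per top-level folder, based on `add` actions.

    `commit_lines` is an iterable of JSON strings, one action per line,
    as found in a Delta commit file under _delta_log.
    """
    folders = Counter()
    for line in commit_lines:
        line = line.strip()
        if not line:
            continue
        action = json.loads(line)
        add = action.get("add")
        if add is None:
            continue  # skip commitInfo, metaData, protocol, remove, etc.
        path = add["path"]
        # Paths are relative to the table root; no "/" means the file
        # sits directly in the root rather than in a prefix folder.
        folder = path.split("/", 1)[0] if "/" in path else "(table root)"
        folders[folder] += 1
    return folders


# Hypothetical sample lines imitating one commit file; the file names
# and sizes are invented for this example.
sample_commit = [
    '{"commitInfo": {"operation": "WRITE"}}',
    '{"add": {"path": "0H/part-00000-aaaa.snappy.parquet", "size": 1024, "dataChange": true}}',
    '{"add": {"path": "1T/part-00001-bbbb.snappy.parquet", "size": 2048, "dataChange": true}}',
    '{"add": {"path": "0H/part-00002-cccc.snappy.parquet", "size": 512, "dataChange": true}}',
]

print(dict(data_file_folders(sample_commit)))  # {'0H': 2, '1T': 1}
```

To try this against a real table, the lines could come from a commit file copied locally, for example by passing an open file handle to data_file_folders. If every folder name in the storage account shows up as a prefix in add paths, the folders are ordinary data file locations tracked by the log.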


2 REPLIES 2

Rishabh_Tiwari
Community Manager

Hi @NandaKishoreI ,

Thank you for reaching out to our community! We're here to help you. 

To ensure we provide you with the best support, could you please take a moment to review the response and choose the one that best answers your question? Your feedback not only helps us assist you better but also benefits other community members who may have similar questions in the future.

If you found the answer helpful, consider giving it a kudo. If the response fully addresses your question, please mark it as the accepted solution. This will help us close the thread and ensure your question is resolved.

We appreciate your participation and are here to assist you further if you need it!

Thanks,

Rishabh
