cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Hi Experts : I am new to Databricks please help me on below. Question : How is delta table stored in DBFS ?

BasavarajAngadi
Contributor

If I create delta table the table is stored in parque format in DBFS location ?

and please share how the parque files supports schema evolution if i do DML operation.

As per my understanding : we read data from data lake first in data frame and try to write the dataframe to delta tables and these delta tables when created as tables are stored in parque format ??

1 ACCEPTED SOLUTION

Accepted Solutions

-werners-
Esteemed Contributor III

delta lake is parquet on steroids. The actual data is stored in parquet files, but you get a bunch of extra functionalities (time traveling, ACID, optimized writes, MERGE etc).

check this page for lots of info.

Delta lake does support schema evolution but it has it´s limitations.

About the location: they will indeed be stored in dbfs, but dbfs is a 'virtual' layer on top of your actual storage. So if you mount your data lake into dbfs, dbfs will contain your data (it will link it, not copy it).

View solution in original post

1 REPLY 1

-werners-
Esteemed Contributor III

delta lake is parquet on steroids. The actual data is stored in parquet files, but you get a bunch of extra functionalities (time traveling, ACID, optimized writes, MERGE etc).

check this page for lots of info.

Delta lake does support schema evolution but it has it´s limitations.

About the location: they will indeed be stored in dbfs, but dbfs is a 'virtual' layer on top of your actual storage. So if you mount your data lake into dbfs, dbfs will contain your data (it will link it, not copy it).

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group