Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

When does Delta Lake actually compute a table?

zeta_load
New Contributor II

Maybe I'm completely wrong, but my understanding is that Delta Lake only computes a table at certain points, for instance when you display your data. Before that point, operations are only written to the log file and are not executed (meaning no changes are made to your data). If that assumption is correct, how can I force an actual computation?

Please correct me if I misunderstood anything!

1 ACCEPTED SOLUTION

Accepted Solutions

Lakshay
Databricks Employee

Hi @Lukas Goldschmied, operations are executed at the moment you run them against the Delta table. To elaborate: when you run an insert, a new data file in Parquet format is created containing the new records, and the operation metrics are recorded in the Delta log in JSON format. When you then run a select on the table, it uses the information in the Delta log to collect the records from the data files and display them.
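
To make the sequence above concrete, here is a minimal toy model in plain Python. It is not Delta Lake's implementation and does not use its API: the function names `insert` and `select_all` are hypothetical, JSON-lines files stand in for Parquet, and each commit is simplified to a single JSON object. It only illustrates the order of events described in the answer: the data file is written as soon as the insert runs, and the read consults the log to find which files to load.

```python
import json
import tempfile
from pathlib import Path


def insert(table_dir: Path, rows: list) -> None:
    """Toy insert: write a data file immediately, then record it in the log."""
    log_dir = table_dir / "_delta_log"
    log_dir.mkdir(parents=True, exist_ok=True)
    version = len(list(log_dir.glob("*.json")))  # next commit number

    # 1. The data file is written right away. Real Delta Lake writes Parquet;
    #    JSON lines is only a standard-library stand-in (assumption).
    data_file = f"part-{version:05d}.jsonl"
    (table_dir / data_file).write_text("\n".join(json.dumps(r) for r in rows))

    # 2. A JSON commit records the added file and the operation metrics.
    #    The layout is simplified for illustration.
    commit = {
        "add": {"path": data_file},
        "commitInfo": {
            "operation": "WRITE",
            "operationMetrics": {"numOutputRows": len(rows)},
        },
    }
    (log_dir / f"{version:020d}.json").write_text(json.dumps(commit))


def select_all(table_dir: Path) -> list:
    """Toy select: replay the log to find the data files, then read them."""
    rows = []
    for commit_file in sorted((table_dir / "_delta_log").glob("*.json")):
        commit = json.loads(commit_file.read_text())
        data_path = table_dir / commit["add"]["path"]
        rows += [json.loads(line) for line in data_path.read_text().splitlines()]
    return rows


# Hypothetical table location in a temporary directory.
table = Path(tempfile.mkdtemp()) / "events"
insert(table, [{"id": 1}, {"id": 2}])  # data file and commit exist after this call
insert(table, [{"id": 3}])
print(select_all(table))
```

In this sketch nothing is deferred until read time: after each `insert` call both the data file and its log entry already exist on disk, and `select_all` only resolves which files make up the table.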


2 REPLIES

Anonymous
Not applicable

Hi @Lukas Goldschmied

Hope all is well! Just wanted to check in: were you able to resolve your issue? If so, would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help.

We'd love to hear from you.

Thanks!
