cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

I read that Delta supports concurrent writes to separate partitions of the table but I'm getting an error when doing so

aladda
Databricks Employee
Databricks Employee

Iโ€™m running 3 separate dbt processes in parallel. all of them are reading data from different databrick databases, creating different staging tables by using dbt alias, but they all at the end update/insert to the same target table. the 3 processes run well except in the last step. sometimes one of the process fails in the last step and sometimes 2 of them fail. 

1 ACCEPTED SOLUTION

Accepted Solutions

aladda
Databricks Employee
Databricks Employee

Youโ€™re likely running into the issue described here and a solution to it as well. While Delta does support concurrent writers to separate partitions of a table, depending on your query structure join/filter/where in particular, there may still be a need to scan the entire table. Solution typically is to have explicit filtering on the partition columns (which youโ€™d also be using in your joins).

View solution in original post

1 REPLY 1

aladda
Databricks Employee
Databricks Employee

Youโ€™re likely running into the issue described here and a solution to it as well. While Delta does support concurrent writers to separate partitions of a table, depending on your query structure join/filter/where in particular, there may still be a need to scan the entire table. Solution typically is to have explicit filtering on the partition columns (which youโ€™d also be using in your joins).

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local communityโ€”sign up today to get started!

Sign Up Now