<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic I read that Delta supports concurrent writes to separate partitions of the table but I'm getting an error when doing so in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/i-read-that-delta-supports-concurrent-writes-to-separate/m-p/22040#M15058</link>
    <description>&lt;P&gt;I’m running three separate dbt processes in parallel. Each reads from a different Databricks database and creates its own staging tables using dbt aliases, but all of them finish by updating/inserting into the same target table. The three processes run fine until that last step, where sometimes one of them fails and sometimes two of them fail.&lt;/P&gt;</description>
    <pubDate>Mon, 21 Jun 2021 20:09:36 GMT</pubDate>
    <dc:creator>aladda</dc:creator>
    <dc:date>2021-06-21T20:09:36Z</dc:date>
    <item>
      <title>I read that Delta supports concurrent writes to separate partitions of the table but I'm getting an error when doing so</title>
      <link>https://community.databricks.com/t5/data-engineering/i-read-that-delta-supports-concurrent-writes-to-separate/m-p/22040#M15058</link>
      <description>&lt;P&gt;I’m running three separate dbt processes in parallel. Each reads from a different Databricks database and creates its own staging tables using dbt aliases, but all of them finish by updating/inserting into the same target table. The three processes run fine until that last step, where sometimes one of them fails and sometimes two of them fail.&lt;/P&gt;</description>
      <pubDate>Mon, 21 Jun 2021 20:09:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/i-read-that-delta-supports-concurrent-writes-to-separate/m-p/22040#M15058</guid>
      <dc:creator>aladda</dc:creator>
      <dc:date>2021-06-21T20:09:36Z</dc:date>
    </item>
    <item>
      <title>Re: I read that Delta supports concurrent writes to separate partitions of the table but I'm getting an error when doing so</title>
      <link>https://community.databricks.com/t5/data-engineering/i-read-that-delta-supports-concurrent-writes-to-separate/m-p/22041#M15059</link>
      <description>&lt;P&gt;You’re likely running into the ConcurrentAppendException issue described &lt;A href="https://docs.databricks.com/delta/concurrency-control.html#concurrentappendexception" alt="https://docs.databricks.com/delta/concurrency-control.html#concurrentappendexception" target="_blank"&gt;here&lt;/A&gt;; that page covers a solution as well. While Delta does support concurrent writers to separate partitions of a table, depending on your query structure (joins, filters, and WHERE clauses in particular), the operation may still need to scan the entire table, in which case it can conflict with the other writers. The typical solution is to filter explicitly on the partition columns (which you’d also be using in your joins).&lt;/P&gt;</description>
      <pubDate>Mon, 21 Jun 2021 20:10:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/i-read-that-delta-supports-concurrent-writes-to-separate/m-p/22041#M15059</guid>
      <dc:creator>aladda</dc:creator>
      <dc:date>2021-06-21T20:10:01Z</dc:date>
    </item>
  </channel>
</rss>

