<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: write operation to the Delta table is not completing. in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/write-operation-to-the-delta-table-is-not-completing/m-p/92970#M38597</link>
    <description>&lt;P&gt;&lt;SPAN&gt;Thank you for your advice. I couldn't come to a conclusion based on what you told me, but it gave me an opportunity to review all the logs again. The cause was that the amount of data became too large due to joining between null data. The advice was very helpful as it allowed me to reanalyze the cause. thank you&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 07 Oct 2024 13:41:19 GMT</pubDate>
    <dc:creator>borori</dc:creator>
    <dc:date>2024-10-07T13:41:19Z</dc:date>
    <item>
      <title>write operation to the Delta table is not completing.</title>
      <link>https://community.databricks.com/t5/data-engineering/write-operation-to-the-delta-table-is-not-completing/m-p/89500#M37829</link>
      <description>&lt;P&gt;Using a cluster in serverless mode, three tables are joined and the data frame is written as follows&lt;/P&gt;&lt;P&gt;df.write.mode('append').saveAsTable('table name')&lt;/P&gt;&lt;P&gt;and shema is below&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;date string (ymd format)&lt;/LI&gt;&lt;LI&gt;id bigint&lt;/LI&gt;&lt;LI&gt;value string&lt;/LI&gt;&lt;LI&gt;partition by date&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;After about one minute of execution as a job, the profiler stops progressing and does not return any response.&lt;/P&gt;&lt;P&gt;When I canceled the job, the progress of the profiler was updated with progress, and the tree looked as if it stopped writing to the delta table.&lt;/P&gt;&lt;P&gt;When I reduced the amount of data, it succeeded, but when I rerun the job, the same problem occurs. The same problem occurs when the table is re-created and re-run.&lt;/P&gt;&lt;P&gt;Also, I comment out the write process and modify the display to show the result, the result comes back in about 1 minute.&lt;BR /&gt;The select seems to work fine, but only the write is causing this problem.&lt;/P&gt;&lt;P&gt;What kind of analysis should i do?&lt;/P&gt;</description>
      <pubDate>Wed, 11 Sep 2024 15:52:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/write-operation-to-the-delta-table-is-not-completing/m-p/89500#M37829</guid>
      <dc:creator>borori</dc:creator>
      <dc:date>2024-09-11T15:52:38Z</dc:date>
    </item>
    <item>
      <title>Re: write operation to the Delta table is not completing.</title>
      <link>https://community.databricks.com/t5/data-engineering/write-operation-to-the-delta-table-is-not-completing/m-p/90914#M38028</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/120336"&gt;@borori&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;How are you doing today?&lt;/P&gt;&lt;P&gt;As per my understanding,&amp;nbsp;Consider &lt;STRONG&gt;checking the cluster's resource limits&lt;/STRONG&gt; in serverless mode to ensure it's not hitting any memory or I/O constraints. You might also want to &lt;STRONG&gt;repartition the DataFrame&lt;/STRONG&gt; based on the date column before writing to balance the load across partitions. It could be helpful to &lt;STRONG&gt;examine Delta logs&lt;/STRONG&gt; to see if they provide any insights on the issue during the write process. Also, &lt;STRONG&gt;review your partitioning strategy&lt;/STRONG&gt;—too many or too few partitions can affect performance. Lastly, try adjusting the &lt;STRONG&gt;job's parallelism settings&lt;/STRONG&gt; by tuning parameters like spark.sql.shuffle.partitions to improve the write performance.&lt;/P&gt;&lt;P&gt;Give a try and let me know if it works.&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Brahma&lt;/P&gt;</description>
      <pubDate>Wed, 18 Sep 2024 14:30:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/write-operation-to-the-delta-table-is-not-completing/m-p/90914#M38028</guid>
      <dc:creator>Brahmareddy</dc:creator>
      <dc:date>2024-09-18T14:30:58Z</dc:date>
    </item>
    <item>
      <title>Re: write operation to the Delta table is not completing.</title>
      <link>https://community.databricks.com/t5/data-engineering/write-operation-to-the-delta-table-is-not-completing/m-p/92970#M38597</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Thank you for your advice. I couldn't come to a conclusion based on what you told me, but it gave me an opportunity to review all the logs again. The cause was that the amount of data became too large due to joining between null data. The advice was very helpful as it allowed me to reanalyze the cause. thank you&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Oct 2024 13:41:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/write-operation-to-the-delta-table-is-not-completing/m-p/92970#M38597</guid>
      <dc:creator>borori</dc:creator>
      <dc:date>2024-10-07T13:41:19Z</dc:date>
    </item>
  </channel>
</rss>

